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From the Editor 


Marketing (JM). My term of editorship began on July 

2005 and continues through June 2008, though the first 
issue for which I am responsible is January 2006. The arti- 
cles from the July 2005 and October 2005 issues were all 
accepted by the previous editor, Ruth Bolton, who has done 
a masterful job in building on the illustrious editorial tradi- 
tion that had been established over the years by her prede- 
cessors. I am very grateful to Ruth and to all the previous 
editors for helping make JM the journal it is today., , 


I is a great honor to assume the editorship of Journal of 


Current Status 


The oldest of the four major journals in marketing, JM 
enjoys a prestigious position as the best-cited marketing 
journal (in two of the last three Institute for Scientific Infor- 
mation citation impact studies), the most selective market- 
ing journal (acceptance rate of 11%), and the marketing 
journal that receives the most submitted manuscripts (345 
in calendar year 2004). In addition, JM has ranked first 
(Baumgartner and Pieters 2003), second (Theoharakis and 
Hirst 2002), and first (Hult, Neese, and Bashaw 1997) in the 
three most recent surveys of the leading marketing journals. 
It is clear that JM enjoys an enviable position among mar- 
keting's most elite academic journals. My charge as editor 
is to do my utmost to maintain and, hopefully, to enhance 
the journal's premier status. 


The Positioning of JM 


Traditionally, JM has been positioned as the broadest, the 
most readable, and the closest to management practice of 
the four leading marketing journals. Its two greatest 
strengths are typified by its two most long-standing article 
awards: the Harold H. Maynard Award for marketing theory 
and the Marketing Science Institute/H. Paul Root Award 
(formerly the Alpha Kappa Psi Award) for advancing the 
practice of marketing. Substantively, JM spans the entire 
field of marketing, so its domain overlaps considerably with 
that of the other major marketing journals. 

It is important that JM maintains its historical strengths. 
These include publishing the best articles that advance mar- 
keting theory and the best articles that are relevant to mar- 
keting practice. During my editorship, JM will reinforce its 
historical strengths while maintaining as broad a position- 
ing as possible. Essentially, JM attempts to publish the best 
substantive articles in marketing. As such, the best articles 
that are currently published in Journal of Marketing 
Research and Marketing Science that are not primarily 
focused on methodology would be welcome in JM. Like- 
wise, the best Journal of Consumer Research articles that 
are relevant to marketing would also be welcome in JM. 
Ultimately, JM will continue to be substantive and problem 
driven. It will strive to publish the best work in marketing of 
any form or style as long as it has relevance to marketing 
theory or marketing practice. 
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This positioning implies that JM will not compromise 
with respect to sophistication. Authors will be encouraged 
to use the simplest methods necessary to address their sub- 
ject, but if more sophisticated methods are required, such an 
approach should not preclude publication. The use of 
sophisticated methods for their own sake will not be 
encouraged, but theoretical or methodological sophistica- 
tion will be encouraged when they help address a substan- 
tive topic. 


Breadth of Coverage 


I believe that all articles in JM should have something sub- 
stantive to say to one of the primary stakeholders of market- 
ing (marketing managers, consumers, or public policy mak- 
ers). Aside from this requirement, JM will be open to any 
topic and any research approach. I especially want to 
emphasize that JM will be receptive to all methodological 
approaches, including (but not limited to) survey research, 
experimental designs, analytical models, ethnographic 
research, structural equation models, econometric analysis, 
and conceptual analysis. All research approaches have 
strengths and weaknesses, and each submitted paper will be 
evaluated on its unique merits. ` 


Publishing Truly Important Articles 


I have found in my own research that, in general, my most 
innovative articles have had the toughest time in the review 
process. Those same articles, once published, have had the 
greatest impact. As editor, I will go out of my way to be 
sympathetic to truly innovative papers. I will give such 
papers extra consideration. On the margin, I would rather 
accept an innovative paper that is controversial than an 
incremental paper that is perfectly done but unsurprising. I 
will give the importance of the problem addressed consider- 
able weight in the publication decision. To achieve this, on 
occasion, I will be willing to overrule a consensus negative 
evaluation by the reviewers to give an innovative paper an 
extra chance. I strongly encourage authors to submit work 
that has an opportunity to shake things up and change the 
thinking in the field. 


An international Journal 


The economy is global, and marketing talent is spread 
around the world. During the seven years that I served as 
editor of Journal of Service Research, more than half of the 
papers that I accepted for publication had at least one non- 
U.S. author. Likewise, non-U.S. authors will have a fair 
opportunity to publish in JM. That is not to say that publi- 
cation will be easy—recall that the journal’s current accep- 
tance rate is 11%—but I will guarantee that there will be no 
bias against non-U.S. authors or non-U.S. data. I have also 
increased non-U.S. participation on the Editorial Review 
Board. 
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Connecting to the Business 
Community 


Traditionally, JM has been the leading outlet for managerial 
research among the leading marketing journals, and I wish 
to maintain JM’s strength in this area. I also want to encour- 
age the submission of articles by practitioners. Because the 
standard of research sophistication at JM is high, coauthor- 
ship with academics is often the easiest way for a practi- 
tioner to create a publishable article, though some practi- 
tioner authors with a prior record of publication in 
academic journals may well be able to produce a publish- 
able article on their own. 


The Editorial Review Board 


I have appointed an Editorial Review Board of 90 experts, a 
slightly smaller board than in recent years. Most of the 
members of the Editorial Review Board are holdovers from 
Ruth Bolton’s board. To attempt to strengthen and broaden 
the board, I have altered its composition in a few important 
ways. First, I have added new board members from a cross- 
section of the marketing field and have particularly empha- 
sized adding people with a broad point of view and an 
active, recent research record. Second, I have increased the 
percentage of non-U.S. board members to the highest per- 
centage at any of the four leading marketing journals. Third, 
to increase the connection to the business world, I have 
appointed five new members from the business community, 
all of whom have significant publication experience. 


The Review Process 


I will assign two to four reviewers to each manuscript and 
attempt to have the reviewers represent different points of 
view. At Journal of Service Research, I maintained an aver- 
age turnaround time for new manuscripts of approximately 
40 days, which I believe was the fastest journal turnaround 
of any marketing-related journal. I will work hard to main- 
tain a culture of prompt reviews at JM as well. If one of the 
assigned reviewers is slow, I may sometimes proceed to a 
decision based on two reviewers rather than wait for all 
reviews to arrive. I will replace habitually tardy review 
board members. With most papers, I intend to make a deci- 
sion in two rounds of review, but the most innovative papers 
may require more rounds because, by definition, such 
papers break new ground and thus tend to be more difficult. 


Conflicts of Interest 


To address potential conflicts of interest, I am formalizing 
some policies. In most cases, these policies have already 
been informally maintained at JM by previous editors. First, 
a guest editor (chosen from the former editors of JM) will 
make decisions about any manuscripts submitted by the edi- 
tor’s current faculty colleagues, current or former doctoral 
students, and current business associates. Second, for all 
award voting conducted by the Editorial Review Board, vot- 
ers will be prohibited from voting for their own articles or 
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for article by colleagues from the same university or com- 
pany, and the editor's articles will not be eligible for any 
award during the editor's term if it is selected by the Edito- 
rial Review Board that the editor has appointed. 


Administration of the Journal 

The American Marketing Association (AMA) is instituting 
a new process for administering the journal. Under this 
process, the editorial assistant for the journal, Marti Tiede- 
man, is located at the AMA offices in Chicago rather than at 
the editor's university. In addition, I have hired a local edi- 
torial assistant, Millicent Locke, for the College Park office. 
Because all communication is now electronic and con- 
ducted through the Internet, this new procedure is antici- 
pated to have minimal negative impact. In fact, the editor 
and editorial assistant have been in separate locations ever 
since the previous editor, Ruth Bolton, moved to Arizona 
state University. 


Strengthening the JM Web Site 


The AMA is adding value to its Web site overall, and the 
same will be done with the JM Web site, which is part of the 
AMA site. Because of the added emphasis on the Web site, 
I have appointed a Web site editor, Siva Balasubramanian, 
to plan, manage, and edit the site. Value will be added to the 
site by migrating several features that previously appeared 
in the hard copy of the journal. For example, the “Book 
Review" section, which will be edited by the new book 
review editor, Ramarao Desiraju, will now appear on the 
Web site along with each article's summary and author 
biography and the Marketing Literature Review. Additional 
features will be added over time. 


Summary 
Journal of Marketing has a proud history and reputation, 
and I wish to maintain the journal's strengths and work to 
make an already fine journal even better. During my editor- 
ship, JM will maintain its broad positioning and will wel- 
come research of all styles and topics, as long as they are 
pertinent to marketing theory or marketing practice. My 
goal is to make JM every author's first-choice outlet for 


substantive research in marketing. 
ROLAND T. RUST 
Editor 





REFERENCES 

Baumgartner, Hans and Rik Pieters (2003), “The Structural Influ- 
ence of Marketing Journals: A Citation Analysis of the Disci- 
pline and Its Subareas Over Time,” Journal of Marketing, 67 
(April), 123-39. 

Hult, G. Tomas M., William T. Neese, and R. Edward Bashaw 
(1997), “Faculty Perceptions of Marketing Journals,” Journal 
of Marketing Education, 19 (1), 37-52. 

Theoharakis, Vasilis and Andrew Hirst (2002), “Perceptual Differ- 
ences of Marketing Journals: A Worldwide Perspective,” Mar- 
keting Letters, 13 (4), 389-402. 


Claes Fornell, Sunil Mithas, Forrest V. Morgeson Ill, & M.S. Krishnan 


Customer Satisfaction and Stock 
Prices: High Returns, Low Risk 


Do investments in customer satisfaction lead to excess returns? If so, are these returns associated with higher 
stock market risk? The empirical evidence presented in this article suggests that the answer to the first question is 
yes, but equally remarkable, the answer to the second question is no, suggesting that satisfied customers are eco- 
nomic assets with high returns/low risk. Although these results demonstrate stock market imperfections with 
respect to the time it takes for share prices to adjust, they are consistent with previous studies in marketing in that 
a firm's satisfied customers are likely to improve both the level and the stability of net cash flows. The implication, 
implausible as it may seem in other contexts, is high return/low risk. Specifically, the authors find that customer sat- 
isfaction, as measured by the American Customer Satisfaction Index (ACSI), is significantly related to market value 
of equity. Yet news about ACSI results does not move share prices. This apparent inconsistency is the catalyst for 
examining whether excess stock returns might be generated as a result. The authors present two stock portfolios: 
The first is a paper portfolio that is back tested, and the second is an actual case. At low systematic risk, both out- 
perform the market by considerable margins. In other words, it is possible to beat the market consistently by invest- 


ing in firms that do well on the ACSI. 


oes the satisfaction of a company's customers have 

anything to do with the company's stock price? In a 

way, it must. Otherwise, it would be necessary to go 
back to the drawing board of how capitalistic market theory 
is supposed to work. The empirical literature on the nature 
of the relationship is growing, but it is still in its infancy in 
many respects. Empirical results (Anderson 1996; Ander- 
son, Fornell, and Lehmann 1994; Anderson, Fornell, and 
Mazvancheryl 2004; Bolton 1998; Fornell 2001; Ittner and 
Larcker 1996, 1998; Rust, Moorman, and Dickson 2002) 
point toward a significant relationship between customer 
satisfaction and economic performance in general, but less 
is known about how the satisfaction of companies' cus- 
tomers translates into securities pricing and investment 
returns, and virtually nothing is known about the associated 
risks. The tacit link between buyer utility and the allocation 
of investment capital is a fundamental principle on which 
the economic system of free market capitalism rests. The 
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degree to which capital flows from investors actually move 
in tandem with consumer utility is a matter of significant 
importance because it is an indication of how well (or 
poorly) markets truly work. A parallel, but narrower, issue 
is the extent to which equity markets themselves are effi- 
cient. The assertion in efficient markets theory is that finan- 
cial prices accurately reflect all public information at all 
times, but financial assets can be correctly priced in equity 
markets even if consumer sovereignty is restricted in prod- 
uct markets. In a monopoly, for example, dissatisfied buyers 
cannot punish the seller by taking their business elsewhere, 
and capital flows are not likely to correspond with con- 
sumer utility, but that does not in itself imply that capital 
markets are inefficient. However, efficient allocation of 
resources 1n the overall economy and consumer sovereignty 
depend on the joint ability of product and capital markets to 
reward and punish companies such that firms that fail to sat- 
isfy customers are doubly punished by both customer 
defection and capital withdrawal. Similarly, firms that do 
well by their customers would be doubly rewarded by more 
business from customers and more capital from investors. 
We begin by estimating the relationship between cus- 
tomer satisfaction and market value of equity. As expected, 
we find a strong relationship. This prompts the question, 
How do investors react to news about changes in customer 
satisfaction? Despite the significant relationship between 
customer satisfaction and market value, we find no evidence 
that they react in a timely manner. Is this because the news 
is somehow factored into share prices already, or is it due to 
market imperfections? The evidence suggests the latter. 
Two stock portfolios, one paper strategy that is back tested 
and another describing an actual case, lead to similar con- 
clusions: It is possible to beat the market consistently with 
investment decisions based on customer satisfaction. Per- 
haps even more remarkable is that these results are not 
associated with a risk premium. Indeed, the opposite is true; 
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the systematic risk is low relative to the market. Although 
these results may appear too good to be true and puzzling 


from a conventional financial perspective, we present argu-- 


ments suggesting that they are in line with empirical find- 
ings both in marketing and in the literature on customer 
satisfaction. 


Customer Satisfaction: Economic 
Effects 


We now turn to the fundamentals behind tbe relationship 
between customer satisfaction and stock prices. Both basic 
neoclassical economics and marketing theory provide a 
general case for a positive association. Conversely, several 
market factors may contribute to a negative relationship. In 
addition to market factors, there are other circumstances in 
which the association could actually be negative. We dis- 
cuss both categories and empirically analyze stock market 
reaction to quarterly news releases from the American Cus- 
tomer Satisfaction Index (ACSI) (Fornell et al. 1996). By 
controlling for other firm-specific news, we test the effect 
of news announcements about customer satisfaction on 
stock prices. 

Real economic growth depends on the productivity of 
economic resources and the quality of the output (as experi- 
enced by the user) that those resources generate. In the final 
analysis, expanding economic activity per se is not what is 
essential. Both marketing and neoclassical economics view 
consumer utility, or satisfaction, as the real standard for 
economic growth. The extent to which buyers financially 
reward sellers that satisfy them and punish those that do not 
and the degree to which investment capital reinforces the 
power of the consumer are fundamental to how markets 
function. A well-functioning market allocates resources, 
including capital, to create the greatest possible consumer 
satisfaction as efficiently as possible. A dissatisfied buyer 
will not remain a customer unless there is nowhere else to 
go or it is too expensive to get there. 

Restrictions on consumer choice may be good for the 
monopolist but, in general, are considered harmful to the 
economy. In a competitive marketplace that offers meaning- 
ful consumer choice alternatives, firms that do well by their 
customers are rewarded by repeat business, lower price 
elasticity, higher reservation prices, more cross-selling 
opportunities, greater marketing efficiency, and a host of 
other things that usually lead to earnings growth (Fornell et 
al. 1996). 

To what degree do markets actually reflect the norma- 
tive tenets of market theory? Under what circumstances 
would news about increasing customer satisfaction con- 
tribute to higher stock prices? Under what circumstances 
would it have the opposite effect? From a cursory glance at 
the literature, it is tempting to assume that news about rising 
customer satisfaction would have an immediate and positive 
effect on stock prices. Beginning with the early contribu- 
tions of Bursk (1966) and Jackson (1985), there is substan- 
tial conceptual logic and empirical evidence to suggest that 
the health of a firm’s customer relationships is a relevant 
indicator of firm performance (Ambler et al. 2002; Bell et 
al. 2002; Berger et al. 2002; Blattberg and Deighton 1996; 
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Bolton, Lemon, and Verhoef 2004; Fornell 1995, 2001; For- 
nell et al. 1996; Hogan, Lemon, and Rust 2002; Rust et al. 
2004). For example, it has been found that customer satis- 
faction has a negative impact on customer complaints and a 
positive impact on customer loyalty and usage behavior 
(Bolton 1998; Fornell 1992). Increased customer loyalty 
may increase usage levels (Bolton, Kannan, and Bramlett 
2000), secure future revenues (Rust and Keiningham 1994; 
Rust, Moorman, and Dickson 2002), reduce the cost of 
future transactions (Reichheld and Sasser 1990), lower 
price elasticity (Anderson 1996), and minimize the likeli- 
hood of customer defection (Anderson and Sullivan 1993; 
Mithas, Jones, and Mitchell 2004). Customer satisfaction 
may also reduce costs related to warranties, complaints, 
defective goods, and field service costs (Anderson, Fornell, 
and Lehmann 1994; Fornell 1992; Garvin 1988). Empirical 
evidence also suggests that customer perceptions of supe- 
rior quality are associated with higher economic returns 
(Aaker and Jacobson 1994; Capon, Farley, and Hoenig 
1990; Fornell 2001). Naumann and Hoisington (2001) 
report positive associations among employee satisfaction, 
customer satisfaction, market share, and productivity mea- 
sures at IBM Rochester. Several case-based research studies 
also find that customer satisfaction is positively associated 
with employee loyalty, cost competitiveness, profitable per- 
formance, and long-term growth (Heskett, Sasser, and 
Schlesinger 1997; Reichheld and Teal 1996). Finally, in a 
recent study of the relationship between customer satisfac- 
tion and shareholder return, Anderson, Fornell, and Maz- 
vancheryl (2004) find a strong relationship between cus- 
tomer satisfaction and Tobin’s q (as a measure of 
shareholder value) after controlling for fixed, random, and 
unobservable factors. 

If, as the empirical evidence suggests, customer satis- 
faction tends to improve repeat business, usage levels, 
future revenues, positive word of mouth, reservation prices, 
market share, productivity, cross-buying, cost competitive- 
ness, and long-term growth and if it tends to reduce cus- 
tomer complaints, transaction costs, price elasticity, war- 
ranty costs, field service costs, defective goods, customer 
defection, and employee turnover, it seems logical to expect 
that these effects will eventually affect stock prices and 
company valuations. Even if a substantial portion of the 
empirical findings were dismissed,! it would be difficult not 
to take seriously the notion of customer satisfaction as a 
real, albeit intangible, economic asset. Putting the argument 
in the context of economic assets allows for a logical link to 
conventional capital asset pricing (see Fornell and Werner- 
felt’s [1987, 1988] economic analysis of the value of keep- 
ing customers and the more recent work by Wayland and 


lFor example, in a study of a catalog retailer, Reinartz and 
Kumar (2000, 2002) find a weaker-than-expected (but significant) 
relationship between customer retention and profitability. The 
strength of any customer retention-profitability relationship 
depends on the cost of creating and maintaining repeat customers. 
If repeat business is created through price discounts or other 
means that do not cause an upward shift in the firm's demand 
curve, the relationship will be weaker. Thus, repeat business pro- 
duced by higher customer satisfaction will be more profitable in 
general than repeat business generated by price discounts. 


Cole [1997], Srivastava, Shervani, and Fahey [1998], Gruca 
and Rego [2005], Gupta, Lehmann, and Stuart [2004], 
Anderson, Fornell, and Mazvancheryl [2004], Hogan, 
Lemon, and Rust [2002], Venkatesan and Kumar [2004], 
and Rust, Lemon, and Narayandas [2005]). Srivastava, 
Shervani, and Fahey (1998) provide the conceptual logic for 
more categorically drawing the link from the empirical 
findings on customer satisfaction to stock returns and share- 
holder value. They identify four major determinants of a 
company’s market value: (1) acceleration of cash flows, (2) 
increase in cash flows, (3) reduction of risk associated with 
cash flows, and (4) increase in the residual value of the 
business. Not only does the customer asset, as defined by 
the expected future discounted net cash flow from current 
and future customers, influence all four determinants, but it 
can also be expressed as the sum of all other economic 
assets of the firm (Wayland and Cole 1997); Gupta, 
Lehmann, and Stuart (2004) also demonstrate this. 

First, the acceleration of cash flows is affected by the 
speed of buyer response to marketing efforts. Because it 
takes less effort to persuade a satisfied customer (Keller 
1993) and because it is more likely that receivables turnover 
is better for firms with satisfied customers, speed of cash 
flow is positively affected. Second, there is also a relation- 
ship between customer satisfaction and levels of net cash 
flows. Marginal costs of sales and marketing are lower, as 
are the requirements for working capital and fixed invest- 
ments (Srivastava, Shervani, and Fahey 1998). In addition, 
revenue growth benefits from more repeat business. Gruca 
and Rego (2005) provide empirical evidence for this. They 
show that increases in customer satisfaction lead to signifi- 
cant cash flow growth. Specifically, one point in customer 
satisfaction (measured by ACSI on a 0—100 scale) was asso- 
ciated with a 7% increase in cash flow. Third, Gruca and 
Rego also find that the risk associated with future cash 
flows is reduced for firms with high customer satisfaction. 
If the variability in cash flows is reduced, the cost of capital 
goes down as well, thus producing yet another source for 
stock price growth. Finally, the residual value of business is 
a function of the size, loyalty, and quality of the customer 
base, all of which are obviously related to the satisfaction of 
this customer base (Srivastava, Shervani, and Fahey 1998). 


Customer Satisfaction and Market 
Value of Equity 
In light of the evidence reported in the literature, it might be 
expected that a significant relationship exists between mar- 
ket value of equity and ACSI. If customer satisfaction truly 
is an economic asset that is not only left off the balance 
sheet but also not fully reflected by the recorded assets, 
there should be a relationship between ACSI scores and 
market capitalization. Following standard practice (Barth 
and McNichols 1994; Ittner and Larcker 1998; Landsman 
1986), the model we define in Equation 1 estimates the 
effect of an identifiable, but intangible, asset on market cap- 
italization in a combined longitudinal cross-sectional set- 
ting, while controlling for accounting book values. In this 
case, we are interested in (1) whether customer satisfaction 
(ACSI) has a significant effect on market value of equity 


and (2) the size of this effect when we control for the influ- 
ence of the recorded assets and liabilities. Thus: 


(1)  InMVE= qQ + B, InBVA + B; InBVL + Ba InACSI, 


where MVE is the market value of equity, BVA is the book 
value of total assets, BVL is the book value of liabilities, 
and ACSI is the respective company score on the American 
Customer Satisfaction Index. 

We obtained the data from 1994—2002 ACSI scores and 
COMPUSTAT company book values for the same years; 
there were a total of 601 observations. The mean value for 
market value is $26 billion. For book value of assets and 
liabilities, the respective means are $23 billion and $15 
billion. 

Table 1 reports the results of fitting Equation 1 by ordi- 
nary least squares. The regression is highly significant with 
an R-square of .70. The ACSI coefficient is significant and 
large. A 1% change in ACSI is associated with a 4.6% 
change in market value. The coefficients for assets and lia- 
bilities are also significant with the expected signs. Table 1 
also includes estimates without the ACSI variable: R-square 
drops to .61, and the negative coefficient for liabilities 
increases. Thus, by omutting customer satisfaction in the 
estimation of company market value, the negative impact of 
liabilities appears to be greater than it should be. This is 
also consistent with the previous discussion that customer 
satisfaction alleviates cash flow volatility and reduces the 
cost of capital. 


Investor Reaction 


We offer the preceding results as empirical evidence in 
favor of the argument that customer satisfaction should be 
considered an economic asset beyond what is recognized on 
the balance sheet. The results are also consistent with the 
finding that customer satisfaction is associated with long- 
term shareholder value. However, they do not address 


TABLE 1 
The Effect of ACSI on Market Value of Equity 
Ln(Market — Ln(Market 
Value) Value) 
ACSI (InACSI) hr 4.592 
(.000) 
Total assets (InBVA) Bo 1.943 2.039 
( 000) (.000) 
Total liabilities (InBVL) ^ s —1.020 —1.157 
(.003) (.003) 
Constant a —20 056 .235 
(.000) (.704) 
N 601 601 
Re 70 .61 
F (.000) (.000) 





Notes: We estimated the models by including dummy variables (not 
shown) for years 1995 through 2002 (1994 was the refer- 
ence year); p values are in parentheses. 
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whether ACSI scores provide information to investors (that 
is not already known to them). How then do investors react 
to news about customer satisfaction? Do they take advan- 
tage of customer satisfaction as a leading indicator of finan- 
cial performance, or do they fail to realize the potential of 
the information? Is the information already factored in 
share prices? If, indeed, customer satisfaction is relevant to 
market valuation, according to efficient markets theory, new 
information about customer satisfaction fluctuations should 
be instantly reflected in stock prices. That is, all else being 
equal, investors should reward firms with increased market 
capitalization as news about the improved value of firms’ 
customer relationships becomes available. Conversely, news 
about deteriorating customer relationships should have the 
opposite effect. 

Such reasoning is in concert with financial valuation 
theory in that customer relationships can be expressed in 
terms of net present value (Blattberg and Deighton 1996; 
Hogan et al. 2002; Rust and Keiningham 1994; Venkatesan 
and Kumar 2004). In competitive markets, the value of sat- 
isfied customers should be worth more than the value of 
dissatisfied customers. 

However, the picture is more complicated. There are 
reasons for suggesting that investors may not react in a pre- 
dictable fashion, and there are circumstances in which stock 
prices drop as a result of news about increasing customer 
satisfaction. Although such a reaction might be at odds with 
how capitalistic markets should function, consumer market 
imperfections (which may be unknown) make it difficult to 
anticipate capital market reactions to customer satisfaction 
news. First, investors may react negatively to news about 
rising customer satisfaction if they believe that the firm is 
giving away too much surplus to the buyers. The aggregate 
surplus that consumers receive from goods and services is a 
measure of consumer welfare, but the difference between 
the maximum price a buyer is willing to pay and the actual 
price is also something that investors may want to capture. 
Investors would be more likely to consider an increase in 
consumer surplus in a negative light if the buyer's switching 
costs were significant, if there was a high degree of product 
differentiation, or if there was some level of monopoly 
power. 

Second, investors may also take an unkind view of 
improvements in customer satisfaction for firms that 
already have substantial leads over their competition. If cus- 
tomers of competing firms were much less satisfied, the 
marginal return of improving customer satisfaction further 
might be called into question. Certainly, it would be diffi- 
cult to ignore economic laws of diminishing returns for the 
investment in any economic asset, regardless of its status on 
the balance sheet. 

Third, and related to the preceding two points, the mar- 
ginal cost of improving customer satisfaction may be too 
high. This is particularly relevant for services, especially 
those that are labor intensive. For example, it has been 
shown (Anderson, Fornell, and Rust 1997) that productivity 
and customer satisfaction are not always compatible in the 
service sector. Improved customer satisfaction may come at 
the expense of productivity, and vice versa. Such a trade-off 
between productivity and customer satisfaction may also 
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contribute to unfavorable market reactions to news about 
customer satisfaction improvements. 

Fourth, there is the possibility of a "reverse causality 
effect.” Customer defection can have a positive effect on 
average customer satisfaction simply because the departing 
customers were the most dissatisfied. Customers who 
remain are less likely to be as discontented. Instead of ben- 
efiting from improved satisfaction, the company simply 
retains a smaller group of (somewhat more satisfied) cus- 
tomers, but often with reduced sales and profits as a result. 
Note that this effect is not the result of consumer market 
imperfections. Indeed, it is more likely to occur in fluid 
markets with low switching barriers and no monopoly 
power. 

Fifth, issues of timing and expectations complicate the 
matter further. The ACSI measures each company once a 
year. Although the measurement is cumulative in the sense 
that it aims to capture all relevant customer experience 
(with the product or service) to date, it is possible for 
changes in customer satisfaction to occur and to have an 
effect on buyer behavior (with implications on revenues and 
profits) in periods between measurements. It is also difficult 
to ascertain market expectations about customer satisfac- 
tion. A firm’s improved ACSI scores may be viewed as neg- 
ative if the firm was expected to show a greater degree of 
improvement, and vice versa. Sources of such expectations 
may be difficult to identify; they may include corporate 
announcements about additional investments in consumer 
service, the hiring of more frontline personnel, investments 
in information technology and customer relationship man- 
agement systems (Krishnan et al. 1999; Mithas, Krishnan, 
and Fornell 2005a, b; Prahalad, Krishnan, and Mithas 2002; 
Rai, Patnayakuni, and Patnayakuni 2006; Sambamurthy, 
Bharadwaj, and Grover 2003; Srinivasan and Moorman 
2005), and a host of things that may affect customer 
satisfaction. 

For all these reasons, it is not easy to predict stock mar- 
ket reactions to news about customer satisfaction for indi- 
vidual companies. It would not be enough to appeal to cap- 
italistic market theory without consideration of an 
assortment of market conditions. Even if the relevant char- 
acteristics were observed and properly assessed, the effect 
may be confounded by the shape of the satisfaction 
response curve, by the firm’s location on the curve, or by 
the reverse causation phenomenon. Thus, even if investors 
do not react to news about customer satisfaction, share 
prices may still be unbiased. Only if it is possible to use 
ACSI information to discover mispriced stocks consistently 
over time and invest accordingly (and thus beat the market) 
can the question about possible pricing bias be addressed. 
Accordingly, we conduct both an event study and portfolio 
studies. We begin with the event study. 

We estimate the rate of return on stock price of firm j on 
day t with the following market model: 


(2) Ry = 05 + p, Rmt + 8j 

where Ry is the rate of return on the common stock of the 
jth firm on day t, Ry, is the market rate of return using the 
Standard & Poor’s (S&P) 500 composite index on day t, Q6 
is an intercept, D, is a slope parameter that measures the 


sensitivity of Rit to the market index, and £r is a disturbance 
term with the usual ordinary least squares properties. The 
S&P 500 is a capitalization weighted index based on a 
broad cross-section of the market and has been used in sev- 
eral previous event studies (MacKinlay 1997). 

We use the market model (Equation 2) to estimate the 
abnormal return for the common stock of firm j on day t, 
such that 


(3) AR, = Ry = (œ + Bj Rmo. 


On the basis of recommended guidelines and commonly 
used practice, we specified the length of the estimation 
period as 255 days, ending 46 days before the event date 
(Cowan 2003; McWilliams and Siegel 1997). 

We averaged daily abnormal returns using the closing 
price at the day of the release over the sample of N firms to 
yield cumulative abnormal returns: 


N R 
2,2, ^R, 
(4) CAR, 7, ==, 


where T; and T, correspond to the first and last days of the 
event period. 

Because the sampling distribution of abnormal returns 
tends to be skewed and leptokurtic (Brown and Warner 
1985; Corrado 1989; Lyon, Barber, and Tsai 1999; 
MeWilliams and Siegel 1997), we used bootstrapping for 
all significance tests. Use of bootstrapped test statistics pro- 
vide a more robust assessment of the statistical significance 
of the results of the event study and is becoming increas- 
ingly common in academic studies (Chatterjee, Richardson, 
and Zmud 2001). 

It is well known that event studies benefit considerably 
from a precise definition of the event period (Barclay and 
Litzenberger 1988; Brown and Warner 1985; Dyckman, 
Philbrick, and Jens 1984). As for the ACSI, there is an exact 
announcement day. Thus, we specify a one-day event 
period. A short event period leads to more efficient estima- 
tion because it reduces the possibility of other factors 
affecting the returns. It also increases the power of the sta- 
tistical tests. 

As we indicated previously, we defined the event as the 
ACSI announcement for a firm as (simultaneously) pub- 
lished in The Wall Street Journal and on the ACSI Web site. 
Because the The Wall Street Journal is a daily publication, it 
seems reasonable to consider the day the ACSI announce- 
ment is published in The Wall Street Journal as the day 
when “the event" occurs. The included events range from 
the second quarter of 1999 to the third quarter of 2002.2 

Following recommended practice (MacKinlay 1997), 
we set up controls so that the customer satisfaction news 


2Although ACSI has measured customer satisfaction since 
1994, before the second quarter of 1999, the results were pub- 
lished once a year in Fortune magazine, making it difficult to pin- 
point the event date because readers received the magazine on dif- 
ferent dates. For example, Ittner and Larcker’s (1998) study used a 
much longer event window (five and ten days) in their analysis of 
the effect of the Fortune publication 


would not be confounded with other news items over the 
same period. We collected all firm-related news items five 
calendar days before and after the event date from leading 
sources: PR Newswire, Dow Jones, and Business Wire. We 
excluded from the analysis events with announcements 
regarding mergers and acquisitions, spin-offs, stock splits, 
chief executive officer and chief financial officer changes, 
layoffs, restructurings, earnings announcements, and law- 
suits. In total, after we eliminated the events with con- 
founded news items during the event windows, the data 
consisted of 161 events for 89 companies for which stock- 
trading data were available for the study period. For manu- 
facturing firms, there were 48 events of rising ACSI scores; 
for service firms, the corresponding number was 32. There 
were 4] events with decline in ACSI for manufacturing and 
40 for services. The results appear in Table 2. 

Although the combined samples for manufacturing and 
services changed in the direction of higher stock prices for 
firms with an increase in ACSI and lower prices for firms 
with a decrease in ACSI, these effects are too weak to be 
distinguished from chance variation. The results for service 
firms are also contradictory; that is, share prices increased 
regardless of the direction of the change in ACSI. 

These results are fairly consistent with the findings of 
Ittner and Larcker (1998). They used a longer event window 
of five days and found no significant effects of ACSI news. 
For a ten-day window, they reported close to significant (at 
the 10% level) abnormal returns but only after they deleted 
outliers (still 97%—98% of the variance in the abnormal 
returns remained unexplained). Note that Ittner and Larcker 
were forced to use cross-sectional analysis (because of the 
limited amount of data available at the time), and thus their 
results cannot be used to test the effect of ACSI on changes 
in stock prices (Lambert 1998).3 


3Although it seems unlikely that the ACSI information would 
trickle out slowly or that investors would simply wait before acting 
on the information, we also calculated cumulative abnormal 
returns for a 5-day window following the release date. Again, the 
null hypothesis of zero abnormal returns cannot be rejected. A 
final test checks for the possibility of prior leakage of information. 
The ACSI results were routinely provided under embargo to the 
public relations and market research units of corporate subscribers 


TABLE 2 
Abnormal Returns For ACSI News 
Announcements 
Event Day 
Type of Abnormal 
News Firm Type N Returns 
Increase Manufactunng 80 34 
in ACSI and services 
Manufactunng 48 .01 
Services 32 .85 
Decrease Manufactunng 81 —.17 
in ACSI and services 
Manufacturing 41 —.54 
Services 40 .20 


Notes: None of the abnormal returns are significant at the 5% level 
based on nonparametric bootstrapped tests (two talled) 
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... Portfolio Studies 


The finding that news about customer satisfaction does not 
move stock prices, even though customer satisfaction is sta- 
tistically significant with respect to market valuation, is not 
necessarily contradictory to efficient markets theory. It 
would still be possible that stock prices fully and quickly 
reflect all available information. It is also possible that firms 
with highly satisfied customers generate higher profits. 
Under efficient markets theory, however, it would not be 
possible to develop trading strategies based on ACSI infor- 
mation that consistently earn excess returns. 

As a way to test this, we consider two stock portfolios 
based on ACSI data. The first is a hypothetical paper portfo- 
lio with simple trading rules. The second is a real-world 
portfolio. Both tests have pros and cons. However, their 
combination leads to more powerful analyses. The major 
advantage of the paper portfolio is that the trading rules are 
clear and explicit. In addition to there being no theoretical 
basis for simulating various permutations of the trading 
strategy, its major disadvantage is that it is hypothetical and 
back tested with perfect hindsight. 

For the real-world portfolio, these advantages and dis- 
advantages are reversed. Actual trading involves many 
details regarding timing, size of investments, number of 
holdings, rebalancing, and so forth, that are not always pre- 
cisely determined by static and explicit decision rules. In 
many ways, the situation is similar to case studies of orga- 
nizations for which theoretical specificity must give way to 
realism. Yet this is also the benefit of a real-world portfolio: 
It presents authentic investments and actual results. 

Most important, however, paper portfolio and the actual 
portfolio share a critical aspect: Both rely solely on ACSI 
data. No other type of data or additional information was 
considered in any of the investment decisions. Although 
details may vary between the portfolios, and assuming an 
absence of contemporaneous underlying share price deter- 
minants that covary with ACSI (we discuss this further sub- 
sequently), it would be difficult to attribute the results to 
sources other than customer satisfaction. 

Because the market does not immediately respond to 
changes in ACSI, it seems reasonable to base the strategy 
on both levels (because they do not appear to be fully 
impounded in the market price) and changes (because they 
do not appear at all to be impounded in the market price). 
To have a diversified portfolio of reasonable size, we 
selected firms in the top 20% of ACSI (relative to their 
competition). In line with Jones and Sasser (1995) and 
Ittner and Larcker (1998), who find that low levels of cus- 
tomer satisfaction appear to have less of an impact, we also 
made the selection conditional on the requirement of being 
above the ACSI national average. 

Because ACSI scores for each individual firm are 
updated once a year, we purchased shares (using the closing 
price) the day the ACSI results were announced and held 
them for a year or longer. If we picked a stock the first year, 


and to The Wall Street Journal about two weeks before the release. 
That corresponds to 10 additional trading days before the event. 
Again, the results are not significant. The results of 5-day and 15- 
day analyses appear in the Appendix. 


8 / Journal of Marketing, January 2006 


we examined its inclusion in the portfolio again the follow- 
ing year. If it met the criteria of being in the top 2096 and 
above the national (average) ACSI level, we held it for 
another year and then subjected it to the same test. If it 
failed the criteria, we sold it. We applied the same principle 
for all stocks. If we did not pick a stock the first year, we 
reexamined it for inclusion the next year, and so on. This 
trading strategy led to a portfolio of 20 companies in 1997 
and 20 in 1998, and there were 26 companies at the conclu- 
sion of the test on May 21, 2003. 

Between February 18, 1997, and May 21, 2003, a 
period when the stock market had both ups and downs, the 
portfolio generated a cumulative return of 4096 (with divi- 
dends and transaction costs excluded but after stock splits, 
if any, were adjusted). It outperformed the Dow Jones 
Industrial Average (DJIA) by 93%, the S&P 500 by 201%, 
and NASDAQ by 335%. Figures 1—3 show the cumulative 
returns over time. 

The results suggest that customer satisfaction pays off 
in up-markets and down-markets. When the stock market 
grew, the stock prices of many firms with highly satisfied 
customers grew even more. The only exception occurred at 
the peak of the stock market bubble in 1999, when NAS- 
DAQ and the S&P 500 generated short-lived, but higher, 
returns. When the stock market dropped in value, the stock 
prices of firms with highly satisfied customers seemed to 
have benefited from some degree of insulation. 

Although the trading rules barred all information 
beyond ACSI, could there be competing explanations for 
the findings? We examine the possibilities. First, and 
regardless of whether efficient markets theory holds, ıt is 
difficult to beat the market. Most professional stock pickers 
do not, and the chance probability of consistent excess 
returns is minute. Second, there are theoretical underpin- 
nings behind these returns that are bolstered by empirical 
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findings in the marketing literature. Third, the abnormal 
returns are not due to compensation for risk. The beta risk 
associated with the portfolio is .78 and thus is substantially 
less than market.^ Fourth, there is no known ACSI covariate 
that accounts for the results. Firms are included in ACSI on 
the basis of size (Fornell et al. 1996), and the ACSI aggre- 
gate should approximately mirror large cap indexes (e.g., 
DJIA). There is no reason to suspect that the top 20% of 
ACSI firms would have the same performance as the bot- 
tom 80%. To verify this empirically, we collected stock 
price data for all ACSI firms. The cumulative return for the 
bottom 80% portfolio was 20.4%, which is virtually identi- 
cal to that of the DJIA (21%), thus ruling out selection cri- 
teria as a covariate of the stock performance. Because book- 
to-market ratios are sometimes mentioned as possible 
predictors of long-term abnormal stock returns (Fama and 
French 1992), we compared these ratios as well. For the top 
20%, the book-to-market ratio was .41 for the full time 
period. The corresponding ratio for the bottom 80% was 
42. Finally, we examined the “size effect.” It has been 
shown that smaller firms’ stocks tend to produce higher 
returns (Banz 1981; Bernard and Thomas 1989; Reinganum 
1981). Are the top 20% of ACSI firms smaller than the bot- 
tom 80%? Consistent with the results in Table 1, the oppo- 
site is true. The average revenue for the top 20% of firms is 
$37.8 billion, compared with $26.4 billion for the bottom 
80%. 

Overall, these results are noteworthy because they sug- 
gest a trading strategy with high returns and low risk, based 
on theory and findings not in finance or economics but 


4The beta values are relative to the S&P 500 over the entire life 
of the portfolio. On a year-by-year basis, the betas are .72 (1997), 
.75 (1998), .74 (1999), .75 (2000), .76 (2001), .81 (2002), and .93 
(2003). 


FIGURE 3 
Cumulative Returns: ACSI Top 20% Versus 
NASDAQ (1997-—2003)2 
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aFebruary 18, 1997, through May 21, 2003. 


rather in the marketing literature. Because of the magnitude 
of the abnormal returns, the findings are perhaps even more 
remarkable considering that most brokerage firms had neg- 
ative abnormal returns on their stock recommendations 
between 1997 and 2001 (McGeehan 2001). Nonetheless, a 
word of caution is warranted. The results refer to a paper 
portfolio that is back tested. As with all such testing, there 
are obvious limitations. In hindsight, it may not be that dif- 
ficult to find a successful trading strategy. In defense of the 
results, however, we note that the trading strategy used was 
extraordinarily simple and straightforward. 

However, not all individual trades were good ones. 
Gateway computers is a case in point. Stock was purchased 
on August 21, 2000, because the company had an ACSI 
score of 78, which was in the top 20% of its industry and 
above the national average. Stock was sold on August 20, 
2001, because the company dropped out of the top 20%. 
Over that period of time, the company’s stock price fell by 
84% because consumer demand for personal computers 
declined sharply. As we discussed previously, there may be 
many reasons that the trading strategy may not work for 
each individual company all the time. In addition, there is 
no reason to believe that high customer satisfaction will 
provide full stock market insulation if primary consumer 
demand falters. It may dampen the fall to some degree, but 
industry demand is affected by many other factors. How- 
ever, from a diversified portfolio, negative industry distur- 
bances should be counterbalanced by positive ones. To 
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some extent, the large losses made on Gateway in one 
period were compensated by a 47% gain in another period 
(1999-2000), as well as Dell Computer's return of 50% in 
1998-1999 and the very large rise of the Wal-Mart (96% in 
1998-1999) and Southwest Airlines (96% in 1998—1999) 
stocks. 

Our final analysis goes from hypothetical paper trading 
to an actual stock portfolio of ACSI companies. Thus, it 
avoids the usual limitations of back testing and provides 
additional evidence about the strength of the relationship 
between customer satisfaction and stock price. As before, 
we considered no other data but ACSI data in the invest- 
ment decisions. We took both long and short positions and 
based trading decisions on ACSI levels, changes, and satis- 
faction elasticity of demand.° The portfolio took long posi- 
tions in firms with high (above competition) and increasing 
(by two points or more) ACSI scores and also when the 
ACSI-customer retention (elasticity) estimate was above 
average. It took short positions when scores were low 
(lower than any competitor) and deteriorating (by two 
points or more). The short positions were generally small 
and ranged over time from 096 to 2096 of the portfolio. We 
held these positions for a one-year period or more. We bal- 
anced the portfolio weighting with approximately the same 
dollar amount for each company at the starting point but 
also with consideration given to industry representation.® 
We periodically rebalanced to dollar equalize the positions, 
and we executed trades throughout the year without atten- 
tion to the date of the ACSI release." We reinvested divi- 
dends and included transaction costs. Trading began in 
April 2000, at a time just before the burst of the stock bub- 
ble from which the market, as of the end of 2004, has not 
yet recovered. 

The portfolio generated impressive positive returns. As 
we show in Figure 4, it outperformed the S&P 500 each and 
every year during the five years since its inception, often by 
a considerable margin. The smallest gain relative to market 
was in 2000 when the S&P dropped by 1296 and the ACSI 
portfolio dropped by 896.8 In the following year, the portfo- 
lio gained 6%, whereas the S&P continued to fall by 13%. 


?For discussions on this type of elasticity, see Fornell (1992) 
and Anderson, Fornell, and Mazvancheryl (2004). 

6The ACSI includes many utilities. To compensate, each utility 
in the portfolio received only half of the dollar amount invested in 
nonutihty companies. 

TAccordingly, some trades were executed subsequent to the 
ACSI release date (approximately 7096), and some were executed 
before that date. Because there was no announcement event effect, 
the positive returns were not helped by this timing. On the con- 
trary, they were somewhat handicapped. The cumulative return for 
the portfolio for all the event days (using the closing price of the 
day of the ACSI release), through February 2003, was —1.696. The 
DJIA and S&P 500 did somewhat better at +.5% and 4.696, 
respectively All trades subsequent to February 2003 were exe- 
cuted after the ACSI release. 

8The 2000 results do not represent the full year. They reflect the 
starting date of the portfolio (April 11, 2000). 
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With the exception of 2002, when the S&P fell by 23% and 
the ACSI portfolio fell by 5%, there have been large gains: 
36% in 2003 (versus 26% for the S&P) and 32% in 2004 
(versus 9% for the S&P). On a cumulative basis (Figure 5), 


FIGURE 4 
Yearly Performance of ACSI Portfolio and S&P 500 
(2000-2004) 
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FIGURE 5 
ACSI Portfollo Versus DJIA and S&P 500 
(2000—2004)4 
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the ACSI portfolio gained 75%, compared with a loss of 
19% for the S&P 500. The DJIA did slightly better at 4%.? 

As with the previous portfolio study, these returns are 
not associated with higher risk. The betas for the real-world 
portfolio are equivalent to the paper portfolio. The mean 
beta risk for all years was .77 relative to the S&P and .76 
relative to the DJIA. On an annual basis, the beta risk of the 
portfolio was consistently below market.!° 

Similar to the hypothetical paper version, share prices 
for firms that did well on ACSI rose faster than the market 
and dropped at a slower rate. However, short selling pro- 
duced weaker returns (less than 396 of the total), implying 
asymmetric impacts from high versus low customer satis- 
faction. However, it would be premature to conclude on the 
basis of this finding alone that the market rewards high cus- 
tomer satisfaction more than it punishes low satisfaction. 
Additional studies would be necessary to draw such a 
conclusion. 


Discussion 


By any standard, the results of the studies we report herein 
are extraordinary. Not only do they show that investments 
based on customer satisfaction produce sizable excess 
returns, but they also upset the basic financial principle that 
assets producing high returns carry high risk. The price of 
any financial asset is determined by the current value of the 
future cash payments it generates, discounted to compen- 
sate for risk and cost of capital. The economic value of sat- 
isfied customers seems to be systematically undervalued, 
even though these customers generate substantia] net cash 
flows with low volatility. Firms that do better than their 
competition in terms of satisfying customers (as measured 
. by ACSI) generate superior returns at lower systematic risk. 
Similar security mispricings, but on a much lower scale, 
have been found in related situations with respect to quality 
awards announcements (Hendricks and Singhal 2001) and 
marketing investments (Penman and Zhang 2001), but most 
discoveries of security mispricings point to “overpricing” 
(Shiller 2000). Under the assumption of efficient markets 
theory, if information about customer satisfaction is rele- 
vant to a company's future economic performance, it should 
be factored into the company's stock price such that it 
would not be possible to earn economic profits by trading 
on this information. Yet we show that this is indeed possi- 
ble. However, this does not imply a wholesale rejection of 
efficient markets theory. There have been many documented 
anomalies over the years (e.g., the January effect, the small- 


9Because the portfolio includes reinvested dividends and trans- 
action costs, the comparison with market indexes is not exactly 
"apples to apples," but the gains relative to market are much 
greater than the net effects from dividends and transaction costs. 
For example, with dividends and transaction costs included, the 
average diversified stock portfolio rose only 1% according to Lip- 
per Research over the same time period. 

lO The year-to-year ACSI portfolio betas relative to the DJIA are 
.87 (2000), .67 (2001), .74 (2002), .82 (2002), and .84 (2004). The 
year-to-year ACSI portfolio betas relative to the S&P 500 are .83 
(2000), .73 (2001), .72 (2002), .82 (2003), and .87 (2004). 


firm effect, the Monday effect, the post-earnings- 
announcement drift). Most of them have since disappeared. 
Whether that will be the case with the customer satisfaction 
effect is yet to be determined, but it is less likely because, 
with the exception of the public release of ACSI and con- 
trary to the January effect and others, the information is not 
costless and requires more sophisticated measurement tech- 
nology than firms typically use today (Ittner and Larcker 
2003). Because investments based on customer satisfaction 
may not be publicly known, ıt would also be much more 
difficult to verify a possible demise. According to Gupta, 
Lehmann, and Stuart (2004), financial analysts have yet to 
give more than scant attention to off-balance-sheet assets, 
even though these assets may be key determinants of a 
firm's market value. By using a discounted cash flow analy- 
sis for estimating the value of customer relationships, 
Gupta, Lehmann, and Stuart find that some companies were 
potentially mispriced, whereas others were not. 

Taken as a whole, the conclusion from these analyses is 
that though firms with highly satished customers usually 
generate positive abnormal returns, news about changes in 
customer satisfaction does not have an effect on stock 
prices. In other words, the primary thesis behind capitalist 
markets and marketing theory is borne out, but not without 
reservations about efficient (equity) markets theory. Specitf- 
ically, there seem to be imperfections with respect to the 
time it takes for stock markets to reward firms that do well 
by their customers and to punish firms that do not. In the 
wake of accounting scandals, the bursting of the stock mar- 
ket bubble, and the continued weakening of the relationship 
between balance sheet assets and future income, it would be 
in the interest of securities research to pay closer attention 
to customer satisfaction and the strength of customer rela- 
tionships. For marketing managers, it is clear that the cost 
of managing customer relationships and the cash flows they 
produce is fundamental to value creation. 

The principal short-term impact of the findings may be 
to boost security analysts’ demand for customer satisfaction 
information. If history is a guide, however, it will be diffi- 
cult for analysts to separate the “good” from the "bad" and 
the relevant from the irrelevant when it comes to this type 
of information. This will be a serious challenge because 
recent experience with valuing intangible assets did not turn 
out well. Just before the stock market bubble burst, Wall 
Street seemed enamored with customer metrics, most of 
which had dubious connections to economic value creation 
and large losses in shareholder wealth as a result. Thus, a 
more prudent approach might be expected this time, but 
appropriate prudence does not suggest eschewing customer 
metrics altogether, which more or less seems to be the case 
for securities research today.!! Rather, the quality of mea- 
surement and economic relevance should dictate how these 
metrics should best be used. For example, although cus- 
tomer satisfaction is strongly related to economic value 


llThis is not to suggest that all securities analysts completely 
ignore customer satisfaction. For example, Merrill Lynch and 
Morgan Stanley have written reports on it, and in general, analysts 
attribute the surge in Apple's stock price in late 2004 to an 
increase in customer satisfaction. 
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creation and the (collective) customer may have informa- 
tion relevant to the financial prospects of the firm before 
investors do, this cannot be leveraged if measurement is 
flawed and unable to extract the information. Ittner and Lar- 
cker (2003) find that most companies’ measurement 
methodologies for customer satisfaction are “mindless,” 
misleading, and too primitive to be useful. This state of 
affairs is unlikely to change unless shareholders, corporate 
boards, and investors put more pressure on companies to 
account for intangible assets more effectively. If accounting 
were to include information about the health of the firm’s 
customer relationships, investors would have a better under- 
standing of the link between the firm’s assets and its capac- 
ity to generate shareholder wealth, and managers would be 
compensated accordingly. Corporate governance would 
benefit as well. 

If investors were to use capital more expediently to 
encourage firms to strive for higher levels of customer satis- 
faction and better quality of measurement, there would be 
many beneficiaries. Consumers would be better off. As con- 
sumer and equity markets become more synchronized, their 
joint influence may contribute to less misallocation of capi- 
tal, quicker deflation of stock market bubbles (see Figures 
1—3), fewer cases of security mispricings, and a better func- 
tioning of markets in general. 

Although investors fail to realize the relevance of ACSI 
news and equity markets exhibit imperfections in this 
regard, the results of this study affirm the workings of free 
capitalistic markets with respect to the relationship between 
consumer utility and the flow of investment capital. 


Although it takes a while for equity markets and consumer 
markets to exercise their joint power, the finding that cus- 
tomer satisfaction and stock prices eventually move 
together suggests that most markets get it right in the long 
run. They apparently offer meaningful consumer choice, 
and short-term fluctuations and other exceptions notwith- 
standing, sellers may take comfort that they will (eventu- 
ally) be rewarded for treating customers well and that they 
risk punishment for treating customers poorly. 


APPENDIX 
Five and Fifteen Days of Cumulative Abnormal 
Returns (CAR) for ACSI News Announcements 





À CAR CAR 
Type of 5-Day 15-Day 
News Firm Type N Window Window 
Increase Manufacturing — 80 1.85 —.95 
in ACSI and services 
Manufacturing 48 .98 1.23 
Services 32 3.15 —4.22 
Decrease Manufacturing 
in ACSI and services 81 —.22 —1.65 
Manufacturing 41 —.78 —2.72 
Services 40 .36 —.55 


a nf ee SR 

Notes: None of the cumulative abnormal returns are significant at 
the 5% level based on nonparametric bootstrapped tests 
(two talled). 
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Market Pioneer and Early Follower 
Survival Risks: A Contingency 
Analysis of Really New Versus 

Incrementally New Product-Markets 


Does the first entrant in a new market have a difficult time surviving or do first-mover advantages provide protec- 
tion from outright failure? This empirical study of 264 new industrial product-markets yields insights into this con- 
troversial research topic. The key data analysis insights arise through a comparison of survival risks in markets 
that were started with a really new product and in those that were started with an incremental innovation. When a 
pioneer starts a new market with a really new product, it can be a major challenge just to survive. In contrast, in 
markets started by an incremental innovation, market pioneer survival risks are much lower. Notably, early follow- 
ers have the same survival risk across both types of markets. Overall, these results indicate that in markets started 
by a really new product, the first to market is often the first to fail. In contrast, in markets started by an incremental 
innovation, it appears that first-mover advantages protect the pioneer from outright failure. 





ost literature reviews on survivors conclude that tries. For example, Robinson and Min report a ten-year sur- 
IM market pioneers tend to have a greater market share vival rate of 66% for market pioneers versus 48% for early 

than later entrants (see, e.g., Kalyanaram, Robin- followers. 
son, and Urban 1995; Szymanski, Troy, and Bharadwaj Because of these divergent results, it seems likely that 
1995; VanderWerf and Mahon 1997). However, the typical under some circumstances, market pioneers have higher 
relationship between order of market entry and survival is survival rates. Under other circumstances, however, they 
controversial. It is important to try to resolve this contro- should have lower survival rates. À contingency analysis 
versy because market pioneers' survival problems can eas- may yield important insights into the complex relationship 


ily offset their market share advantages, and firms may be between order of market entry and survival. Srinivasan, 
discouraged from investing in the costly and risky attempt Lilien, and Rangaswamy (2004) also examine this complex 


to pioneer new markets. relationship. They report that network externalities have a 
Some research concludes that market pioneers experi- negative impact on market pioneer survival. 

ence a greater survival risk than later entrants. Olleros Although network externalities are important, our con- 

(1986, p. 8) cites examples of unsuccessful pioneers of new tingency analysis is based on product innovation newness. 

technologies and observes, “[A]gain and again, we see Product innovation newness is an important aspect of new 

industries emerge over the dead bodies of their early pio- product development that has received considerable 


neers.” Lambkin and Day (1989) and Tellis and Golder research attention (for a literature review on innovation 
(1996) predict higher failure rates for market pioneers, and types, see Garcia and Calantone 2002). Our contingency 
Lilien and Yoon (1990) and Golder and Tellis (1993) pro- analysis compares survival rates for markets that were 
vide empirical evidence in support of this viewpoint. For started with really new products with those that were started 


example, Golder and Tellis report a lifetime market pioneer with incremental innovations. This is part of a continuing 

survival rate of only 53%. effort to shed light on different challenges that marketing 
Other research indicates that first-mover advantages managers face in really new versus incrementally new 

help protect the market pioneer from outright failure. These product-markets (e.g., Schmidt and Calantone 1998; Song 

studies include Agarwal's (1997) product life-cycle analy- and Montoya-Weiss 1998; Urban, Weinberg, and Hauser 

sis, Shepherd's (1999) venture capitalist survey, and Robin- 1996). 

son and Min's (2002) analysis of industrial goods indus- When a pioneer starts a new market with a really new 


product, the costs and risks are unusually high, which 
— o makes survival more difficult. Because early followers can 
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ket pioneer to survive. In addition, a temporary monopoly 
plus first-mover advantages should yield higher survival 
rates for the pioneer than for early followers. 
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We test our contingency analysis with data from two 
sources. We used the Thomas Register of American Manu- 
facturers to identify 264 new product-markets for industrial 
goods and to track market pioneer (first entrant) and early 
follower survival over time. We supplement this with an 
expert survey that estimates product innovation newness for 
each of the 264 new markets. The expert survey classifies 
66 markets as starting with a really new product and 198 as 
starting with an incremental innovation. 

In the data analysis, consistent results surface between 
the descriptive statistics and a' hazard rate model that 
accounts for both observed and unobserved heterogeneity. 
The results reveal that market pioneers that launched a 
really new product had much lower survival rates than pio- 
neers that launched an incremental innovation. For exam- 
ple, the 12-year pioneer survival rate for really new prod- 
ucts is only 23% versus 61% for incremental innovations. 

A. surprising result is that early followers do not have 
meaningful survival risk differences in really new versus 
incrementally new product-markets. For example, the 12- 
year survival rates in really new versus incrementally new 
product-markets are essentially equal at 38% and 39%, 
respectively. This indicates that in really new product- 
markets, early followers that learn from the market pio- 
neer's experience can materially reduce their survival risk. 

The research controversy we described can be resolved 
by examining survival risk differences within really new 
and incrementally new product-markets. In incremental 
innovation product-markets, market pioneers have a consis- 
tently higher survival rate than early followers. An opposite 
result occurs in really new product-markets, in which mar- 
ket pioneers have a lower survival rate. Overall, the key 
research conclusion is that the market pioneer is more likely 
to be the first to fail when it starts a new market with a 
really new product but not when it starts the new market 
with an incremental innovation. 


Survival Risk Hypotheses 


The hypotheses we set forth in this section address survival 
risks for market pioneers and early followers. The first two 
hypotheses compare survival risks across markets that were 
started by a pioneer's really new or incrementally new prod- 
uct. The third and fourth hypotheses address survival risk 
within each type of market by comparing market pioneers 
with early followers. 


Really New Versus Incrementally New Product- 
Markets 


To assess product innovativeness, we follow Urban, Wein- 
berg, and Hauser’s (1996) dichotomy of really new versus 
incrementally new products. They state (p. 47), “Really new 
products shift market structures, represent new technolo- 
gies, require consumer learning, and induce behavior 
changes.” At the other end of the spectrum, an incremental 
innovation is designed to satisfy a felt market need and uses 
an existing technology or refinement of it. 

Really new products, such as automobiles and personal 
computers, use new technologies to elicit new market 
needs. Agarwal and Bayus (2002) note that really new prod- 
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ucts are often unreliable and costly when they first appear 
on the market. Because of high levels of market and techno- 
logical uncertainties, they face slow acceptance by potential 
users. A key source of market uncertainty arises from cus- 
tomers' concern about the new product's ability to deliver 
all its promised benefits. Really new products may also 
require the use of complementary hardware (e.g., roads for 
automobiles) or software (e.g., programs for personal com- 
puters) technologies, often supplied by an independent third 
party, whether government or industry (Olleros 1986). 
Potential customers often delay adoption of a really new 
product until the complementary technologies become 
available. 

Really new products often entail high technological 
uncertainty. Potential users tend to be concerned about 
whether a market will eventually adopt the technical stan- 
dards embodied in the really new product. In addition, a 
dominant product design for a really new product can take 
years to emerge (Christensen, Suarez, and Utterback 1998). 
When a pioneer is committed to another product design that 
customers do not want, its survival can be threatened. In 
summary, the resources required to overcome market and 
technological uncertainties associated with the adoption of 
really new products can cause pioneer burnout and eventual 
exit from the market. 

Conversely, incrementally new products involve much 
lower levels of market and technological uncertainties. Mar- 
ket uncertainty is low because these new products provide 
incremental benefits relative to existing products. Because 
the felt need already exists, market research can provide 
more accurate sales forecasts for incrementally new prod- 
ucts than for really new products. Technological uncertainty 
is limited because incrementally new products tend to be 
based on a refinement or extension of established technolo- 
gies. Thus, market pioneers that launch an incrementally 
new product should face low survival risk. 

There is some empirical evidence of high survival risk 
for really new products. Olleros (1986) cites pneumatic 
tires, sewing machines, automobiles, typewriters, heli- 
copters, televisions, jet engines, and transistors as notable 
examples of product-markets in which pioneers did not sur- 
vive. Such examples tend to be really new products that 
serve unmet customer needs. The 36 product categories in 
Golder and Tellis's (1993) work span innovations such as 
video recorders, microwave ovens, dishwashers, personal 
computers, and facsimile machines. They claim (p. 64) to 
have "chosen to identify pioneers in distinctive product cate- 
gories rather than in narrow categories," which suggests that 
their sample is more likely to contain really new products. 

In contrast, Ürban and colleagues (1986), who report a 
zero pioneer failure rate, analyze a sample that contains 
"tightly defined" categories, such as liquid laundry deter- 
gents, instant freeze-dried coffee, fabric softeners, and anti- 
dandruff shampoo. Robinson and Min (2002) report higher 
five- and ten-year survival rates for market pioneers than for 
early followers. Their sample mainly includes incremental 
innovations, such as disposable mops, animal access doors, 
boat lights, and pickle-making machinery. Thus, the empir- 
ical evidence that supports higher pioneer failure rates 
emphasizes really new product-markets, whereas the empir- 


ical evidence that supports lower pioneer failure rates 
emphasizes incrementally new markets. Thus: 


H,: Market pioneers have a greater survival risk for really new 
products than for incrementally new products. 


Market and technological uncertainties associated with 
the adoption of really new products often take years to 
resolve. Complementary technologies are likely to be lim- 
ited until product sales reach a critical mass. The cost and 
risk of adopting a really new product is often very high. 
Such breakthrough innovations tend to be relatively expen- 
sive and deficient in performance. In light of these deficien- 
cies and uncertainties, many potential users adopt a wait- 
and-see attitude. The full resolution of market and 
technological uncertainties may take a long time. Thus, 
early followers should also experience greater resistance in 
gaining customer acceptance in really new product-markets. 


H^: Early followers have a greater survival risk for really new 
products than for incrementally new products. 


Order of Market Entry 


Is the first to market the first to fail? A comparison of mar- 
ket pioneer with early follower survival rates in really new 
and incrementally new product-markets addresses this con- 
troversial research topic. For really new products, a market 
pioneer faces numerous first-mover disadvantages. Lieber- 
man and Montgomery (1988) note first-mover disadvan- 
tages as (1) free rider effects, (2) market and technological 
uncertainties, (3) changing technology or customer needs, 
and (4) incumbent inertia. Because early followers in our 
sample typically entered within two to three years of the 
pioneer, they are also vulnerable to incumbent inertia. Next, 
we highlight the other first-mover disadvantages. 

Imitation is typically less expensive than innovation, so 
early followers have the opportunity to free ride on the pio- 
neer's investments. This should be especially true for really 
new products for which pioneering can be costly. As men- 
tioned previously, market and technological uncertainties 
are often high for really new products, which can require 
huge resources and cause pioneer burnout (Olleros 1986). 
Furthermore, Kerin, Varadarajan, and Peterson (1992, p. 43) 
suggest that “the greater the [market] uncertainty level, the 
lower the likelihood that a first-mover will make sizable 
investments in capacity to achieve scale-dependent cost 
advantages." 

Finally, changing technology and customer needs can 
provide early followers with an opportunity to leapfrog the 
pioneer with a superior product offering. Product superior- 
ity can arise from superior technology or from a superior 
job of satisfying unmet customer needs. 

Two industry studies provide empirical evidence for 
really new product-markets. Across five diagnostic imaging 
product categories, Mitchell (1991) finds that later entrants 
have lower survival risks, and in the rigid disk industry, 
Christensen, Suarez, and Utterback (1998) report that later 
entrants have lower survival risks. On the basis of this theo- 
retical and empirical evidence, we predict the following: 


Haa: For really new products, market pioneers have a greater 
survival risk than early followers. 


Although the disadvantages for market pioneers are 
especially strong for really new products, market pioneer 
advantages can also be especially strong. First, market pio- 
neers benefit from a temporary monopoly, which provides 
temporary protection from competition. À temporary 
monopoly also helps pioneers strengthen their first-mover 
advantages (Brown and Lattin 1994; Huff and Robinson 
1994). Because imitation should be more difficult for really 
new products than for incrementally new products, this 
tends to lengthen the pioneer's monopoly. 

Second, first-mover advantages help protect pioneers 
from outright failure. Pioneer first-mover advantages span 
numerous sources. These sources include a stronger brand 
name (Schmalensee 1982), customer preferences that are 
shaped to favor pioneers (Carpenter and Nakamoto 1989), 
and broader product lines (Robinson and Fornell 1985). In 
really new product-markets, it seems likely that these first- 
mover advantages will be stronger. For example, a really 
new product is more unique than an incremental innovation, 
and this uniqueness should strengthen consumer memory 
and brand name recall. Unique products should be more 
effective in shaping customer preferences in favor of the 
pioneer. Finally, unique products should have more oppor- 
tunities for product line extensions. When longer pioneer 
lead times and stronger first-mover advantages outweigh 
first-mover disadvantages, we predict the following: 


Ha: For really new products, market pioneers have a lower 
survival risk than early followers. 


For incrementally new products, first-mover disadvan- 
tages should be modest. First, there should be fewer oppor- 
tunities to free ride on the pioneer's investments. This is 
because these investments are typically modest compared 
with developing a really new product. Second, incremental 
innovations typically refine and modify existing products, 
so the felt need for these products often exists, and the tech- 
nologies are well established. With established customer 
needs and technologies, market and technological uncer- 
tainties are modest. Finally, changes in customer needs and 
technologies are less likely to arise and to surprise the 
pioneer. 

In addition, market pioneers should still benefit from a 
temporary monopoly and first-mover advantages. Even an 
incremental innovation cannot typically be copied 
overnight, so the temporary monopoly makes it easier to 
survive. Furthermore, first-mover advantages in terms of 
brand name strength, customer preferences that are shaped 
to favor the pioneer, and broader product lines should make 
it easier to survive. 

Empirically, Robinson and Min (2002) and Urban and 
colleagues (1986) report relatively high market pioneer sur- 
vival rates. Because both studies have relatively narrow 
market boundaries, they should be dominated by incremen- 
tal innovations. Thus: 


H4a: For incremental innovations, market pioneers have a 
lower survival risk than early followers. 


Although the pioneer disadvantages we discussed previ- 
ously are not as great in incrementally new product- 
markets, Golder and Tellis (1993) describe three additional 
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disadvantages. First, improper positioning arises when the 
pioneer misses the ideal point and when repositioning costs 
are high. Even for incrementally new product-markets, later 
entrants have a better chance of hitting the ideal point. Sec- 
ond, resource requirements can change over time. An early 
follower is in a better position to predict the key resource 
requirements for survival. Third, a market pioneer may not 
be able or willing to commit the resources necessary to suc- 
ceed. Again, an early follower is in a better position to 
understand the resources required for success. Thus, when 
the pioneer disadvantages outweigh the advantages, we pre- 
dict the following: 


: For incremental innovations, market pioneers have a 
p 
greater survival risk than early followers. 


Data 


The Thomas Register data have been used in numerous sur- 
vival studies (in the economics literature, see, e.g., Agarwal 
1997; Agarwal and Gort 1996, 2001; Gort and Klepper 
1982; in the marketing hterature, see, e.g., Agarwal and 
Bayus 2002; Robinson and Min 2002). The Thomas Regis- 
ter is a national buying guide that provides “a comprehen- 
sive detailed guide to the full range of products manufac- 
tured in the United States" (Lavin 1992, p. 129). The 
Thomas Register attempts to achieve comprehensive cover- 
age by subscribing to a broad range of industry newsletters, 
searching for start-up ventures in university incubators, and 
providing a free listing in each annual issue. For example, 
the 2003 Thomas Register includes approximately 173,000 
firms and more than 65,000 product categories. 

A key Thomas Register data strength 1s that the market 
boundaries are buyer driven. To help industrial buyers, a 
new product category is typically added to a group of 
closely related categories. A new category does not split an 
existing category. For example, an industríal buyer search- 
ing for valves will find air valves, nuclear valves, frostproof 
valves, solar collector drain-down valves, and so forth. 

The Thomas Register's professional staff determines 
market boundaries and the relevant competitors in each 
market. The market boundaries are narrower than those of 
Golder and Tellis (1993), who include broad categories, 
such as microwave ovens, cereals, and soft drinks. How- 
ever, they are typically broader than the boundaries of 
Urban and colleagues (1986), who examine consumer pack- 
aged goods, such as light beer, antacids, and steak sauce. 

Because the Thomas Register is a national buying guide, 
it typically excludes firms with exclusively local sales. By 
excluding local markets, our sample emphasizes regional 
and national markets. In terms of scale of entry, this opera- 
tional definition falls between that of Urban and colleagues 
(1986), who require national distribution, and that of Golder 
and Tellis (1993), who include local markets. 

We identified new markets by comparing annual edi- 
tions of the Thomas Register from 1960 to 1995. We identi- 
fied a new market when it was listed in one year but not in 
prior years. The identification of the 264 new industrial 
markets in our study was the most time-consuming part of 
the data collection process. 
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Thomas Register Data Limitations 


Although it is unlikely that the Thomas Register staff set 
market boundaries that are too broad, what happens when 
the market boundaries are too narrow? In these markets, our 
sample misses the market pioneer that has already entered a 
broader product-market. Thus, our research inadvertently 
compares an early follower with a later entrant or even a 
later entrant with other later entrants. In this situation, the 
true results should be even stronger than those we subse- 
quently report. This is because the 1mportance of order of 
entry should have diminishing marginal returns. For exam- 
ple, greater differences should arise between the market 
pioneer and an early follower than between two later 
entrants. 

Another data limitation arises from our definition of 
"exit" We follow Agarwal's (1997, p. 574) guidelines such 
that an exit arises when the firm's name and address are 
both missing from the product category in a given year. This 
avoids the problem of confusing an exit with a name 
change, such as a conglomerate merger, or confusing an exit 
with an address change from office relocation. Even so, 
when a successful firm is merged or acquired, its name and 
address are often changed. In this case, it is classified as an 
exit. Because exit implies failure rather than success, this 
limitation surfaces in the other Thomas Register survival 
studies we previously cited. 

In an attempt to quantify this potential bias, we gathered 
additional data for the most recent Thomas Register exits 
that arose between 1994 and 2002. There were 178 product 
exits during this nine-year period (see Table 1). We then 
identified cases in which a firm exit arose in the same year 
as the product exit. A firm exit arises when a firm is delisted 
from the Thomas Register. This approach assumes that 
when a merger or acquisition occurs, a product exit and the 
firm exit arise in the same year. 

In Table 1, only 25% of the product exits were accom- 
panied by a firm exit. Furthermore, there were no meaning- 
ful differences associated with either order of entry or prod- 
uct innovation newness. We provide some additional 
insights for the 45 cases for which a product exit and the 
firm exit arise in the same year. We base these insights on a 
Google search, an Internet search for bankruptcy court fil- 
ings, and calls to the most recent telephone number listed in 
the Thomas Register. 

For these 45 cases, 10 were a result of either a merger or 
an acquisition, 4 were a result of bankruptcies, and 31 had 
an unknown outcome.! For the mergers and acquisitions, it 
is important to know whether the acquired firm was suc- 
cessful or unsuccessful in the product category that was 
being tracked. Successful product category performance 
should not be classified as an exit, whereas unsuccessful 
performance can be classified as an exit. 

Although there are not enough data to determine suc- 
cess at the time of éxit, we assume that when product cate- 





lFor the 31 cases with unknown outcomes, we were unable to 
contact the acquired firm by calling the most recent telephone 
number listed in the Thomas Register. Because successful firms 
should attempt to maintain telephone contact with established cus- 
tomers, this outcome seems more likely to arise for unsuccessful 
firms. 


TABLE 1 
Product and Firm Exits, Mergers and Acquisitions, and Bankruptcies Between 1994 and 2002 








Number 
(Percentage) 
of Product 
Exits That Mergers or 
Product Accompanled ———— Number of Untraceable 
Exits by a Firm Exit Survivors Exits Bankruptcles Cases 
Really New Products 
Market pioneers 16 4 (25) 0 1 0 3 
Early followers 36 8 (22) 1 0 2 5 
Incrementally New 
Products 
Market pioneers 44 12 (27) 1 1 2 8 
Earty followers 82 21 (26) 0 6 0 15 
Total 178 45 (25) 2 8 4 31 





gory performance is successful, the acquiring firm should 
remain active in the product category through 2002 
(roughly three years after the acquired firm’s exit). In Table 
1, the acquiring firm remained active in the product cate- 
gory for only two of the ten mergers and acquisitions. This 
suggests that even when a Thomas Register exit arises from 
a merger or an acquisition, the acquired firm is more likely 
to be struggling than successful. Overall, these results indi- 
cate that mergers and acquisitions of successful firms do not 
seriously bias the Thomas Register exit measure. 


Key Definitions 


In each product-market, the market pioneer is defined as the 
market’s first entrant. Because our research compares mar- 
ket pioneers with early followers, our analysis excludes 
markets with multiple first-year entrants. This is because a 
unique first entrant (market pioneer) cannot be identified. 
When we identified a unique first entrant, we tracked the 
market forward on an annual basis until we identified 
another entrant. We call this entrant and any other entrants 
in that year early followers. 

We surveyed industry and research experts to estimate 
the degree to which each new product-market was started 
with a really new product or an incremental innovation. We 
provided each expert with Urban, Weinberg, and Hauser’s 
(1996) conceptual distinctions between really new and 
incrementally new products. In these conceptual distinc- 
tions, a really new product creates a new market, relies on 
new technology, and requires customer learning. An incre- 
mentally new product is designed to satisfy a felt market 
need and uses an existing technology or refinement of it. In 
the rating scale designed to elicit the newness of a product 
innovation, 1 is an incremental innovation and 7 is a really 
new product. Table 2 provides the survey questionnaire for 
chemicals and allied products. 

For each of the 264 new markets, four to nine experts 
rated the degree of product innovativeness when the new 
market started. Of the 39 experts, 15 had a doctoral degree, 
most had industry work experience, and all were pursuing 
scientific research. Approximately 90% were employed in a 
research-oriented university that has most of its engineering 


departments ranked in the top 20 nationally. Each expert 
evaluated products only in his or her field of expertise.? 

To the best of our knowledge, the experts' ratings were 
internally consistent across the product categories. Even so, 
in the questionnaire pretest, some experts in basic research 
labs gave consistently low ratings for the degree of product 
innovativeness. To control for this bias, our final survey 
included some well-known really new products that served 
as benchmarks, which we derived from prior research stud- 
ies. For chemical and allied products, for example, our sur- 
vey included DDT, saccharin, penicillin, and streptomycin 
(Gort and Klepper 1982). In other categories, we included 
the personal computer, the laser jet printer, the Walkman, 
Web TV (Urban, Weinberg, and Hauser 1996), transistors, 
electric shavers, and radar detectors (Gort and Klepper 
1982). 

To adjust for individual response bias, in the first step, 
we computed each respondent's benchmark score, which is 
the average score across the well-known really new prod- 
ucts. In the second step, we subtracted the respondent's 
benchmark score from the corresponding raw score. For 
each expert, we considered a product really new when the 
raw score was greater than or equal to the benchmark score. 
We classified a product as really new when the majority of 
experts agreed that a given product was really new. This 
procedure yielded 66 really new products and 198 incre- 
mental innovations. 

To quantify the reliability of the product innovation 
newness measure, we determined Cronbach's alpha, which 
estimates the average correlation in interexpert ratings. 
Table 3 reports the results across two-digit Standard Indus- 
trial Classification (SIC) code industries. Cronbach's alphas 
ranged from .76 to .96, which indicates an acceptable 
degree of reliability. 


2The surveys were conducted in person. Although it was not 
part of the formal survey, ın most cases, the expert could not iden- 
tify the market pioneer or the first early follower. This may arise 
because the average year of market inception was in the 1970s. 
Thus, it does not appear that early entrant survival knowledge 
biased the degree of product innovativeness measure. 
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TABLE 2 
Survey Questionnaire for Chemicals and Allied Products 





Please evaluate whether each of the following products was a really new product or an incremental new product at the time of 


its initlal commercial introduction. Note that: 


*A really new product, or basic Innovation, creates a new market, relies on new technology, and requires customer 


learning. 


eAn incremental new product, or modification, is designed to satisfy a felt market need and uses existing technology or 


refinement of it. 


Product Category 


Carbon monoxide gas 
DDT 

Dlelectnc gas 
Chrome dyes 

Indium compounds 
Zinc hardeners 
Lanthanides 
Saccharin 

Bromide lead 

Lead sillco fluoride 
Streptomycin 

Lead titanate 

Lithium acetate 
Lithium niobate 
Magneslum perchlorate 
Peniclllin 

Manganous sulfate 


Incremental 
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Don't know 
Don't know 
Don't know 
Don't know 
Don't know 
Don't know 
Don't know 
Don't know 
Don't know 
Don't know 
Don't know 
Don't know 
Don't know 
Don't know 
Don't know 
Don't know 
Don't know 
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a i ld ori a a a Rc ens ue 
Notes: This Is a part of the questionnaire for chemicals and allled products. The actual survey questionnaire Includes 34 chemical and allied 


products. 


TABLE 3 
Interexperts Rellability: Cronbach's Alpha 





Number of Number of Cronbach's 


Survey Group Industry (Two-Diglt SIC Code) Markets Raters Alpha 
Textile mills products (22), apparel and fabrics (23), lumber and wood products (24), 

paper and allied products (26), printing, publishing, and allled industries (27), and 

other industrial products 19 4 .95 
Chemicals and allled products (28) 25 9 .87 
Materials and fabricated metal products (30, 32, 33, 34): Subgroup 1 40 4 .84 
Materials and fabricated metal products (30, 32, 33, 34): Subgroup 2 16 4 .96 
Machinery and computer equipment (35) and other industrial products: Subgroup 1 61 8 .90 
Machinery and computer equipment (35) and other industrial products: Subgroup 2 24 4 93 
Electronic and electrical equipment (36): Subgroup 1 24 7 .88 
Electronic and electrical equipment (36): Subgroup 2 12 5 .86 
Measuring, analyzing, and controlling equipments (38): Subgroup 1 18 8 .86 
Measuring, analyzing, and controlling equipments (38): Subgroup 2 25 5 .76 
Average 26.40 5.64 .88 





Overall, the benchmark adjustment to the product inno- 
vativeness measure appears to reduce individual response 
bias. This is because it yields a reliable measure and tends 
to strengthen the hazard rate modeling results we discuss 
subsequently. 


Descriptive Statistics 


Across the entire sample, the market pioneer 12-year sur- 
vival rate is 5296. Golder and Tellis (1993) report a lifetime 
market pioneer survival rate of 5396. Does this imply that 
the first to market is the first to fail? In our entire sample, 
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the 12-year early follower survival rate is only 39%. 
Because market pioneers have significantly higher survival 
rates, it appears that first-mover advantages help protect 
pioneers from outright failure. Although this conclusion 
applies to the entire sample, more accurate insights arise 
through a comparison of really new product-markets with 
incremental innovations. 

In our sample, the really new product-markets have 66 
market pioneers and 146 early followers. The incremental 
innovation product-markets have 198 market pioneers and 
340 early followers. The average year of market inception is 
1978 for really new products and 1975 for incremental 


innovations. Average pioneer lead time, which is the length 
of the pioneer’s monopoly, is only 2.5 years. 

Table 4 describes the sample in terms of two-digit SIC 
codes and new product newness. The really new products 
include 13 chemical products, such as gelling agents, cellu- 
Jose propionate, indium compounds, dielectric gas, pary- 
lene, and so forth. Really new products in other industries 
include voice-activated microphones, osmometers, blood 
pressure monitors, video microscopes, bomb detectors, and 
computer screen projectors. The incremental product- 
markets include welding nozzles, poultry chillers, asphalt 
concrete, boat coolers, bottle sorters, fiberglass washers, 
boat loaders, and ice-dicing machinery. The Appendix pro- 
vides a complete list of the product-markets, the year of 
market inception, and the degree of product innovativeness. 

Using 4-, 8-, and 12-year survival rates, in Table 5, we 
provide initial hypothesis-testing insights. We do not use 
longer time horizons, because right-censoring problems 
limit the sample size. Twelve-year survival rates provide the 
most information, so we highlight them. 

For H,, 12-year survival rates for market pioneers are 
much lower for really new products than for incremental 
innovations. At 23% versus 61%, incremental innovation 
survival rates are almost three times higher than those of 
really new products. This difference can be explained by the 
greater market and technological uncertainties that pioneers 
must overcome when they launch a really new product. This 
result provides strong support for H}. 

For Hp, it is surprising that there are no meaningful sur- 
vival rate differences for early followers in really new ver- 
sus incrementally new product-markets. This result is also 
surprising because these early followers typically entered 
the new product-markets only two to three years after the 
pioneer. This suggests that early followers benefit from the 


efforts and experience of the pioneer in overcoming market 
and technological uncertainties. 

For H3, market pioneers in really new product-markets 
have significantly lower 12-year survival rates than early 
followers (23% versus 38%). For Hy, an opposite sign 
arises: Market pioneers have significantly higher 12-year 
survival rates (61% versus 39%). The higher levels of mar- 
ket and technological uncertainties in really new product- 
markets, which should decrease market pioneer survival 
rates, can explain this sign reversal. 

Two industry examples help illustrate these results. 
First, in our expert survey, a voice-activated microphone 
was classified as a really new product. It was pioneered in 
1992 by Sherwood Communications Associated Ltd., which 
entered the market as a start-up company with approxi- 
mately $1 million in assets. The company exited the market 
in 1993, which is the same year the first early follower, 
Telephonics Corporation, entered. Telephonics was an 
industry incumbent and had experience manufacturing 
headsets and microphones. Its asset size was greater than 
$250 million. Telephonics survived through 2002 and was 
an active player in the market. 

A second example is a welding nozzle, which was clas- 
sified as an incrementally new product in our expert survey. 
It was pioneered in 1987 by Maryland Ceramic and Steatite 
Co., which was an industry incumbent with more than $1 
million in assets. In 2002, it was one of 22 market sur- 
vivors. The first early follower was Lurmark Ltd., a start-up 
company. It entered the market in 1990 but exited in 1993. 


Hazard Rate Analysis 


Although descriptive statistics are useful, a multivariate 
hazard rate analysis provides more accurate hypothesis- 








TABLE 4 
Sample by Two-Digit SIC Code (n = 750) 
Really New incrementally 
Products New Products All 
Industry (Two-Diglt SIC Code) 
Textile mills products (22) 0 3 3 
Apparel and fabrics (23) 0 3 3 
Lumber and wood products (24) 0 5 5 
Paper and allied products (26) 0 4 4 
Printing, publishing, and allied Industries (27) 0 2 2 
Chemicals and allied products (28) 13 12 25 
Rubber and plastics products (30) 3 19 22 
Stone, clay, glass, and concrete products (32) 0 8 8 
Primary metal industries (33) 2 5 7 
Fabricated metal products (34) 2 17 19 
Machinery and computer equipment (35) 13 62 75 
Electronic and electncal equipment (36) 12 24 36 
Measuring, analyzing, and controlling equipments (38) 17 26 43 
Other Industries 4 8 12 
Total 66 198 264 
Number of Firms 
Market ploneers 66 198 264 
Early followers 146 340 486 
Total 212 538 750 
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TABLE 5 





Descriptive Statistics 
4-Year 8-Year 12-Year 
^N Survival Survival Survival 

Market Pioneers (H4) l 

Really new products 66 76% 53% 23% 

incrementally new products 198 92% 79% 61% 

|t-statistics| 3 72** 4.17" 5.62** 
Early Followers (H3) 

Really new products 146 74% 50% 38% 

Incrementally new products 340 76% 52% 39% 

It-statlstics| .45 .47 .32 
Really New Products (H3) 

Market ploneers 66 7696 53% 23% 

Early followers 146 74% 50% 38% 

[t-statistics] 27 44 2.17" 
Incrementally New Products (H4) 

Market pioneers 198 92% 7996 61% 

Early followers 340 76% 52% 39% 

|t-statistics| 5.53"* 6.64™ 5.05** 
*p< .05 
"*p < .01. 


Notes. We assume that nght-censored observations at the 10th or 11th year survive for 12 years. 


testing results. A hazard rate is the conditional likelihood of 
a firm's failure in time period t, given that the firm has sur- 
vived through time period (t — 1). Two widely recognized 
benefits of a hazard rate analysis are that it (1) represents 
the impact of time-varying explanatory variables on a firm's 
likelihood of failure and (2) addresses the right-censoring 
problem (Helsen and Schmittlein 1993, p. 397). Right cen- 
soring occurs when a firm does not have a completed spell 
within the data observation window and when all that is 
known is that its survival duration lasted from the time of its 
market entry until the end of the observation window. 
Because our data are annual observations, we use a discrete 
hazard rate model that incorporates the impact of time- 
varying covariates (e.g., competitive firm density in a 
product-market) on a firm's likelihood of failure. We model 
the impact of time on a firm's likelihood of failure nonpara- 
metrically to capture any.arbitrary pattern in the firm's 
baseline hazard rate over time (see, e.g., Han and Hausman 
1990; Meyer 1990). In other words, we assume that the 
baseline hazard rate function is a step function of time that 
is constant within a year but is allowed to vary from one 
year to another. An advantage of this approach is that it 
enables consistent estimation of the effect of explanatory 
variables for arbitrary underlying baseline hazard functions. 
Finally, the hazard rate model incorporates the impact of 
unobserved heterogeneity on firm hazard rates. 

In the hazard rate model, let t denote the survival dura- 
tion of an entrant with the cumulative probability distribu- 
tion function F(t).? The associated hazard rate and survival 


3Vanhuele and colleagues (1995) use the same hazard rate 
model to analyze marnage dissolution data. Their article has a 
more complete hazard rate model description. 
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functions are h(t) and S(t) = 1 — F(t). Now, consider that we 
observe firm i through a total of t, time periods (years in our 
case), where t; can range from one year to the entire length 
of the data observation window. That is, assume that firm i 
has survived through the year (t, — 1) and that we observe it 
into the tth period [t — 1, t) to determine whether it failed 
during the year or was right censored during the period. If 
firm i exits during the t,th year, the probability of its failure 
in the t,th period is [S(t, — 1) — S(t)]. However, if it was cen- 
sored during the t,th year, all that can be determined is that 
the firm survived through the end of period (t; — 1); thus, the 
probability of that event is S(t, — 1), under the assumption 
that censoring occurs at the beginning of the interval. 
Therefore, we can write the likelihood function for firm i, 
which we are able to observe into the t,th year, as follows: 


(1) L(t;)=[s¢, -p-s xs, - n], 


where the indicator variable 6, takes the value of one if firm 
iis right censored during the t,th year and zero if firm i exits 
the market during the t,th year. Multiplying and dividing the 
right-hand side of Equation 1 by [S(t; — 1)]& - 1, we obtain 


S(t, -) - S(t,) 
S(t, — 1) 


1-8, 
| x/ S(t, - D] 


= [he ^ x[sc, -], 


(2) L(t) = | 


where h(tj) is the hazard rate function representing the 
chance of firm i's exit in the tth period, given that it has 
survived through the period (t, — 1). We now explain our 
derivation of the expression for hazard rate function, h(t;). 
We then obtain an expression for the associated survival 
function S(t, — 1). 


Ly 


Overall, the hazard rate of firm i in time period t, 
depends on the baseline hazard rate function and the firm’s 
covariate values in time period t,, X,(t,). Firm i's hazard rate 
in time period t, can be expressed as 


(3) h[t.; X, (t,)] 2 À x P(t) x OLX, (t,)], 


where Aj is the base hazard rate, ọ(t;) is a step function that 
captures time (or age) dependence of the hazard rate, and 
Q[X,(t)] is a time-varying covariate function. As we men- 
tioned, the age dependence function @(t,) is modeled as a 
step function that remains constant within a year equal to, 
for example, C, in year t but is allowed to vary from year to 
year. The time dependence component of the hazard rate 
function, qXt,), in year t, is 


(4) 9(t,) = exp[C, Dit, 


where the dummy variable, D(t;), equals one in year t, and 
zero in all other years. No dummy variable is used for the 
first year because it is not possible to identify both Ap and 
C,. Therefore, Ay 1s the time dependence component of the 
hazard rate function in the first year. Positive (negative) 
estimates of the time-varying C, coefficients for the other 
years indicate a higher (lower) hazard rate than the first- 
year hazard rate, Ap. 

A firm's hazard rate also depends on its characteristics 
(e.g., its order of entry in a product-market) and product- 
market factors, some of which may vary with time (e.g., 
competitive firm density). The impact of covariates on firm 
i's hazard rate in period t, is 


(5) CX, (t,)] = exp[BX, (t, )], 


where X,(t,) is a vector of (possible) time-varying covariates 
and B is a vector of the covariate coefficients. A positive 
estimate of a B coefficient implies that an increase in the 
value of the covariate will enhance the likelihood of firm 
failure. Specifically, when the jth covariate increases by 
one, the conditional probability of firm failure changes by 
[exp(B;) - 1] x 100%. 

Substituting the terms @(t,) and o[X,(t,)] from Equations 
4 and 5 in Equation 3, we obtain 


(6) h[t,; X, (t,)] = Ag exp[C,D(t,) + BX, (t,)]. 


The survivor function associated with the hazard rate in 
Equation 6 is given by (Lancaster 1990; Yamaguchi 1991) 


(7) S(t) = exp[-H(t)], 

where 

(8) H(t,) = [nenas 
0 


Although the time-varying covariates (e.g., competitive firm 
density in a product-market) can change from year to year, 
we assume that they remain constant within each year. 
Given this simplifying assumption, Vanhuele and col- 
leagues (1995) show that Equation 8 can by rewritten as 


(9) H[t; B,(t)] = AoB,(t), 


where B,(t,) = E'_,exp[BX,(t) + C,D(t]. Substituting Equa- 
tion 9 into Equation 8 yields 


(10) S[t,, B, (t,)] = expl-ApB, (t, )]. 


We can now substitute the survival function from Equation 
10 into the expression for the likelihood function in Equa- 
tion 1 and obtain the log-likelihood function for the N firms 
in our sample (see Vanhuele et al. 1995): 


N 
(11) LL = Y (q - 8,)log(expI-AoB, (t, — D] 


12] 


— exp[-ApB, (t,)]} — §,A9B, (t, — D). 


The final step in our hazard rate analysis is to account 
for unobserved heterogeneity. A firm’s likelihood of failure 
may depend on some factors that are not measurable and 
others for which data are not available. Thus, firms may 
have different mean survival rates even when their observ- 
able characteristics are the same. Unobserved heterogeneity 
in the base hazard rate is modeled by allowing Ào to vary 
across the population according to a time-1nvariant distribu- 
tion. Thus, the likelihood function contributions, which we 
previously derived, are conditional on a specific value of Ag 
(see Equation 11). We obtain the unconditional likelihood 
function by weighting the conditional likelihood by the 
relative occurrence of Ag. This is specified as 


(12) Lt)» [L(t ho 8p )do, 
0 


where g(Ao) is called the (unobservable) mixing distribu- 
tion. Because of the gamma distribution’s flexibility, it is 
often used as the mixing distribution (Heckman and Singer 
1984; Lancaster 1979). Vanhuele and colleagues (1995) 
show that assuming a gamma mixing distribution leads to a 
closed-form solution for the log-likelihood function: 


ar 


N 
(13) LL = Ce ners 


1= 1 


QI 
1B, (t, -1)+ (1— 8) exp[BX, (t,) + C, D(t,)] + " 


where œ and r are the scale and shape parameters of the 
gamma mixing distribution. The coefficient of variation of 
the mixing distribution is often interpreted as a measure of 
the degree of heterogeneity. For the gamma distribution, CV 
reduces to 


giving the r parameter the interpretation that a low (high) r 
value suggests a relatively more (less) heterogeneous 
population. 

Control Varlables 


Table 6 provides the variable definitions, means, and stan- 
dard deviations for the full model specification. Key model 
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TABLE 6 
Varlable Definitions 


ee À——M € 


Varlable 

Really new product-market 
Market pioneer 

Early follower 


Heally new product-market x 


market pioneer 


Market pioneer x In pioneer 
lead time 


Early follower x In pioneer 
lead time 


Early follower x (In pioneer 
lead time)? 


Ln firm size 


Incumbent 


Ln calendar year of market 
inception 


Economic decline, 
N, 


N? 
N 
Chemicals 
Electronics 
Machinery 
Instruments 


Definition 
1 for a really new product-market, 0 otherwise 
1 for the market's first entrant, O otherwise 
1 for each entrant In the first year of entry after the pioneer, 0 otherwise 


1 for a market ploneers in a really new product-market, O otherwise 


For the market pioneer, the natural logarithm of its lead time years over 
the second entrant plus 1.0, 0 otherwise 


For early followers, the natural logarrthm of their delay after the pioneer's 
entry plus 1.0, 0 otherwise 


The squared value of the preceding Interaction term 


The natural logarithm of the firm's tangible asset size divided by its 
industry average 


1 if the entering firm was In existence in period (t — 1), 0 if otherwise 


The natural logarithm of the calendar year of market Inception minus 1900 
1 If annual gross domestic product declined, O if otherwise 


Normallzed firm density at time t equals the number of competitors In the 
market divided by the average number over the ten-year period after the 
pioneers monopoly ends 


The squared value of the normalized firm density at time t divided by 100 
The normalized firm density at the time of market entry 

1 for a chemical product (SIC = 28), 0 otherwlse 

1 for an electronic product (SIC = 36), 0 otherwise 

1 for a machinery product (SIC = 350—356), 0 otherwise 

1 for an instrument product (SIC = 38), 0 otherwise 


M 
(SD) 


28 (45) 
35 (48) 
.65 (48) 


.08 (.28) 
40 (.61) 
69 (.62) 


86 (1.11) 
—24 (1.82) 


.67 (42) 


4.34 (.12) 
.18 (.38) 
1.81 (1.85) 


07. (.18) 
38 (31) 
.07 (.26) 
16 (.37) 
20 (40) 
17 (38) 


Notes: For In firm size, the industry average is defined at the two-diglt SIC code. Dividing firm density by the average firm density of the market 
for ten years after the monopoly period normalizes the firm density variables. The means and standard deviations are based on 750 
observations For the time-varying vanables (Economlc decline, N,, and N2), we measure the statistics at the last observation perlod of 


the sample. 


specification variables we used in the hypothesis testing are 
the really new product-market dummy variable, a market 
pioneer dummy variable, and a really new product-market x 
market pioneer interaction term. Next, we discuss all the 
control variables. 


Conventional economic variables. Geroski's (1995) lit- 
erature review, suggests that the likelihood of survival is 
greater for larger firms than for smaller firms. The Thomas 
Register includes the tangible asset size for each firm. 
Because the average tangible asset size varies over indus- 
tries and has changed over time, we compare entrant firm 
size at the time of market entry with the average industry 
firm size and categorize the entrants as larger versus smaller 
firms relative to the industry average. We define the indus- 
try at the two-digit SIC code level. 

Incumbents are likely to have higher survival rates than 
start-up ventures because of their prior experience, market 
knowledge, brand equity, and customer relationships. In 
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line with the work of Agarwal and Gort (1996), we define 
incumbents as firms in existence in year (t — 1). 

In recent decades, increasing international competition 
and shorter new product development cycles have led to an 
increase in competitive intensity (Griffin 1997, p. 450). If 
competitive intensity has increased from the 1960s to the 
1990s, hazard rates should be positively influenced by the 
calendar year of market inception. 

We controlled for abnormally high hazard rates in the 
years of economic decline with a time-varying dummy 
variable. The dummy variable equals one in years when 
there is a decline in the annual gross domestic product 
growth rate. Audretsch (1991) finds that the ten-year firm 
survival rates vary considerably across different industries. 
Thus, the hazard rate model also includes four industry- 
specific dummy variables. 


Population ecology variables. Population ecologists 
predict two contrasting effects of "density dependence" on 


firm hazard rates: legitimation and competition (Hannan 
and Carroll 1992; Hannan and Freeman 1989). Legitima- 
tion of a product innovation increases as the number of 
firms in the new product-market increases, thus resulting in 
the decline of hazard rates of all firms. However, there is a 
ceiling on the impact of the legitimacy process. At higher 
levels of firm density, legitimation reaches a saturation 
level, and the competitive effect of firm density dominates, 
resulting in higher hazard rates for all firms. Overall, the 
population ecology theory predicts a U-shaped relationship 
between firm density and hazard rate in new product- 
markets. Thus, the hazard rate model includes a firm den- 
sity and a firm density-squared term.^ 

Density dependence theory also predicts a persisting 
competitive effect of firm density at the time of a firm's 
entry into a product-market. Carroll and Hannan (1989) 
attribute this effect to two sources: resource scarcity and 
tight niche packing. They contend that intense competition 
at the time of market entry creates the condition of resource 
scarcity. With scarce resources, market entrants that find it 
difficult to move to full-scale operation face strong selec- 
tion pressures. Carroll and Hannan argue that another con- 
sequence of intense competition at the time of market entry 
is tight niche packing. Because market entrants seldom 
compete head-to-head with market incumbents, they are 
forced to pursue marginal target-market opportunities. Car- 
roll and Hannan (1989), Carroll and colleagues (1996), and 
Banbury and Mitchell (1995) provide empirical evidence in 
support of the adverse impact of firm density at the time of 
market entry on tbe probability of firm survival. 


Strategic variables. Because increasing pioneer lead 
time tends to strengthen first-mover advantages, we include 
pioneer lead time in the hazard rate model. Increasing pio- 
neer lead time should decrease a market pioneer's hazard 
rate. For early followers, increasing pioneer lead time can 
increase their hazard rates because pioneers gain strength 
with longer lead time. However, a short delay can help them 
survive because in the first year or two of the market's evo- 
lution, decreased market and technological uncertainty can 
yield substantial benefits. In summary, we predict a U- 
shaped relationship for early follower hazard rates and pio- 
neer lead time (Robinson and Min 2002). We use the nat- 
ural logarithm of pioneer lead time to adjust for diminishing 
marginal returns. 

We now present the full specification of the hazard 
rate's time-varying covariate function. For ease of presenta- 
tion, deleting the i subscripts in the right-hand side of the 
expression for $[X,(&)] simplifies Equation 5's covariate 
function. 


(14)  $[X,(t,)] = Exp[Bo + B, (really new product-market) 
+ B (market pioneer) 
+ B4 (really new product-market 
x market pioneer) 





4To adjust for differences across product-markets, we normalize 
firm density by dividing it by the average number of competitors 
over the ten-year period following the end of the pioneer’s 
monopoly. 


+ B4 (market pioneer x In pioneer lead time) 
+ B; (early follower x In pioneer lead time) 
+ Bg (early follower x In pioneer lead time?) 
+ B; (In firm size) + Bg incumbent) 

+ Bs (In calendar year of market inception) 
+ Bjo (economic decline,) 


+ ON, + ONE + O3Nz 


+h, Y Industry, ], 


s-l 


where N, is the normalized firm density at time t, Nf is the 
square of the normalized firm density at time t, and Nr is the 
normalized firm density at the time of firm i's entry into the 
market. 

In Equation 14, the really new product-market variable 
equals one for a really new product-market and zero for an 
incremental innovation product-market. The market pioneer 
dummy variable equals one for the first entrant and zero for 
an early follower. In Equation 14, the derivative of the logit 
of $[X,(t)] with respect to really new product-market is 
B, + B4 (market pioneer). Thus, the estimated value of D; + 
B4 tests H, (i.e., the effect of really new versus incremen- 
tally new products on the hazard rate of a market pioneer). 
If B, + B4 is positive, pioneers have a higher exit rate for 
really new versus incrementally new products. For Hy, if By 
is positive, early followers have a higher hazard rate in 
really new versus incrementally new product-markets. 

To test H; (i.e., the effect of order of entry on survival 
risk in really new product-markets), we need to consider the 
differential impact of the order of entry (i.e., market pioneer 
versus early follower) on the hazard rate in Equation 14. 
Subtracting the derivative of o[X,(t,)] with respect to early 
follower from the derivative of o[X,(t,)] with respect to the 
market pioneer, we find that to test H4, we need to consider 
the sign and significance of ([Bo + B4 + B4 In lead time] — 
[Bs In lead time + Bg In lead time?]). If this estimate is posi- 
tive and significant, there is support for H3,, indicating that 
the survival risk for market pioneers versus early followers 
is higher for really new products. Similarly, the estimate of 
(I5 + B4 In lead time] — [Bs In lead time + Be In lead time?]) 
tests H4. If this estimate is negative and significant, there is 
support for Ha4,, indicating that pioneers experience lower 
hazard rates for incremental innovations than early 
followers. 


Results 


Table 7 displays the hazard rate model results. We based 
statistical significance on conservative two-tailed tests. In 
H,, we predicted that market pioneers have a greater sur- 
vival risk in really new product-markets. At the bottom of 
Table 7, market pioneers experience significantly greater 
survival risk for really new versus incrementally new prod- 
ucts. Specifically, the survival risk of market pioneers is 
189% ([exp(1.06) — 1] x 100%) greater for really new prod- 
ucts than for incrementally new products. Recall that there 
is almost a threefold difference in the 12-year pioneer sur- 
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TABLE 7 


Hazard Rate Analysis (n = 750) 
aaae ee ee R 


t-Statistic 


Variable Estimate 
oe“ uL -—sue 
a 2651.40 .98 
r 1.35 2.43™ 
D2 .63 2.88*** 
D3 .89 . 3.80*** 
D4 .99 3.78*** 
D5 1.00 3.39*** 
D6 1.07 3.30*** 
D7 1.15 3.25*™ 
D8 1.40 3.76*** 
D9 1.60 4.06*** 
D10 1.65 3.86*** 
D11 1.67 3.61*** 
D12+ 1.80 3.54*** 
Really new product-market (B4) .20 1.20 
Market pioneer (Bo) —1.53 —2.56*** 
Heally new product-market x market ploneer (Bs) .86 2.90*** 
Market ploneer x In pioneer lead time (Ba) —.34 —1.50 
Early follower x In pioneer lead time Bs —2 23 —2.52** 
Early follower x (In ploneer lead time)? .81 2.36** 
Ln firm size (By) —.05 —1.47 
Incumbent (Ba) —.94 —2.31™ 
Ln calendar year of market inception (Bg) 1.31 5.14*** 
Economic decline, (B40) 46 4.40*** 
Nt (04) —.44 —2.79*** 
Nê (82) 6.83 2.83'** 
Ny (04) 1.05 4.19*** 
Chemlcals (y1) —.04 —.18 
Electronics (yo) .05 29 
Machinery (Y3) .09 53 
Instruments (Y4) 21 1.22 
Log-likellhood —2081.5 
Test of Hypotheses 
H4 (B4 + Bs) 1.06 4.18*** 
Hə (B4) .20 1.20 
H3 (Bo + B3 + B4 In lead time — Bs In lead time — Bg In lead time?) 43 1.77* 
H4 (Bo + B4 In lead time — Bs In lead time — Bg In lead time2) —.44 —2.23™ 
*p< 10. 
**p « O5. 
">p < 01. 


vival rate for really new versus incrementally new products 
(23% versus 61%). This indicates that for market pioneers, 
the challenge of resolving market and technological uncer- 
tainties in really new product-markets often outweighs the 
benefits of first-mover advantages. 

In H5, we predicted a greater survival risk for early fol- 
lowers in really new product-markets. In Table 7, the esti- 
mate has the expected sign, but it is not statistically signifi- 
cant. This result is consistent with the descriptive statistics: 
12-year early follower survival rates are 3896 for really new 
products versus 39% for incrementally new products. These 
results indicate that the typical 2- to 3-year delay in entering 
a really new product-market helps an early follower resolve 
key market and technological uncertainties. 

For really new products, in H3,, we predicted that mar- 
ket pioneers have a greater survival risk than early follow- 
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ers. At the bottom of Table 7, the difference is tested at the 
average pioneer lead time level of 1.98 years for the really 
new product sample. It is statistically significant at the 10% 
level. This difference is also managerially significant with a 
54% ([exp(.43) — 1] x 100%) greater survival risk than early 
followers. This indicates that in really new product-markets, 
the challenge of managing the market and technological 
uncertainties often outweigh the benefits of first-mover 
advantages. 

In contrast, Hy, reveals that for incremental innovations, 
market pioneers have a substantially higher chance of sur- 
vival than early followers. These market pioneers experi- 
ence a 36% (= [exp(—.44) —1] x 100%) lower survival risk 
than early followers, which is significant at the 5% level. 
This supports the prediction that for incrementally new 
products, market pioneers have a greater chance of survival. 


Turning now to the coefficient estimates of the control 
variables, we observe that they all have the expected signs. 
The impact of lead time on the pioneer survival chance has 
the expected negative sign and is statistically significant at 
the 13% level. This indicates that increasing pioneering lead 
time reduces a market pioneer’s survival risk. 

The results support a U-shaped relationship between an 
early follower’s hazard rate and a pioneer’s lead time. Thus, 
a short delay appears to help an early follower, but an addi- 
tional delay increases an early follower’s survival risk. 
Incumbency significantly decreases survival risk. Further- 
more, the calendar year of market inception has the 
expected positive sign and 1s statistically significant. This 
indicates increasing survival risk in markets started in 
recent years. There is also a significantly greater survival 
risk in years of economic decline. 

The estimates of 0; and 05 show the impact of firm den- 
sity on a firm’s hazard rate. As expected, the estimates for 
0, and 0, are negative and positive, respectively. Both esti- 
mates are statistically significant at the 196 level. This U- 
shaped relationship indicates that an initial increase 1n the 
number of firms increases its legitimation, thus lowering 
firm hazard rates. At higher levels of firm density, legitima- 
tion reaches a saturation level, and the competitive effect 
dominates with an associated adverse impact on firm sur- 
vival rates. 

As predicted, the estimate of 03 is negative and statisti- 
cally significant at the 1% level. This indicates that an 
increase in firm density at the time of market entry elevates 
a firm’s hazard rate. Population ecologists attribute this 
result to the liability of resource scarcity and to tight niche 
packing, which makes it difficult for an entrant to find prof- 
itable positioning opportunities. 


Robustness of Results 


Because theory does not provide clear guidance on the 
functional relationship between the degree of product inno- 
vation newness and survival, we consider two additional 
functional forms. The first maintains the dichotomy we 
used previously but considers a product really new only 
when every expert classifies it as really new. This differs 
from our measure, which is based on the majority of expert 
opinion. When every expert classifies a product as really 
new, our sample has 30 “really” really new products. 

The second functional form uses a continuous measure 
of product innovation newness. This follows the research of 
Chandy and Tellis (1998, 2000), whose measure of radical 
product innovations is continuous. Our continuous measure 
equals the difference between the expert’s rating and the 
expert’s average benchmark rating. We average these differ- 
ences across the experts who evaluated each of the 264 new 
markets. 

Table 8 provides the hypothesis-testing results for these 
alternative functional forms. Model 1 tests the dichotomy 
between “really” really new products and incremental inno- 
vations. (We exclude the 36 really new product-markets that 
do not have uniform expert support from the data analysis.) 
Model 2 uses the continuous measure of new product 
newness. 


Overall, the hypothesis-testing results are consistent 
with all three measures of product innovation newness. This 
is because the results consistently indicate that market pio- 
neers have higher hazard rates in markets that were pio- 
neered with a really new product but have lower hazard 
rates in markets that were pioneered with an incremental 
innovation. In addition, there is no material difference 
between early follower hazard rates in really new versus 
incremental innovation markets. 


Dynamic Survival Risk Insights 


Although our hazard rate model yields important survival 
tisk insights, it does not pinpoint how survival risk changes 
over time. We develop two types of dynamic insights. The 
empirical hazard rate equals the number of firms that failed 
in year t divided by the total number of firms that were at 
risk in year (t — 1). Because empirical hazard rates do not 
partial out the impact of additional explanatory variables or 
correct for unobserved heterogeneity, we also provide pre- 
dicted hazard rates. 

The predicted hazard rates are based on four hazard rate 
models. We estimate the four models for market pioneers 
(n = 66) and early followers (n = 146) in really new 
product-markets and for market pioneers (n — 198) and 
early followers (n = 340) in incrementally new markets. The 
full model specification includes the additional explanatory 
variables in Table 7. Because the incremental innovations. 
sample is larger, we used annual dummy variables for the 
first 12 years of commercialization versus 10 years for the 
smaller really new product-markets sample. 

For really new product-markets, Panels A and H in Fig- 
ure 1 display the empirical and predicted hazard rate plots, 
respectively. These figures illustrate how hazard rates for 
pioneers and early followers are initially similar but then 
diverge. The divergence arises after the sixth year of com- 
mercialization. Note that the tenth year includes year ten 
plus all subsequent years. This pattern can arise when it 
takes a few years for the market pioneer to learn whether its 
initial investments are on target. During the wait, the market 
pioneer will fight to survive, but after it is clear that its ini- 
tial investments are off target and cannot be salvaged, it is 
time to cut its losses and move on to better opportunities. 

For incrementally new product-markets, Panels À and B 
in Figure 2 show that market pioneers have the greatest sur- 
vival advantages in the first six to eight years of commer- 
cialization. This can arise when market pioneers benefit 
from their temporary monopoly in the first two or three 
years of commercialization. In addition, because first- 
mover advantages often dissipate over time (Robinson 
1988), it is not surprising that hazard rate differences nar- 
row with the passing of time. Even so, market pioneers have 
consistently lower hazard rates than early followers. 


Discussion 


When a market pioneer starts a new market with a really 
new product, survival is a major challenge. We base this 
conclusion on 66 really new product-markets, in which only 
23% of the market pioneers survived for at least 12 years. In 
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TABLE 8 


Robustness of Results 
Sa ee ee 


Model 2 (n = 750) 


Variable 


a 

r 
D2 
D3 
D4 
D5 
D6 
D7 
D8 
D9 
D10 
D11 
D12+ 


Innovativeness 

"Really" really new product-market 
Market ploneer 

Innovativeness x market pioneer , 
"Really" really new product-market x market pioneer 
Market ploneer x In pioneer lead time 
Early follower x In ploneer lead time 
Early follower x (In pioneer lead time)? 
Ln firm size 

Incumbent 

Ln calendar year of market inception 
Economic decline, 

Nt 

N2 

Nr 

Chemlcals 

Electronics 

Machinery 

Instruments 


Log-lIkelihood 


Model 1 (n = 645) 


3337.21  (.79) 1908.30 (1.10) 
1.22 (2.07)"* 1.49 (2.46)** 
64 (2.67) 61 (2.83) 
91 (3.41) 86 (3.76)*** 
1.43 (3.73)"** 95 (3.78)** 
1.07 (3.05)** 95 (3.39)'* 
1.15 (2.96)** 1.02 (3.31) 
1.40 (3.28)*** 1.09 (3.28)*** 
1.37 (2.93) 1.34 (8.85) 
1.68 (3.45)"* 1.58 (4.17)*** 
177 (3.36)"* 1.58 (3.96)** 
1.87  (3.30)** 1.60 (3.70)*** 
2.00 (8.25)'* 1.71 (3.67)*** 

02 (53) 
26 (1.18) 
—153 (-2.32)** —1.01 (—1.76)* 
30 (3.98)** 
1.10 (2.57)** 
—41 (-1.56) —26 (-1.18) 
—2.28 (-2.31)** -2.41 (-2.79)* 
81 (2.15)* 86 (2.58)'* 
—04 (-1.14) ~.03 (—93) 
~.30 (—1.81)* —34 (~2.43)** 
1.39 (4.45) 1.26 (5.386) 
45 (3.93)"* 46 (4.40)*** 
—55 (-2.89)*** —42 (-2.95)*** 
7.91 (2.54)** 6.85 (3.14)*** 
1.18 (4.16)** 1.04 (4.16)*** 
36 (1.17) —01 (—.06) 
01 — (.04) 05  (.29) 
07 (37) 310 (65) 
34 (1.62) 22 (1.32) 
—1780.9 —2077.6 
1.86 (8.58)** 32 (498)"* 
26 (1.18) 02  (.53) 
64 (1.73) 52 (2.40)** 
—47 (-2.04)* —43 (74.61)** 


Notes: To test H4 and H4 for Model 1 specrhication, we evaluated the derivative difference between market ploneers' and early followers' hazard 
rates with the average values of "innovativeness" from 66 really new product-markets (=.68) and 198 incrementally new product-markets 


(22.53), respectively. 


contrast, survival is much easier when the pioneer starts a 
new market with an incremental innovation. We base this 
conclusion on 198 incrementally new product-markets, in 
which 61% of the market pioneers survived for at least 12 
years. 

These results indicate that market pioneers must over- 
come greater resistance in building primary demand for 
really new products. Conversely, the market pioneer's tem- 
porary monopoly can help the pioneer strengthen its first- 
mover advantages for incrementally new products that rep- 
resent extensions of existing technologies and, in general, 
do not require consumer learning. 

In really new product-markets, is this high degree of 
uncertainty also fatal for early followers? In our sample, 
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most of the early followers trailed the market pioneer by 
three years or less. It is surprising that even with this rela- 
tively short delay, there is not a meaningful survival rate 
difference across realy new and incrementally new 
product-markets. For example, 12-year early follower sur- 
vival rates in really new product-markets are 3896 versus 
39% in incrementally new product-markets. This suggests 
that in really new product-markets, early followers can 
benefit from the prior efforts of the pioneer in building pri- 
mary demand. They can also benefit from the pioneer's 
experience in addressing customer needs and concerns, 
which can become public knowledge through professional 
and trade journals (Gort and Klepper 1982). Finally, early 
followers are more likely to leapfrog the pioneer in really 


FIGURE 1 
Hazard Rates In Really New Product-Markets 
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Notes: We use three-year moving averages except for the first year. 


For predicted hazard rates, we assume that all the firms do 
not face economic decline: The tenth year hazard rate in the 
predicted hazard rate function Indicates the hazard rate in 
the tenth year and thereafter. 


new product-markets that often undergo significant product 
improvements during the emergent stages (Christensen, 
Suarez, and Utterback 1998). 

In addition to highlighting the survival risks of pioneer- 
ing really new product-markets, these empirical results help 
resolve an ongoing controversy in the order-of-market-entry 
literature. The controversy is whether the first to market is 
the first to fail. In really new product-markets, the hazard 


FIGURE 2 
Hazard Rates In Incremental Innovation Product- 
Markets 
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rate modeling plots show that greater market pioneer sur- 
vival risks tend to surface after six or more years in the mar- 
ket. This can arise when it takes a few years for a market 
pioneer to determine whether its initial investments, which 
were made in the face of great un 

For incrementally new product- ns 
modeling plots show that market pioneers have Yow r sur- 

Iit I )5 Pe 


uo 
Market Ploneer and Early FO ewer Sun Sut val Risks | 29 


es 


4 M = n P 







vival risks than early followers. Although these survival risk 
differences narrow over time, market pioneers have consis- 
tently lower survival risk. This can be explained by pio- 
neers’ initial monopoly, which increases survival rates in 
the first two to three years of commercialization. Further- 
more, pioneer first-mover advantages are often strongest in 
the early years of a market’s evolution, and these advan- 
tages help protect the pioneer from outright failure. 

In conclusion, market pioneers are often the first to fail 
in really new product-markets. However, this is not true in 
incrementally new markets, in which market pioneers have 
consistently lower survival risks than early followers. 


Research Limitations and Further Research 


We now discuss four research limitations and the associated 
future research topics. First, our sample is composed only 
of industrial goods markets. Different results could surface 
in different types of markets. For example, incremental 
innovations in high-technology markets may not generate 
meaningful pioneer survival benefits relative to early fol- 
lowers. This is because a pioneer’s survival can be threat- 
ened by a higher degree of technological uncertainty. 

Second, our sample contains only 66 really new 
product-markets. Although this sample size compares favor- 
ably with other survival studies, a larger sample size would 
yield more precise insights into market pioneer survival 
risks and how these survival risks change over time. 

Third, our product innovation newness measure is 
applied decades after the new markets started. Major invest- 
ments in new products arise before product launch, perhaps 
even at the prototype development stage. Thus, during the 
product development process, managers need a different 
type of measure to distinguish really new from incremen- 
tally new products. 

Fourth, optimal entry timing requires greater insights 
into fast followers versus slow followers. For example, how 
long should an early follower wait before attempting to 
enter a new market? Although the Thomas Register data can 
address this research topic, it will require an expanded sam- 
ple size and an objective way to distinguish fast followers 
from slow followers. 


Managerial Implications 


Pioneering a new market is often considered a high-risk 
investment with high potential returns. If this is indeed the 
case, we expect that the highest returns will surface in the 
riskiest markets. However, at least from the standpoint of 
survival, this is not the case. This is because market pio- 
neers have the lowest survival rates in really new product- 
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markets, which carry the highest risk. Given this pattern of 
results, why would a firm attempt to pioneer a really new 
product-market? 

A key reason is that if lower survival rates are compen- 
sated with higher profits, pioneers might still be better off.5 
Higher profits can arise because really new product-markets 
tend to be larger with higher profit margins (Mansfield and 
Wagner 1975). This indicates that first-mover advantages 
can be especially strong for the few pioneering survivors. 
For example, according to the Thomas Register data, in 
1981, Hewlett-Packard, DatagraphiX, and IBM were the 
first laser printer entrants. Hewlett-Packard is a dominant 
survivor, and DatagraphiX exited in 1993. Thus, for really 
new products, though the risks of pioneering are high, so 
are the payoffs for the pioneers that do survive. 

Lower survival risks for early followers in really new 
product-markets suggest that early following can also be a 
profitable strategy. This is because the early followers not 
only face a lower survival risk than market pioneers but also 
can benefit from the market's larger size and higher profit 
margins. These results are consistent with the work of 
Olleros (1986, p. 11), who contends that breakthrough inno- 
vations "are bound to be rather crude, costly, and unreliable 
when they first appear in the market" This provides an 
opportunity for early followers to leapfrog pioneers in 
really new product-markets. 

The risk-and-return profile in incremental innovation 
markets is more encouraging for market pioneers. Incre- 
mental innovation markets tend to have relatively low risk 
and higher pioneer survival rates. Incremental innovation 
markets are also likely to yield sustainable market share 
advantages for the pioneer. This is because research on sur- 
viving entrants points to pioneer market share rewards (see, 
e.g., Kalyanaram, Robinson, and Urban's [1995] survey and 
VanderWerf and Mahon's [1997] meta-analysis). With the 
majority of new markets being pioneered by incremental 
innovations, it is likely that these market share reward stud- 
ies are dominated by incremental innovation markets. If so, 
pioneers of incremental innovation markets should benefit 
from higher survival rates and sustainable inarket share 
advantages. 





5Although profit data would yield valuable research insights, 
they are not provided in the Thomas Register. Profit data are avail- 
able in PIMS, but the business unit and product category are both 
disguised. With disguised product categories, really new and 
incrementally new product-markets cannot be identified. Thus, we 
do not know of any profit data that can be linked to really new ver- 
sus incrementally new product-markets. 


APPENDIX 
List of Product-Markets 





Really New Products (Year of Market Inception, Degree of Innovativeness) 


Chemical Products 
Bromide lead (1974, .1), carbon monoxide gas (1962, —.3), cellulose propionate (1963, .1), dielectric gas (1976, .1), gelling 
agents (1966, 1.3), indium compounds (1980, —.1), lanthanides (1985, .2), lithium acetate (1975, —.2), lithium niobate 
(1985, .5), magnesium perchlorate (1971, .0), methyl laurate (1964, —.1), microblocides (1982, .4), and nickel carbonyl 
(1973, —0.4) 


Materlals and Fabricated Metal 
Free-machining steel (1971, 1.3), parylene (1987, .6), polycarbonate film (1981, 1.2), polyimide fllm (1981, 1.7), shape 
memory alloys (1988, 1.6), solar energy reflector sheets (1982, 1.6), and solar energy storage rods (1982, 2.0) 


Machinery and Computer Equipments 
Blood gas monitors (1989, .6), clean room hoods (1967, .3), computer terminals (1971, 1.8), data display terminals (1971, 
1.4), digital comparators (1967, .6), graphite machinery (1986, —.2), handheld scanners (1990, .6), positioning machinery 
(1964, .6), protocol translators (1986, 1.4), sealing robots (1986, 1.1), steam turbogenerators (1982, .6), transaction 
processing systems (1987, 1.9), and ultrasonic drilling equipment (1971, 1.4) 


Electronic and Electrical Equipment 
Electroluminescent lighting (1982, .0), electron beam guns (1965, .7), lon beam guns (1965, 1.0), logic systems (1967, 1.9), 
memory cards (1991, .6), phase detectors (1970, .7), photoelectric timers (1972, 1.1), sillcon detectors (1973, 1.0), silicon 
rectifier converters (1968, .1), videodisc recorders (1985, 1.6), videotape recorders (1973, 1.6), and voice activated 
microphones (1992, 1.6) 

Measuring, Analyzing, and Controlling Equipments 
Blood pressure monitors (1986, —.3), blood pressure transducers (1988, .6), bomb detectors (1967, .0), bubble presence 
detectors (1962, — 1), computer screen projectors (1986, .6), ultraviolet detectors (1971, 1.1), ellipsometers (1987, .4), fiber 
optic attenuators (1985, .1), fiber optic controls (1986, .6), frequency translators (1986, .6), germanium lenses (1963, .5), 
microfilm reader/printers (1971, .2), microscopy cameras (1995, 1.1), navigating systems (1967, 1.0), osmometers (1964, 
.4), video microscopes (1986, .0), and videotape motion analysis systems (1983, 1.2) 


Other Industries 
Bar code computer software (1987, 1.0), EMC testing computer software (1995, .6), emulation computer software (1990, 
.6), and travel computer software (1995, .6) 





incrementally New Products (Year of Market Inception, Degree of Innovativeness) 


Chemical Products 
Chrome dyes (1973, —1.7), gallium oxide (1987, —1.7), laser printing Inks (1987, .1), lead silico fluoride (1976, —2.2), lead 
titanate (1964, —2.8), magneslum zirconate (1986, —2.8), manganese benzoate (1979, —1.3), manganous suffate (1964, 
—3.4), methyl mercaptan (1973, —3.2), molybdenum disillclde (1979, —2 2), nickel iodide (1975, —3.3), and nickel sulfamate 
(1969, —3.2) 


Materlals and Fabricated Metal 
Access, animal doors (1972, —3.7), adjustable nipples (1972, —2.4), air duct joints (1962, —3 3), alr logic valves (1986, —.3), 
asphalt concrete (1963, —2.3), bearing pads (1987, —3.4), boiler chimneys (1976, —3.7), carbide seals (1971, —1.4), 
chemical resistant seals (1982, —1.4), chromatography packing (1987, —3.4), conveyor belt lugs (1970, —1.3), diamond discs 
(1977, —1.0), disc dispensers (1979, —2.7), drain orifices (1975, —1.0), fiberglass reinforced plastic windows (1986, —2.4), 
fiberglass washers (1971, —1.4), frostproof valves (1967, —2.4), ground, wlre, plastic mouldings (1964, —2.3), high-pressure 
seals (1982, —1.4), hopper outlets (1964, —1.0), kiln vents (1986, —4.4), laminating rollers (1971, —1.4), laser mirrors (1966, 
—1.7), low-temperature belting (1966, —2.4), magnetlc ceramics (1967, —.4), master mirrors (1964, —2.4), micromlniature 
tubes (1987, —1.4), molten aluminum (1966, —2 0), mud, drilling lubricants (1981, —2.0), newsstands (1986, —3 7), nlobium 
foil (1969, —.3), noise mufflers (1973, —2.7), passivating equipment (1986, —1.4), plastic netting (1966, —2.3), plastic sheaths 
(1987, —2.4), plunger tips (1986, —3.4), prefabricated chlmneys (1976, —3.0), refractory foil (1967, —1), reprocessed oils 
(1981, —2.0), sapphire orifices (1972, —1.7), shock-absorbing dampers (1962, —2.0), silica curtains (1977, —.7), silo liners 
(1966, —.7), soil shifting screens (1964, —3.4), solar collector drain-down valves (1982, —3.4), stainless steel bumpers (1987, 
—2.4), steel foil (1967, —1.0), tantalum plates (1986, —1.4), transistor insulators (1962, —3), urethane gaskets (1971, -.7), 
vise grips (1972, —3.0), and zirconium copper (1962, —2.3) 


Machinery and Computer Equipments 
Adjustable gears (1968, —2.2), air movers (1975, —3.0), aluminum rollers (1987, —3.9), boat coolers (1976, —3.7), boat 
loaders (1973, —3.5), bottle sorters (1971, —2 9), bullet molds (1975, —3 4), canners coolers (1962, —3.3), carbide rings 
(1971, —3.4), chamfering heads (1973, —3.2), commutator turning lathes (1980, —2.2), conveyor belt guides (1970, —2.9), 
copy-turning machinery (1970, —2.0), cosmetic molds (1964, —2.7), cross-drilling (1965, —3.9), cutterhead dredges (1968, 
—2.7), dlamond-machining machinery (1975, —2.2), egg carton machinery (1964, —3.3), flash-drying mills (1973, —3.8), fluid 
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separators (1971, —2.9), fogging machinery (1975, —3.4), forced draft, fan mufflers (1966, —2.8), friction plates (1987, —3.4), 
furniture molds (1964, —3.4), gas liqueflers (1973, —2.0), glass crushers (1972, —3.4), Ice-dicing machinery (1976, —3.7), jig 
grinders (1973, —3.7), lateral collectors and distributors (1981, —3.9), log chippers (1972, —3.8), lowering devices (1972, 
—4.0), magnetic tape erasers (1962, —2.3), mask washers (1975, —3.9), mixing mills (1973, —3.9), molding, shoe counter 
machinery (1964, —3.5), motor assembly machinery (1975, —2.4), mud jacking (1973, —2.8), oil-fittering and transfer 
equipment (1982, —3.4), oil recovery systems (1975, —2.5), opening machinery (1973, —3.7), package sorters (1964, —2.4), 
pellet coolers (1962, —3.9), pickle-making machinery (1986, —4.0), poultry chillers (1962, —4.2), pug, agricultural chemicals 
mills (1972, —3.2), radio frequency identification scanners (1989, —2.4), roll-forming machinery (1970, —3.2), seaport tractors 
(1982, —2.9), shot sleeves (1986, —4.4), slgn rotators (1967, —2.4), snow-meiting mats (1966, —3.8), solld/liquid mixers 
(1966, —3.8), spherical grinding (1962, —2.2), steel belt coolers (1962, —2.7), thermal liquid coolers (1982, —3.2), 
thermoforming machinery (1970, —1.4), torque multipllers (1979, —1.9), tray loaders (1975, —4.0), tubing pumps (1982, 
—2.4), welding nozzles (1987, —2.6), wet-grinding mills (1964, —3.0), and wood-chipping machinery (1968, —2.5) 


Electronic and Electrical Equipment 

Bar code lamps (1985, —.1), boat lights (1987, —3.6), circuit board retainers (1987, —1 .4), D C. micro motors (1987, —1.0), 
explosion detectors (1964, —1.7), explosion-proof lamps (1969, —4.1), flber optic lamps (1991, —2.4), fiber optic links (1993, 
—2.4), humidity transmitters (1986, —.4), Insect control systems (1971, —2.7), magnifyIng lamps (1964, —1.4), pllferage 
detectors (1964, —1.9), railroad car lights (1987, —4.1), relay-switching matrices (1973, —3.7), RF communication systems 
(1992, —3.4), robot guarding systems (1987, —1.6), solid state contactors (1967, —2.1), synchronous generators (1972, 
—2.3), thermal cutoffs (1975, —2.0), V-band communication systems (1990, —3.4), video modems (1992, —3.4), video 
modulators (1992, —2.4), VSAT communication systems (1993, —3.4), and xenon lamps (1966, —1.4) 


Measuring, Analyzing, and Controlling Equipments 

Absorptometers (1966, —1.8), water analyzers (1966, —2.1), aperture cards (1962, —2.6), backer rods (1986, —1.4), blood- 
processing equipment (1968, —.4), diesel fuel meters (1972, —3.5), distance-measuring instruments (1972, —2.1), electric 
sensors (1964, —1.6), hour meters (1967, —3.0), instant photo identification systems (1973, —.9), liquid medical waste 
absorbents (1993, —3.4), medical electronic filters (1992, —4.4), medical equipment hydraulic power untts (1988, —3.4), 
medical waste disinfection systems (1992, —3.4), microscope objectives (1987, —1 .6), multitesters (1985, —1.7), optical 
sensors (1964, —.9), photographic transparency illuminators (1987, —1.0), pneumatic to electric converters (1987, —.9), 
pressure compensators (1962, —2.5), protective masks (1979, —3.5), radio annunclators (1989, —2.4), roundness testers 
(1972, —3.4), scanning tunneling microscopes (1992, -2.4), static electricity detectors (1973, -2.5), and web deflection 
Indicators (1985, —1.1) 


Other Industries 


Air-operated doors (1975, —2.5), battery pads (1986, —3.4), coin-operated lockers (1968, —3.2), credit card Imprinters (1962, 
—2.2), disposable mops (1975, —2.4), doorway sealers (1966, —5.2), doorway-sealing equipment (1964, —5.2), hardboard 
letters and figures (1975, —5.5), lactose (1985, —4.2), letter shop (1977, —5.2), mitre fold construction (1973, —3.5), mobile 
comfort stations (1973, —2.5), numerical control tapes (1971, —2.4), overall trimmings (1986, —3.4), release film (1981, 
—3.2), steel tire cord (1975, —4.8), synthetic fibre cordage (1966, —3.8), transfer tapes (1964, —2.4), web strap trimmings 
(1986, —3.4), flatbed vehicles (1986, —3.6), skin markers (1966, —4.8), and sign kits (1968, —5.2) 


Notes: The degree of Innovativeness equals the average expert rating minus the benchmark rating for really new products that are well known. 
Although most values are positive for really new products and negative for incrementally new products, there are a few exceptions, 
which arise because the classification Is based on the majority of expert opinion and not on the average expert rating. 
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Brands as Signals: A Cross-Country 
Validation Study 


This article tests how well the information economics view of brand equity explains consumer brand choice in 
countries that represent different cultural dimensions. In this empirical analysis, the authors use survey and exper- 
imental data on orange juice and personal computers collected from respondents in Brazil, Germany, India, Japan, 
Spain, Turkey, and the United States. The results provide strong empirical evidence across countries for the role of 
brands as signals of product positions. In addition, the positive effect of brand credibility on choice is greater for 
consumers who rate high on either collectivism or uncertainty avoidance. Credible brands provide more value to 
collectivist consumers because such consumers perceive these brands as being of higher quality (i.e., reinforcing 
group identity). Credible brands provide more value to high-uncertainty-avoidance consumers because such 
brands have lower perceived risk and information costs. 





bol, or design, or a combination of them, [that] is 
intended to identify the goods and services of one 
seller or a group of sellers and to differentiate them from 
those of competitors” (Kotler 1997, p. 443)—can poten- 
tially play many roles in consumer decision making and 
choice behavior. Underlying many of the brand effects is 
consumer uncertainty about product attributes and/or bene- 
fits. This uncertainty arises from the imperfect and asym- 
metric information state that characterizes most product 
markets (ie., firms are more informed about their own 
products than are consumers). 

The many roles that brands play in consumer decision 
making can materialize through multiple mechanisms, such 
as psychological (e.g., associative network memory), socio- 
logical (e.g., brand communities), and economic (e.g., 
brands as signals under uncertainty) processes (Keller 
2002). Prior work on brands as signals under uncertainty 
(e.g., Erdem and Swait 1998; Wernerfelt 1988) focuses on 
the influence of imperfect and asymmetric information on 
consumer choice processes. Specifically, Erdem and Swait 
(1998) suggest that the clarity and credibility of brands as 
signals of product positions increase perceived quality, 
decrease consumer perceived risk and information costs, 
and thus increase consumer expected utility. They also dis- 


fA ssim to be "a name, term, sign, sym- 
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cuss the antecedents to credibility and clarity. In this frame- 
Work, the key characteristic of a brand signal is its credibil- 
ity. "Brand credibility" (the credibility of a brand as a sig- 
nal) is defined as the believability of the product position 
information contained in a brand, which depends on the 
willingness and ability of firms to deliver what they 
promise. 

Most previous work using information economics 
frameworks to explain brand credibility and brand equity 
focuses only on the United States. The validity of existing 
frameworks in predicting consumer behavior in regard to 
brands as signals and outcomes outside the United States is 
not clear. With the growing trend in the globalization of 
marketing activities and the importance of brands in accom- 
plishing many of these activities, there is a pressing need to 
address explicitly whether existing frameworks are compat- 
ible with consumer behavior outside the United States and, 
if not, to delimit their scope of application. Despite this 
practical impetus and growing theoretical interest, relatively 
little work examines the use of signals or extrinsic cues to 
judge quality across countries and/or cultures. 

The purpose of this article is twofold. First, we test how 
well the information economics view of brand equity 
explains consumer brand choice and brand equity formation 
in different countries. Specifically, we assess the applicabil- 
ity of Erdem and Swait's (1998) framework across coun- 
tries representing different cultural dimensions.! Second, 
we highlight some differences across countries. More 
specifically, we explore differences in the way brands as 
signals operate across countries and link any such differ- 
ences to consumers' cultural orientation. We believe that 
three of Hofstede's (1980) cultural dimensions (collec- 
tivism/individualism, uncertainty avoidance, and power dis- 





‘Note that Erdem and Swait (1998) collected survey data on 
consumer self-reports of perceptions and purchase intentions and 
estimated a structural equation model (SEM) based on these 
survey data to assess the impact of brands as signals of expected 
utility, omitting price effects. In this article, using surveys, we col- 
lected consumers’ self-reports of their perceptions and experimen- 
tal data about their choices, and we estimated SEMs and brand 
choice models; for the latter, we also 1ncorporated price effects. 
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tance) may particularly affect the way consumers use and 
think about brands as signals. 

We conducted our analysis using survey and experimen- 
tal data on orange juice and personal computers (PCs). We 
collected data on consumer brand perceptions, choices, and 
cultural dimensions from respondents in Brazil, Germany, 
India, Japan, Spain, Turkey, and the United States. We 
chose these countries to represent a wide range of Hof- 
stede’s (1980) cultural dimensions. All respondents were 
undergraduates in business schools (except for about half of 
the Brazilian sample, who were undergraduate engineering 
students) and participated in exchange for course credit. 
The results we obtained provide strong empirical evidence 
for brands as signals of product positions in general and for 
Frdem and Swait’s (1998) framework in particular across 
countries that vary along the previously mentioned cultural 
dimensions. However, some interesting differences emerge 
in terms of both total credibility effects on choice and the 
mechanisms through which brand credibility effects on util- 
ity operate. 

We organize the rest of the article as follows: In the next 
section, we review the relevant literature and discuss the 
conceptual framework. We then develop several hypotheses 
to be tested and describe the empirical approach and the 
data collection. The last section outlines the implications of 
our findings and discusses avenues for further research. 


Literature Review and Conceptual 
Framework 


Brands Under Uncertainty 


The effect of uncertainty about brand attributes (in more 
general terms, a brand’s position in the attribute space) on 
consumer choice has been the focal point of economics 
approaches to brand effects. Uncertainty affects consumers’ 
perceptions of brand attributes, the variance of their 
attribute beliefs, and their information costs. The literature 
in both economics and marketing has focused on quality 
uncertainty, which is conceptualized and measured as a 
multidimensional construct (as a summary statistic of a 
brand’s position in the space for attributes that are imper- 
fectly observable). Aaker (1991) suggests that, all else 
being equal, strong brands are associated with higher per- 
ceived quality, which refers to mean beliefs about quality. 
Uncertainty about quality also implies that there is variance 
with consumer beliefs about quality for a given consumer 
(if consumers knew quality exactly or believed that they 
knew it exactly, the variance would be zero). This creates 
consumer perceived risk, and consumers tend to be risk 
averse in most contexts. Risk aversion affects consumers’ 
decision making in various ways (e.g., Rao and Bergen 
1992; Shimp and Bearden 1982). Risk-averse consumers 
feel threatened by ambiguous and uncertain product assess- 
ments. In this context, the literature suggests that brands 
can reduce perceived risk by becoming credible and consis- 
tent symbols of product quality (Erdem and Swait 1998; 
Montgomery and Wernerfelt 1992). 

When there is quality uncertainty, especially in the pres- 
ence of risk aversion, consumers tend to search for more 
information about product quality before making a decision 


(Money, Gilly, and Graham 1998; Shimp and Bearden 
1982).2 However, such information may often not be avail- 
able or may not be credible if it is available. If it is available 
and credible, consumers may not be willing to bear large 
information costs, because utility decreases with increasing 
information costs. Consequently, consumers may resort to 
extrinsic cues, such as price, advertising, or channel choice, 
to infer product quality (Zeithaml 1988). For example, brand 
as a cue of quality could be particularly useful in countries 
(e.g., India) in which the quality of the products may vary 
widely in a given product category (Maxwell 2001). 

The signaling literature in economics focuses on which 
conditions cause these cues to become credible signals (i.e., 
manipulatable attributes or activities of economic agents to 
convey information about their characteristics; see, e.g., 
Spence 1974). A firm can use various marketing-mix ele- 
ments to signal product quality (e.g., charging a high price, 
offering a certain warranty, distributing through certain 
channels). Each of these actions may or may not be credible 
depending on market conditions, including competitive 
conditions and consumer behavior. However, what sets 
brands apart from the individual marketing-mix elements as 
credible signals is that the former embody the cumulative 
effect of prior marketing-mix strategies and activities. The 
historical notion that credibility is based on the sum of prior 
behaviors has been referred to as "reputation" in the infor- 
mation economics literature (see Herbig and Milewicz 
1995). 

Credibility is broadly defined as the believability of an 
entity's intentions at a particular time. Credibility 1s posited 
to have two main components: expertise and trustworthi- 
ness (Rao and Ruekkert 1994; Wernerfelt 1988). Thus, 
brand credibility is the believability of the product informa- 
tion contained in a brand, which requires that consumers 
perceive the brand as having the ability (1.e., expertise) and 
willingness (1.e., trustworthiness) to deliver continuously 
what has been promised (brands can function as signals 
because if and when they do not deliver what is promised, 
their brand equity will erode). All else being equal, the 
credibility of a brand has been shown to be greater for 
brands with greater marketing-mix consistency over time 
and greater brand investments (Erdem and Swait 1998). 
Consistency refers to the degree of harmony and conver- 
gence among the marketing-mix elements and the stability 
of marketing-mix strategies and attribute levels over time. 
Brand investments are resources that firms spend on brands 
to (1) assure consumers that brand promises will be kept 
and (2) demonstrate long-term commitment to brands 
(Klein and Leffler 1981). Furthermore, it has also been 
shown that the clarity (i.e., lack of ambiguity) of the prod- 
uct information contained in a brand is an antecedent to 
brand credibility (Erdem and Swait 1998). 

As Aaker (1991) also suggests, higher perceived (or 
expected) quality, lower information costs, and lower risks 
associated with credible brands can increase consumer 


2Specifically, in cross-cultural contexts, consumers in cultures 
that rate high in collectivism and uncertainty avoidance have been 
shown to use personal information sources more often than those 
that rate low on these two cultural dimensions (Dawar, Parker, and 
Price 1996). 
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evaluations of brands. Using structural equation models 
(SEMs), Erdem and Swait (1998) show that expected utility 
increases with perceived quality and decreases with per- 
ceived risk and information costs; in turn, these are ante- 
ceded by brand credibility. 


Brand Effects Across Countries and Cultural 
Dimensions 


Few previous studies examine the use of signals or extrinsic 
cues to judge quality across countries and/or “cultures. 
Dawar and Parker (1994) find that there are few differences 
in the use of price to signify quality for a highly homoge- 
neous segment of consumers across both Western countries 
and Japan. McGowan and Sternquist (1998) compare 
Japanese and U.S. consumers in terms of their price-quality 
schema, prestige sensitivity, and value consciousness. The 
results indicate that price-inference behaviors may be mar- 
ket universal. However, these studies focus on comparing 
consumer price-quality perceptions across developed mar- 
kets. In inefficient markets, such as those of less-developed 
countries, consumers may believe in the price-quality rela- 
tionship to a lesser extent because price information is less 
credible (Zhou, Su, and Bao 2002). As a result, consumers 
in such markets may need to rely more on certain other sig- 
nals to infer quality because the usual product information 
is less available or less reliable. 

Prior research has also attempted to link brand choice to 
several cultural dimensions (e.g., Cray and Mallory 1998; 
Trompenaars and Hampden-Turner 1997; Voich 1995). Par- 
ticularly, Hofstede's (1980, 1984) influential work on cross- 
cultural value systems identifies three aspects of cultures 
that can be related to brand choice: collectivism/individual- 
ism, uncertainty avoidance, and power distance.? Although 
Hofstede conducted his research in organizational settings, 
the values he identifies have often been associated with con- 
sumer behavior, such as consumer tipping behavior (Lynn, 
Zinkhan, and Harris 1993), country-of-origin effects on 
product evaluations (Gurhan-Canli and Maheswaran 2000), 
value-attitude relationships (Gregory, Munch, and Peterson 
2002), persuasion effects (Aaker and Maheswaran 1997), 
consumer innovativeness (Steenkamp, Hofstede, and Wedel 
1999), and behavioral intention models (Lee and Green 
1991). 

The marketing literature has been particularly keen in 
examining the cultural dimensions of collectivism and indi- 
vidualism. The collectivism/individualism dimension 
relates brand usage to social motivations (Hofstede 1980). 
Cultures that are high in individualism tend to seek variety 
and hedonistic experiences, whereas cultures that are high 
in collectivism correlate more with conformity and group 
behavior. Roth (1995) finds support for the hypotheses that 


3Hofstede had a fourth dimension, masculinity, and later added 
time orientation as a fifth dimension. We chose not to focus on 
masculinity, because the scale he developed combines two sepa- 
rate (though related) components: (1) attitudes toward gender roles 
and (2) attitudes toward quality of life. No straightforward expec- 
tations can be generated about how these might affect brand credi- 
bility effects. We pretested the impact of time orientation. How- 
ever, there was not enough variation across countries on this 
dimension to study it further. 
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collectivist cultures consider brands that reinforce group 
membership and affiliation more attractive, whereas indi- 
vidualistic cultures favor brands that reinforce their inde- 
pendence and provide individual gratification. Furthermore, 
collectivist societies value consensus, which makes them 
loyal to the dominant brand (Robinson 1996). Research into 
the symbolic meaning of brands indicates that brand names 
are an important symbol of group identity in collectivistic 
societies (Johansson, Ronkainen, and Czinkota 1994). 
Finally, previous literature has found that the degree to 
which other people influence the consumer purchase deci- 
sion is a factor that may affect brand attitudes across 
national cultures. Particularly, Nicholls, Roslow, and Dub- 
lish (1997) show that a collectivist subculture (Hispanic 
consumers in the United States) tends to be more suscepti- 
ble to social influence than an individualistic subculture 
(their Anglo counterparts). For example, collectivist con- 
sumers considered the influence of friends in the choice of a 
favored brand very important. 

In this article, and given our focus on brands as signals, 
it is likely that the most relevant cultural mechanism is 
uncertainty avoidance. Uncertainty avoidance refers to “the 
extent to which people feel threatened by ambiguous situa- 
tions and create beliefs and institutions that try to avoid 
these" (Hofstede and Bond 1984, p. 418). This concept cap- 
tures the cultural pattern of seeking stability, predictability, 
and low risk rather than change and new experiences (Hof- 
stede 1984). Consequently, consumers in cultures that are 
high in uncertainty avoidance tend to favor credible and 
consistent brands (Dawar and Parker 1994; Robinson 1996). 

Another cultural dimension, power distance, describes 
the extent to which a culture fosters social inequality. Cul- 
tures that are high in power distance tend to emphasize 
prestige and wealth in shaping boundaries or vertical rela- 
tionships between social and economic classes, such as 
rich/poor and superiors/subordinates (Hofstede 1984; Inke- 
les 1960). If the aspects of cultural power distance are 
related to consumer needs, it appears that social brand 
images should be the best fit for high-power-distance cul- 
tures. Prior research confirms that consumers in high- 
power-distance cultures attach more importance to prod- 
ucts' brand names than consumers in low-power-distance 
cultures (Bristow and Asquith 1999; Robinson 1996; Roth 
1995). However, when power distance is high, consumers 
also tend to “distrust” authority (Hofstede 1980). In turn, 
greater distrust of authority may diminish brand credibility 
effects, counterbalancing the stronger positive impact of the 
relationship between social status and brand names. 


Applicability of the Information 
Economics Perspective to Brand 
Effects Across Countries and 
Cross-Cultural Differences in the 
Use of Brands as Signals 


Erdem and Swait (1998) show that brand credibility (1) 
increases perceived quality, (2) decreases perceived risk, 
and (3) decreases information costs, all three of which (4) 
increase consumer (expected) utility. We expect these main 
relationships to hold across countries but to be moderated 


by cultural dimensions. Uncertainty avoidance is the cul- 
tural dimension that is most clearly related to brand effects 
as signals. Consumers from high-uncertainty-avoidance cul- 
tures are more likely to be risk averse and should have 
lower tolerance for ambiguity. This type of consumer tends 
to perceive more risk from product consumption and there- 
fore adheres to historically tested patterns of behavior (Ver- 
hage, Yavas, and Green 1991). Thus, as we already indi- 
cated, consumers from high-uncertainty-avoidance cultures 
may use brands as signals more than consumers from low- 
uncertainty-avoidance cultures. In addition, perceived risk 
may be a relatively more important source of the brand sig- 
naling effect in such cultures. Specifically, we expect that 
the impact of brand credibility through perceived risk on 
consumer utility is greater in cultures with high uncertainty 
avoidance (Douglas and Craig 1997). However, we also 
expect that high uncertainty avoidance decreases sensitivity 
to information costs and increases willingness to collect 
information. Thus: 


H,: Brand credibility’s impact on consumer utility through 
perceived risk and information costs saved is larger than 
its impact through perceived quality under high uncer- 
tainty avoidance than under low uncertainty avoidance. 

H»: Brand credibility’s overall impact on consumer brand 
choice increases with uncertainty avoidance. 


The collectivist/individualist distinction is a more com- 
plex phenomenon in regard to its possible moderating 
effects on brand signals. Collectivist societies subordinate 
individual goals to the goals of a few large in-groups (Trian- 
dis and Gelfand 1998). In individualistic societies, the 
social fabric and group norms are much looser. Collectivism 
should result in more reliance on the brand choice of the 
group as a whole. Therefore, all else being equal, reliance 
on brands, especially in forming quality perceptions, may 
be more important in highly collectivist societies because 
the positive impact of credibility on perceived quality and 
the impact of perceived quality on choice could become ele- 
ments that enhance belongingness to the group. However, 
culture conditions the way consumers search for product 
information (Dawar, Parker, and Price 1996). Collectivist 
cultures tend to exhibit more information search than indi- 
vidualist cultures, but they rely more on interpersonal infor- 
mation exchange or word of mouth (Money, Gilly, and Gra- 
ham 1998), which reduces the impact of information costs. 
This may have two consequences: (1) a dampening of the 
total credibility effects and (2) brand credibility effects 
operating more strongly through perceived quality than 
through perceived risk and information costs. 

We expect brand credibility's role in enbancing quality 
perceptions and, thus, belongingness to the group to domi- 
nate the dampening impact of reliance on interpersonal 
information exchange. We believe that this will be the net 
effect because the importance of belonging to a group is the 
driver of behavior in collectivist societies. Thus: 

H4: Brand credibility's impact on consumer utility through 
perceived quality is more pronounced than its impact 
through perceived risk and information costs in high- 
collectivist cultures than in low-collectivist cultures. 


H4: Brand credibility's overall 1mpact on consumer brand 
choice increases with collectivism. 


High-power-distance cultures tend to be hierarchic in 
their interpersonal relationships and decision making, 
whereas low-power-distance cultures tend to be egalitarian. 
Power distance influences active opinion leadership behav- 
iors by causing consumers to distrust others (Dawar, Parker, 
and Price 1996). In addition, power distance should make 
consumers distrust brands. In this sense, enhancements on 
brand credibility and clarity could have a lower effect on 
perceived quality, perceived risk, and information costs 
saved. However, high-power-distance cultures tend to 
emphasize the importance of prestige and wealth in vertical 
relationships between social classes (Hofstede 1980). If this 
aspect of cultural power distance is related to consumer 
needs, it appears that brands as signals of positions that are 
related to social status should be most important in high- 
power-distance cultures because people are highly moti- 
vated by status and affiliation norms (Roth 1995). Thus, 
brands as signals may have a larger effect on perceived 
quality, perceived risk, and information costs saved in high- 
power-distance cultures than in low-power-distance cul- 
tures. Given the counteracting factors that determine the 
impact of power distance on brand credibility effects, power 
distance may not have an effect on how brand credibility 
affects choice, or if one of the two major mechanisms (dis- 
trust versus importance of authority) dominates, it may 
decrease or increase total brand credibility effects. Conse- 
quently, we do not propose any hypotheses about the direc- 
tionality of the moderating effects of power distance and 
Jeave the open empirical question for further analysis. 


Empirical Investigation of Brand 
Signaling Across Cultures 


Using a sample of U.S. university students, Erdem and 
Swait (1998) establish the validity of brand signaling 
(specifically, through brand credibility) as the basis for 
positive brand effects on product evaluations. We argue that 
certain basic relationships they postulate would be moder- 
ated across different cultures, particularly in terms of three 
important cross-cultural constructs: uncertainty avoidance, 
collectivism/individualism, and power distance. 

We collected comparable samples of respondents in 
seven different countries, which we specifically chose 
because they differ across these cultural constructs. Sekaran 
(1983) identifies two primary ways to achieve sample com- 
parability: drawing nationally representative samples or 
selecting matched samples on the basis of some set of char- 
acteristics of interest. Because of budgetary constraints that 
prevented representative sampling, we matched samples by 
recruiting undergraduate business student respondents in all 
countries. (Because of recruitment difficulties in Brazil, we 
used engineering undergraduates for about half of the sam- 
ple.) Although this is a potential limitation to the generaliz- 
ability of our results, Erdem and Swait (1998) first tested 
their model using undergraduate student samples, and there- 
fore we can best establish cross-cultural generalizability 
with samples that have similar characteristics. We also 
expect that differences in age, sociodemographics, relative 
income, and so forth, are small when we use undergraduate 
student samples, making possible a clearer attribution of 
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substantive theoretical differences among countries to cul- 
tural differences. Finally, the use of student samples consti- 
tutes a conservative test of the impact of cultural constructs. 
University students are one of the most globally homoge- 
neous segments of consumers, and thus, any cultural differ- 
ence may be amplified if data are collected by means of a 
more inclusive group of respondents. 

We made cross-country comparisons in two ways: First, 
we estimated and compared country-level SEMs; second, 
we used mixed multinomial logit (MNL) choice models 
based on an experimental design to test the role of brand 
and cultural constructs in the choice process. We selected 
two product classes for testing the robustness of brand sig- 
naling across cultures. We used orange juice to represent 
low-involvement, low-cost product categories, and we used 
PCs to represent high-involvement, high-price categories. 


The Structural Model 


Figure 1 presents our SEM. With the exception of the rela- 
tive price (RP) construct, this is essentially the same model 
as that which Erdem and Swait (1998) implement. We pos- 
tulate that brand credibility (Cr) increases perceived quality 
(PQ), which increases information costs saved (ICS), which 
decreases perceived risk (or, conversely, increases its com- 


plement, lower perceived risk [LPR]); in turn, we expect 
that these latter constructs increase product consideration 
and purchase (C&P). 

In addition to Erdem and Swait’s (1998) brand con- 
structs, the model includes an RP construct. We included this 
for two reasons: First, Erdem and Swait do not include price 
in any form in the SEM, and we believed that its addition 
would be an interesting extension of their work to control for 
purely economic effects on C&P. Second, rather than price 
itself, which would be country and time specific, we opted to 
use RP to capture the relative price positioning of the brands 
in each country. The RP positioning is likely to be a function 
of brand credibility because consumers are likely to perceive 
the firm as charging a premium if their brand is strong and 
visible in the market. This leads to our expectation that the 
path from Cr to RP will be positive. However, we expect that 
the effect of RP on C&P is negative: The greater the relative 
price of the brand, the less the brand should be considered 
and, thus, the less attractive it should be. 


The Measurement Model 


We used a total of 19 items (see Table 1) to implement the 
measurement model that supports the SEM in Figure 1. We 


FIGURE 1 
The SEM Relating Brand Credibility to Product Utility 
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TABLE 1 
Brand Constructs Measurement Model 





Construct Measurement Items SMC Juice SMC PC 
Cr 1. This brand delivers what It promises. 51 .56 
i 2. This brand's product claims are believable. 41 4 
3. Over time, my experiences with this brand have led me to expect it to 
keep its promises, no more and no less. .41 43 
4. This brand Is committed to delivering on its claims, no more and no less. .32 .36 
5. Thls brand has a name you can trust. .53 .69 
6. This brand has the ability to deliver what it promises. .54 .58 
PQ 7. The quality of thls brand is very high. .72 74 
8. In terms of overall quality, I'd rate this brand as a... 10 73 
ICS 9. Knowing what I’m going to get from this brand saves me time shopping around. 30 .36 
10. This brand glves me what ! want, which saves me time and effort trying to do better. .79 .75 
LPR 11. | need lots more information about this brand before I'd buy rt. (R) .39 45 
12. To figure out what this brand Is Ilke, l'd have to try it several times. (R) .45 .51 
13. | never know how good this brand will be before | buy it. (R) .34 .51 
HP 14. This brand Is more expensive than the average brand In this category .63 .69 
15. Compared to other brands, this brand Is more expensive .63 70 
C&P 16. | would never buy this brand. (R) 30 37 
17. | would seriously consider purchasing this brand. 49 53 
18. How likely would you be to purchase this brand? .80 74 
19. If you were to buy nine [product] at one tme, how many of each brand would you 
buy to reflect your relative preference for each brand (assume these are the 
only brands available)? .39 33 


Notes: We measured all items on nine-point "strongly disagree/strongly agree" scales, except Items 8 (nine-point “low quality/high quality” 
scale), 18 (nine-point *very unlikely/very likely" scale), and 19 (allocation of nine objects among brands to reflect preference). (R) after 
an tem indicates that we reversed it for Inclusion In the model SMC = squared multiple correlation, or the percentage of Item variance 
explained by the model, we based this number on the pooled sample across countries. 


refer the reader to the work of Erdem and Swait (1998) for 
the items of all constructs except RP, which we specifically 
formulated for this study. 

We established convergent and discriminant validity 
using Anderson and Gerbing's (1988) procedures. For con- 
vergent validity, pooling data across countries and product 
categories to yield a sample of 882 respondents and 5292 
brand evaluations (as the procedure requires), we found that 
the construct coefficients in all item equations are statisti- 
cally significant at the 95% level. This is reflected in Table 
1 by the appreciable squared multiple correlation levels for 
all items. In terms of discriminant validity, we found that 
the hypothesis that the correlation between every pair of 
constructs is equal to one (i.e., there is no discrimination 
between the constructs in question) is rejected for all pairs 
of constructs in Figure 1 at a level of 95% or higher. The 
smallest chi-squared statistic is 34.1 (d.f. = 1), and it occurs 
between Cr and ICS, for which the estimated correlation is 
.91 (SE = .014). Thus, we established both convergent and 
discriminant validity at the pooled level. 


Cultural Constructs 


We used the 13 items 1n Table 2 to characterize the four cul- 
tural constructs of interest: uncertainty avoidance, collec- 
tivism/individualism, and power distance (Hofstede 1980). 
We developed these items using the following procedure: 
We used Hofstede's (1980) original organizational values 
survey items and Triandis’s (1995) horizontal and vertical 
collectivism/individualism items, which we slightly modi- 
fied to improve their applicability to a general consumer 
, context. After we collected cultural identity data by country 


from our sample, we ran confirmatory factor analyses for 
each country to identify whether any item did not load high 
enough (our cutting point was factor loadings greater than 
.50) on a single factor. There were minimal discrepancies in 
the factor structure by country. Our final cultural construct 
specification excluded a total of 5 items from the original 
set (1 in the power-distance construct, 2 in the uncertainty- 
avoidance construct, and 2 in the collectivism/individualism 
construct) that did not load high overall on the specific cul- 
tural construct. This procedure ensured reasonable levels of 
scale reliability at both the overall level and country by 
country (all Cronbach's œ > .60). Table 2 shows the final 
factor loadings and reliability coefficients using data at the 
aggregate level. Henceforth, whenever we refer to these 
constructs in terms of inclusion in a statistical model, it 
should be understood that we operationalized the cultural 
constructs as the simple average of the respective item 
responses for each individual respondent. 


Data Collectlon Methodology 


The survey we administered to the respondents has several 
sections: (1) elicitation of the measurement model items 
(nine-point "strongly disagree/strongly agree" scale; see 
Table 1) for three brands in each of two product classes; (2) 
elicitation of the degree of confidence (seven-point "very 
low confidence/very high confidence" scale) in evaluating a 
new brand in each product class; (3) an association task 
linking (“yes/no” scale) product class to items describing 
familiarity, risks of purchasing, benefits offered, involve- 
ment levels, and so forth (Erdem, Swait, and Louviere 
2002); (4) a brand/price experimental choice task for each 
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TABLE 2 
Cultural Constructs 








Factor 
Construct Items Loadings 
UA (a = .692) 1. Security is an important concern in my life. 72 
2. Life is so uncertain that one must continuously be on the alert so as not to be 
caught at a disadvantage. .784 
3. It Is important to consider dissenting views when making personal and 
social decisions. .808 
C/l (a = .735) 4. | like sharing little things with my nelghbors. 537 
5. Being a unique Individual is important to me. (R) —.804 
6. Decislons reached in groups are better than those reached by single Individuals. .587 
7. | usually sacrifice my self-interest for the benefit of my group. .597 
8. I'd rather depend on myself than on others. (R) —.728 
9. it Is important to me to be useful to others. .758 
PD (a = .628) 10. One's boss is a person Just like oneself. (R) —.633 
11. Bosses are always Inaccessible and distant. 782 
12. The way to change a society is to make everyone equally powerful. (R) —.619 
13. Other people are a threat to one's power and cannot be trusted. 123 


Notes: We measured all items on nine-polnt "strongly disagree/strongly agree" scales. (R) after an item Indicates that we reversed it before use. 
We obtained the factor loadings from principal components single-factor analysis. PD = power distance, C/I = collectivism/Individualism, 


and UA z uncertainty avoidance 


product class, involving 17 choice sets that each elicited 
choice among three brands and a no-choice alternative (the 
experimental design used all brands present in every choice 
set, and the same four levels of price applied to all brands; 
this is the same design that Erdem, Swait, and Louviere 
[2002] use); (5) elicitation of the cultural constructs (nine- 
point "strongly disagree/strongly agree" scale; see Table 2); 
and (6) some simple demographics (age, sex, household 
income, citizenship status, and number of times traveled 
abroad). 

Originally developed and tested in U.S. English, the 
final survey instrument was then translated and back- 
translated into Spanish, Portuguese, Japanese, Turkish, and 
German for respondents in Spain, Brazil, Japan, Turkey, 
and Germany to ensure comparability across countries (for 
a description of cross-country experimental controls, see the 
Appendix). Respondents in the United States and India 
were given the origina] U.S. English survey, duly cus- 
tomized for local conditions (e.g., brands, prices). In the 
case of India, our recruiter examined the survey for under- 
standability. We selected these countries to represent a wide 
range of uncertainty-avoidance, collectivism/individualism, 
and power-distance dimensions (Hofstede 2003). Table 3 
shows Hofstede's (2003) original country ratings by dimen- 
sion for these countries. 

In each country, we selected three brands in each prod- 
uct class (orange juice and PCs) to represent different levels 
of market power and price/quality positioning.^ We required 
all brands selected to have high levels of consumer brand 
recognition. We also ensured that at least one local brand 
and one regional or global brand were always included. 
Thus, the brands used in the study could be local, regional, 
national, or multinational/global, but to the extent possible, 





4We also controlled for the potential of certain brands to com- 
municate more individualistic values and others to communicate 
social values. In addition, we believe that such discrepancy is not 
as prevalent in the two product categories under study. 
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TABLE 3 
Country Ratings by Cultural Dimenslon 
(Hofstede 2003) 
Country PD C/la UA 
Brazil 69 38 76 
Germany 35 67 65 
India 77 48 40 
Japan 54 46 92 
Spain 57 51 86 
Turkey 66 37 85 
United States 40 91 46 


aA higher score on the C/l dimension represents higher levels of 

Individuallsm. 

Notes: Hofstede analyzed a large database of IBM employees' 
value scores collected between 1967 and 1973 that covered 
more than 70 countries. In the editions of Hofstede's work 
since 2001, scores are listed for 74 countries and regions, 
partly based on replicatons and extensions of the IBM study 
on different international populations. To collect these scores, 
Hofstede used surveys that were designed for measunng 
culture-determined differences among matched samples of 
respondents from different countries and regions. It consists 
of 20 content questions and 6 demographic questlons. The 
answers are grouped into cultural value scores Value scores 
range from O to 100. Scores closer to 100 denote stronger 
agreement with a particular cultural value. PD = power dis- 
tance, C/I = collectivism/individuallsm, and UA = uncertainty 
avoldance. 


we selected them to cover the spectrum of market share 
(lower, average, higher), thus introducing greater potential 
heterogeneity in the brand constructs of interest. In the 
choice experiments, we used country- and product-specific 
price levels expressed in the local currency. We established 
average price levels across stores using price audits by 
country with the cooperation of our recruiters. 

We then programmed the surveys for execution on the 
Internet, which ensured method invariance. We provided 
recruiters at each institution of higher learning with a list of 


identifiers and corresponding passwords (to enhance survey 
security), which they provided to their students for volun- 
tary participation in the survey. In general, these recruiters 
were instructors of undergraduate business courses, except 
in Brazil, where we used both business and engineering 
undergraduates because of recruitment difficulties. The 
respondents have similar demographic profiles: They are 
young (average age varies from 18 years in India to 25 
years in Germany; 79% of all respondents are 22 years of 
age or younger) and mainly middle class. The gender split 
varies somewhat more than age: The male/female split is 
about even in the United States and India; there are approx- 
imately 40% males/60% females in Germany, Spain, and 
Turkey; and there are approximately 55% males/45% 
females in Brazil and Japan. We excluded foreign exchange 
students from the sample. Students were given course credit 
for survey completion and were urged to complete the sur- 
vey in one sitting (approximately 30 minutes); however, if 
the session was interrupted, respondents could subsequently 
resume the survey at the point of interruption. 

Data collection occurred from October 2002 to April 
2003, depending on the country. This extended data collec- 
tion period was necessary to obtain reasonable sample sizes 
for our statistical tests. Despite our best efforts, sample 
sizes by country vary significantly because of factors out- 
side our control: Brazil (126 respondents), Germany (57), 
India (161), Japan (114), Spain (155), Turkey (126), and the 
United States (143). 


Country-Specific SEMs 


Table 4 presents the standardized SEM coefficients for each 
country and product class and the averages of cultural con- 
structs for each country. We used the Amos 5 software pro- 
gram to obtain model estimates (Byrne 2001). Overall, the 
models display a modest, though acceptable, level of 
goodness-of-fit (for details, see Table 4). All statistically 
significant coefficients in both product classes are in the 
expected direction, with a single exception: The LPR — 
ICS path, which we expected to have a positive coefficient, 
has a negative and statistically significant effect in Japan for 
PCs. Otherwise, it seems that, in general, the model struc- 
ture from Figure 1 holds across the different countries. 
That is not to say that the countries are homogeneous in 
terms of the path coefficients. In the juice category, for 
example, the impact of Cr on C&P is through the PQ con- 
struct in Brazil and Turkey. In Spain, however, the ICS and 
LPR constructs are also mediators of the impact of Cr on 
C&P, indicating that Spanish respondents displayed a more 
nuanced response to the brand in the evaluation of product 
utility, considering the role of the brand in risk reduction 
and information acquisition enhancements to the utility of 
the brand. India, the United States, and, to a lesser extent, 
Germany are more similar to Spain than to Brazil and 
Turkey in overall structure in the juice category. Note that, 
in general, the RP construct is not statistically significant at 
the 95% significance level in this product category, except 
for in Spain. In the PC category, PQ is again the main medi- 
ator between Cr and C&P for Brazil, Germany, and Turkey. 
The impact of Cr on C&P in India, Japan, Spain, and the 
United States occurs through paths involving LPR and ICS. 


The total impact of Cr on C&P, as estimated through the 
SEMs in Table 4, is remarkably consistent across countries: 
Brazil (.806, .738 for juice and PC, respectively), Germany 
(.815, .732), India (.847, .831), Japan (.773, .66), Spain 
(.724, .712), Turkey (.862, .880), and the United States 
(.803, .855). The smallest impact for juice is found in Spain, 
and for PCs, it is found in Japan. In both categories, Turkey 
displays the largest impact of Cr on C&P (and India is a 
close second in both categories). As we pointed out previ- 
ously, however, these effects and their magnitudes come 
about through different pathways in Figure 1. 

To undertake a more systematic exploration of between- 
country differences in the path coefficients, we plotted the 
estimated coefficients as a function of cultural identity con- 
structs. The averages of cultural constructs (see bottom of 
Table 4) suggest that Germany (M = 5.31) scores lowest on 
uncertainty avoidance, whereas Brazil (M = 6.72) and Spain 
(M = 6.57) score highest. Among the seven countries we 
studied, Spain (M = 5.18) and Brazil (M = 5.22) scored 
highest on collectivism, whereas India (M = 4.55) and the 
United States (M = 4.69) scored lowest. Power distance 
seems to be lowest in Germany (M - 3.88) and highest in 
Japan (M = 4.99). These ratings are consistent with Hof- 
stede's (2003) original findings (see Table 3), except for 
India and Turkey, which seem to rate more moderately in 
our current data than in Hofstede's work. À possible reason 
for this divergence is that we collected these cultural values 
using university students, who are more likely to have an 
international mind-set, and therefore ratings may tend to 
equalize those of reference countries in the Western world, 
such as the United States and Europe. 

Figure 2 shows three plots that depict the coefficient 
estimates for the Cr —^ LPR — ICS — C&P paths as a 
function of uncertainty avoidance (UA) averages for each 
country. These paths are shown because they depict UA's 
moderating pattern relatively clearly: The higher the coun- 
try on the UA scale, the stronger the first two paths become. 
This implies that as UA increases, Cr lowers risk percep- 
tions more strongly and leads to greater ICS. This impact on 
risk is mitigated in part by the decreased reliance on the 
brand to save on information costs in decision making, as is 
evidenced by the downward trend in the strength of the 
ICS — C&P path as a function of increased UA (because 
UA decreases sensitivity to information costs). Further- 
more, the PQ — C&P path strength does not vary systemat- 
ically with UA. These combined results suggest that the 
moderating role of UA in regard to the impact of Cr on 
C&P is more through the LPR — ICS path than through the 
PQ path. Thus, we find evidence in support of H;. 

Figure 3 shows two plots of the coefficient estimates for 
the Cr — PQ — C&P paths as a function of the collec- 
tivism/individualism (C/I) construct. These paths depict a 
different moderating pattern for C/I. Although Cr — PQ 
path coefficients do not change significantly, there is a dis- 
cernible positive trend in PQ — C&P path coefficients with 
increasing collectivism (Spain is an exception to this pat- 
tern). Conversely, the sequence of paths Cr — LPR > 
ICS — C&P is also found to be a function of the C/I con- 
struct. The higher a country scores on the C/I scale (imply- 
ing a more collectivist culture), the larger are the standard- 
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FIGURE 2 
Plot of Cr — LPR — ICS > C&P SEM Coefficients as a Function of UA 
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ized coefficients for Cr — LPR and LPR — ICS, and the 
smaller is the ICS — C&P path coefficient, with an almost 
null overall effect. These results suggest that in collectivist 
countries, the impact of Cr is felt more directly through the 
PQ route than through the information acquisition route 
(LPR — ICS). This seems intuitive because it is reasonable 
that in collectivist cultures, sources of information, such as 
word of mouth, may be more easily available and more 


often used. Thus, we find evidence in support of Hi. 
Notably, the Jast cultural construct under study (power dis- 
tance [PD]) does not seem to explain any of the differences 
between path coefficients. 


Evidence from the Choice Modeling Results 


To analyze the overall impact of credibility on choice in 
various countries (while controlling for price effects) and to 


FIGURE 3 
Plot of Cr — PQ — C&P SEM Coefficients as a Function of C/l (Except Spain) 
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test H, and H4, we also estimated brand choice models. The 
final statistical results we present are based on mixed MNL 
(McFadden and Train 2000) specifications, using the exper- 
imental choice data we mentioned previously. These data 
resulted from eliciting 17 choices for each product class. We 
always presented three brands in each country and product 
category and a no-choice alternative; each brand was associ- 
ated with a four-level price variable, which had the same 
range for all brands in the same country and product class. 
Note that 17 choice sets are more than what is considered 
minimally necessary for identification of parameters, but we 
did this to maintain compatibility and comparability with 
prior studies (especially Erdem, Swait, and Louviere 2002). 

In this section, we examine whether the impact of cred- 
ibility on preference differs in magnitude between countries 
or, more specifically, whether the impact of brand credibil- 
ity on choice is moderated by the cultural constructs used in 
the previous two sections. To this end, we calibrate two 
mixed MNL models, one per product category, which allow 
for stochastic heterogeneity of brand and price effects at the 
country level. The utility function also includes the Cr con- 
struct and its interactions with UA, C/I, and PD. Precisely, 
the utility functions are given by the following expression:? 


(1) Uren = Oien + Bren X LN(p,-) + 0 X Crq + Y1 X UA, X Cr 
+% X (C/Dq X Cr + Y3 X PD, X Cien + £y, 


where i, c, and n are index brand, country, and respondent, 
respectively; Pic is a country-specific price for the good in 
question; €,,, is the brand-country-person stochastic utility, 
which we assume to be independent and identically Gumbel 
distributed across brands and respondents within country 
but is allowed to have different scale/variance by country 
(Swait and Louviere 1993); a, D, 6, and y are parameter 
vectors; and all other quantities have been previously 
defined. Note that the œ and B parameter vectors are sub- 
scripted by respondent, reflecting that these coefficients are 
allowed to have stochastic distributions (we provide more 
details with results in Table 5). 

We did not give the coefficient vectors for Cr and its 
cultural construct interactions stochastic distributions, and 
we assumed that they were comparable across countries. To 
support this specification, we established that the Cr con- 
struct has partial metric invariance across countries at the 
9596 confidence level by using Steenkamp and Baumgart- 
ner's (1998) test; this result holds for both product cate- 
gories. The Cr construct has six measurement items in total; 
we established partial metric invariance with, at most, two 
items being country specific. This test shows that the con- 
struct is homogeneous and robust across the countries tested 
(for orange juice: Y? = 26.0, d.f. = 18, p 2.10; for PCs: x2 = 





5Note that an alternative utility specification would include the 
successors of credibility only (ie., perceived quality, perceived 
risk, and information costs saved). Previous research has shown 
that credibility does not have a direct effect on utility, and its 
effects are mediated through these three antecedents (Erdem and 
Swait 2004). Therefore, there is no need to put credibility and the 
antecedents together in the utility function. We decided to put 
credibility rather than its antecedents in the utility to assess the 
total impact of Cr on choice and obtain a clearer and simpler pic- 
ture of total Cr effects across countries. 


34.4, d.f. = 23, p = .06). This also implies that comparing 


. credibility scores across countries is empirically supported 


in our data, so making the y and 6 parameters in Equation 1 
equal across countries is consistent with empirical evidence. 

The resulting product class models appear in Table 5. 
Note that there is strong support for Erdem and Swait’s 
(1998) contention that Cr increases product attractiveness/ 
utility: Across all countries, there is a positive and signifi- 
cant main effect of Cr in both product classes. This result is 
robust because we allow for substantial country- and 
respondent-level differences through brands, prices, and 
stochastic taste distributions at the country level. Note that 
because of scale differences (i.e., error variance differences) 
across product classes, it is not appropriate to compare juice 
and PC coefficients directly; only parameter ratios or elas- 
ticities should be compared. The coefficients of substantive 
interest here are the interactions between Cr and the cultural 
constructs (i.e., 5, Yi, ..., Y4). In both product classes, we 
found that PD has no substantive impact on the main effect 
of Cr on utility, because y3 and 'y4 are not significantly dif- 
ferent from zero at the 9596 significance level; this is in 
agreement with prior analyses we previously reported, but it 
is based on SEMs. Of greater interest is the consistent find- 
ing across both product classes that UA. increases the 
impact of Cr in both the juice and the PC categories (coeffi- 
cient y, in both categories), as we previously argued. The 
marginal impact of UA is relatively greater for PCs than for 
juice, as would seem reasonable, but it is statistically sig- 
nificant and increasing in both. Thus, we found evidence in 
support of H, in both the PC and the juice categories. 
Finally, increasing C/I also leads to the increased impor- 
tance of Cr but only in the juice category (see coefficients Y2 
in Table 5). A possible reason for this finding is that brands 
as symbols of collective behavior are more often embraced 
in high-frequency, low-involvement purchases for which 
decision time pressure is high and the financial risk of mak- 
ing a wrong choice is low. Thus, we find evidence in sup- 
port of Hy only in the juice category. Note that the impact of 
C/I is significantly greater than that of UA in the juice cate- 
gory. It is difficult to discern the relative importance of 
these two cultural constructs across product classes because 
they are subject to different error term scales. 


Discussion, Conclusions, and 
Further Research 


In this article, we investigated the effects of brand credibil- 
ity on consumer choice through perceived quality, perceived 
risk, and information costs saved across seven countries in 
Asia, Europe, and North and South America. These coun- 
tries have been shown to vary across the cultural dimen- 
sions of uncertainty avoidance, collectivism, and power dis- 
tance (Hofstede 1980). We found strong support for Erdem 
and Swait's (1998) framework of brand effects on choice 
under consumer uncertainty. More specifically, we found 
that brand credibility, the key construct in information 
economics-based approaches to brand equity, affected con- 
sumer choice in all the countries we studied. Although the 
paths through which credibility operated differed somewhat 
across countries, we found that the general framework of 
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Utllity 


Function® 


Brazil 
Brand 1 
Brand 2 
Brand 3 
—LN(price) 
LN(price) 


Germany 
Brand 1 
Brand 2 
Brand 3 
—LN(price) 
LN(price) 


India 
Brand 1 
Brand 2 
Brand 3 
—LN(price) 
LN(price) 


Japan 
Brand 1 
Brand 2 
Brand 3 
—LN(price) 
LN(price) 


Spain 
Brand 1 
Brand 2 
Brand 3 
—LN(price) 
LN(price) 


Turkey 
Brand 1 
Brand 2 
Brand 3 
—LN(price) 
LN(price) 


United States 
Brand 1 
Brand 2 
Brand 3 
-LN(pnce) 
LN(price) 


Cr (8) 

Cr x UA (y4) 
Cr x C/I (yo) 
Cr x PD (ya) 


LN(Scale Functlon) 
Brazil 
Germany 
India ` 
Japan 
Spain 
Turkey 
United States 
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TABLE 5 


Mixed MNL Parameter Estimates 


Julce 
(Asymptotic t-Statistics)^ 


—2.73 (-14.9)|3.643 (6.96) 
—571 (-2.78)|.789 (5.26) 


-3.241 (-16.61)|4.727 (6.42) 


1.784 (27.03)|.371 (10.73) 


~36.172 (-1.23)|1739.369 (.56) 


3.723 (.63)|234.583 (.55) 


—46.155 (—1.22)]641.065 (.55) 


3.815 (4.23)].996 (6.66) 


4.881 (13.28)|1.155 (6.18) 
6.106 (13.83)]1.673 (9.06) 
6.189 (13.61)|.117 (3.83) 

1.188 (21.93) |.067 (15.27) 


1.188 (21.93)].067 (15.27) 
26.54 (17.74)|.067 (—) 

26 066 (17.73)}1.766 (8.42) 

1.881 (36.1)].004 (12.84) 


7.482 (14.47)|.356 (6.87) 

6.863 (14.01)|2.934 (7.17) 

7.974 (15.12)|.356 (—) 
,886 (16.55)].049 (11.03) 


64.338 (31.79)|.204 (2.91) 
65.109 (31.79)|.886 (11.48) 
64.384 (31.6)|.114 (2.97) 
1.484 (54.48)|.262 (8.36) 


~.664 (-2.56)|1.607 (5.84) 
—1.188 (—5.47)|1.055 (4.95) 
—.398 (-1.47)|.409 (3.86) 

1.716 (23.2)].529 (10.46) 


438 (6.47) 
.009 (2.03) 
.064 (7.84) 
—002 (—41) 


~.193 (-3.12) 
—1.498 (-1.68) 
—125 (-2.49) 
238 (4.45) 
.087 (1.66) 
O° oe 
862 (12.32) 


PCs 
(Asymptotic t-Statistics)c 


44.569 (12.66)|.201 (5.47) 
44.223 (12.67)|.201 (—) 
43.246 (12.67)|1.718 (6.03) 


-5.875 (—12.87)|.049 (5.15) 


44.449 (57.78)]4.6 (6.27) 
46.086 (58.38)|2.706 (5.66) 
43.053 (57.02)|4.853 (3.72) 


—5.952 (-60.46)].053 (2.41) 


29.546 (10.78)|3.007 (5.03) 
31.852 (10.78)|.391 (5.21) 
31.211 (10.77)|.391 (—) 


-3.586 (—11.21)].09 (4.95) 


22.384 (12.85)|.122 (4.97) 
22.036 (12.84)|.432 (5.52) 
22.22 (12.87)|.234 (5.13) 


-3.439 (-13.25)|.037 (5.82) 


8.605 (6.96)|.118 (2.42) 
7.444 (6.27)|2.125 (4.18) 
8.736 (6.96)|.023 (1.01) 


—1.198 (-7.85)..056 (2.88) 


22.577 (56.94)|.095 (3.74) 
24.321 (54.12)].095 (—) 
24.499 (53.09)].36 (8.67) 


—2.581 (-37.19)|1.398 (13.06) 


13.147 (10.79)].467 (3.82) 
13.374 (10.73)|.374 (4.82) 
13.318 (10.65)|.149 (3.09) 


—2.148 (-11 92)|.045 (4.26) 


448 (5.4) 
025 (5.28) 
.000  (.03) 
~.006 (—1.02) 


581 (7.31) 
0 (—) 
5115-133) 
661 (8.7) 
164 (1.71) 
0. 0 
813 (9.76) 


TABLE 5 


Continued 
Utility Juice PCs 
Functiona (Asymptotic t-Statlstics)> (Asymptotic t-Statlstics)c 
Goodness-of-Fit 
Log-likelihood at O —20,786. 1 —20,758.4 
Log-likelihood at convergence —10,339.1 —11,985.0 
Hho-squared .5026 .4226 
Number of respondents 882 881 
Number of choices 14,994 14,974 





&Pnces are In currency of the respective country 

bHeterogenelty specification: Numbers represent independent normal distributions for brand effects and independent log normal distnbutions 
for prices. For brand effects, presentation format is mean|variance of distributlons, and t-statistics are in parentheses; for pnce effects, pre- 
sentation format is mean|varlance of the normal distribution of the natural logarithm of taste coefficients, and t-statistics are in parentheses To 
use the latter coefficients In the utilrty function, note that the taste coefficients will multiply the negative of price. We used the log normal dis- 
tnbutlon for this category's price coefficients because the specification led to higher goodness-of-fit than the normal distribution. 

cHeterogenerty specification: Numbers represent independent normal distributions for brand and price effects. Presentation format is mean|vari- 


ance of distributions, and t-statistics are In parentheses. 


Notes: We used 100 Halton quasi-random numbers for simulation estimator. 


brand credibility effects was empirically robust across the 
countries we studied. 

To explore whether cultural constructs that have been 
linked to consumer choice behavior and brand effects in 
previous research moderated these effects, we investigated 
whether uncertainty avoidance, collectivism, and power dis- 
tance explain any differences across the countries in regard 
to mechanisms through which brand credibility effects on 
consumer brand choice materialize. We expected that uncer- 
tainty avoidance and collectivism would increase total cred- 
ibility impacts, and our results confirmed this. Given coun- 
terbalancing considerations in the case of power distance, 
we deemed its net effect to be an empirical question; we did 
not find any moderating effects of this construct in either 
analysis (SEM or choice model). Note, however, that this 
does not mean that this construct does not influence brand 
credibility effects. It merely indicates that given that many 
factors are at play in confounding directions, these factors 
may cancel out, and it may not be possible to observe any 
total net effect. Indeed, a necessary area for further research 
is to map out these different factors more carefully by mea- 
suring and testing them separately, for example, with the 
inclusion of a new construct that denotes brand perceived 
prestige (Steenkamp, Batra, and Alden 2003). 

The results we obtained through the choice models esti- 
mated suggest that uncertainty avoidance amplifies the net 
effect of credibility on choice. We found that the effect was 
stronger for PCs than for juice, which we expected because 
PC is a more complex, higher-involvement product category 
than juice. We found that collectivism amplified the brand 
credibility effects as well, but only in juice. This might be 
due to the conspicuous consumption nature of juice, a more 
frequently purchased, low-ticket item for which a brand's 
value as a group identity symbol would be more important 
than for PCs, for which functional features prevail. 

The results we obtained through the SEMs suggest that 
as uncertainty avoidance increases, brand credibility lowers 
risk perceptions and increases information costs saved. 
However, it also decreases the reliance on the brand to save 
on information costs. The same pattern of influence is true 


as collectivist tendencies increase. We believe that uncer- 
tainty avoidance and collectivism decrease the impact of 
information costs on consumer choice because of the 
increased willingness of consumers to engage in active 
search (in case of uncertainty avoidance) and the ability to 
use other low-cost information sources, such as word of 
mouth (in the case of collectivism). Conversely, increasing 
collectivism leads to a stronger path for the effect of per- 
ceived quality on brand consideration and purchase. This is 
consistent with the notion that collectivist consumers per- 
ceive credible brands as better quality and prefer them 
because they reinforce their belongingness to the group. 
Overall, these findings suggest that collectivism moderates 
the impact of brand credibility on consumer utility (consid- 
eration and purchase) by increasing the effect of perceived 
quality, whereas uncertainty avoidance causes the credibil- 
ity impact to operate mainly through the reduction of per- 
ceived risk and information costs. 

These findings have important managerial implications. 
Returns on a company's efforts to establish brand credibil- 
ity depend on consumers' cultural values. Establishing a 
brand's credibility seems to be especially effective when 
consumers have more aversion to risky choices and, thus, 
consider brands a way to choose "the sure thing" and when 
consumers strongly need to reinforce their belongingness to 
a group through the consumption of symbols that reflect 
such identity. In addition, companies could take advantage 
of cultural differences in brand equity formation in many 
ways. For example, companies could work with product 
positionings that match cultural values (e.g., emphasizing 
brand attributes that invoke either risk reduction or social 
acceptance). Companies could also execute communication 
campaigns that reinforce a brand's ability to either reduce 
risk or generate group identification (e.g., showing brand 
consumption occasions in which such ability comes into 
play). These communication campaigns could also add ele- 
ments that represent a brand's trustworthiness (e.g., mes- 
sages with ideas such as “you can always count on brand 
X" for high-uncertainty-avoidance countries or "you will 
always be part of it" for high-collectivist countries). Finally, 
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brand extensions could be perceived as categories in which 
either risk or collective identity matters. 

There are many avenues of further research to pursue. 
Because the original body of empirical research in the 
United States is based on student samples and because tak- 
ing relatively homogeneous samples across countries makes 
it easier to control for factors such as age, income, educa- 
tion, and other items that may lead to between-country dif- 
ferences, our study constitutes a conservative analysis of 
cultural differences. We expect that in more heterogeneous 
consumer samples, the effects of cultural constructs would 
be amplified. Testing the model on more heterogeneous 
consumer samples and with other product categories will 
also enhance the generalizability of our results. Another 
avenue for further research would be to investigate the 


influence of cultural dimensions other than Hofstede’s 
(1980), which have been criticized on the basis of their 
descriptive nature and organizational focus, in the validity 
and applicability of the model. 

In this study, we focused on brand credibility and its 
antecedents. However, factors that underlie brand credibility 
(e.g., temporal consistency of brand positioning) and sub- 
dimensions of such factors may be moderated by cultural 
constructs as well. A better understanding of underlying 
dimensions of consistency, brand investment perceptions, 
and the like has important branding and brand management 
implications in international markets. Indeed, the success of 
globalization strategies depends on assumptions about the 
communalities in mechanisms by which brands affect con- 
sumers, regardless of their cultural identity. 


APPENDIX 
Cross-Country Experimental Controls 


Functional equivalence 
(Sekaran 1983) 


Survey instrumentalization 
(Steenkamp, Hofstede, 
and Wedel 1999) 


Conceptual equivalence 
(Adler 1983) 


To control for functional equivalence, our survey requested responses to similar problems in 
every culture. There was a comparable context or work setting in every country. 


We used back-translation to control for vocabulary equivalence, Idiomatic equivalence, and 
grammatical and syntactical equivalence In our survey. 


To control for conceptual equivalence, we pretested equivalence of the brands selected for the 
analysis and chose product categories that are purchased often, are equally available, and 


have the same function for consumers In each country of interest. 


Construct equivalence 
(Steenkamp and 
Baumgartner 1998) 


Response equivalence 
(Adler 1983) 


Subject pool equivalence 
(Alden, Steenkamp, and 
Batra 1991) 


Although we do not present partial metric Invariance because of space limitations, we 
established It using multlple-group SEMs for a subset of countries. This suggests cross- 
country comparabllity of path coefficients. 


We controlled for response equivalence by using uniform data collection procedures (Internet 
collectlon) and identical Instructions. 


We controlled for the equivalence in educational background in three ways: First, the 
universities we chose were all top-tier universities in thelr countries. Second, participants were 
all students in business or engineering. Third, we analyzed sample characteristics to verify the 


match In terms of age, sex, and social class. 


Currency equivalence 
(Roth et al. 1991) 


We controlled for currency equivalence by using organic prices in each country (l.e., current 
price averages across retail stores). 
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Emotional Branding and the 
Strategic Value of the Doppelganger 
Brand Image 


Emotional branding is widely heralded as a key to marketing success. However, little attention has been given to 
the risks posed by this strategy. This article argues that emotional-branding strategies are conducive to the emer- 
gence of a doppelganger brand image, which is defined as a family of disparaging images and meanings about a 
brand that circulate throughout popular culture. This article’s thesis is that a doppelganger brand image can under- 
mine the perceived authenticity of an emotional-branding story and, thus, the identity value that the brand provides 
to consumers. The authors discuss how the tenets of emotional branding paradoxically encourage the formation 
and propagation of doppelganger brand imagery. This article develops the counterintuitive proposition that rather 
than merely being a threat to be managed, a doppelganger brand image can actually benefit a brand by providing 
early warning signs that an emotional-branding story is beginning to lose its cultural resonance. This article 
demonstrates these ideas through an analysis of the doppelganger brand image that is beginning to haunt a 
paragon of emotional branding—Starbucks. The authors conclude with a discussion of how marketing managers 
can proactively use the insights gained by analyzing a doppelganger brand image. 


emerged as a highly influential brand management at conclusions and recommendations that are consistent 
paradigm (Gobe 2001; Zaltman 2003). Emotional with emotional-branding tenets (Aaker 1997; Aaker, 
branding is a consumer-centric, relational, and story-driven Fournier, and Brasel 2004; Brown, Kozinets, and Sherry 
approach to forging deep and enduring affective bonds 2003; Fournier 1998; Kozinets 2001; McAlexander, 
between consumers and brands (Roberts 2004). Proponents Schouten, and Koenig 2002; Muniz and O’Guinn 2001). 
of emotional branding proclaim that this high degree of Even brand management paradigms originally premised on 


O: the past decade, emotional branding has ings that anchor strong consumer—brand relationships arrive 


consumer passion is seldom, if ever, cultivated through cognitive theories of consumer knowledge formation now 
rational arguments about tangible benefits or even appeals incorporate many concepts and recommendations that are 
to symbolic benefits, such as heightened self-esteem or sta- characteristic of the emotional-branding paradigm (Keller 
tus (Gobe 2001). Rather, these potent consumer—brand link- 2003a, b). 
ages typically emerge when branding strategies use narra- However, a seldom-discussed risk of emotional- 
tives and tactics that demonstrate an empathetic branding strategies is their potential to expose firms to a 
understanding of customers' inspirations, aspirations, and particular type of cultural backlash, which we characterize 
life circumstances and that generate warm feelings of com- as the “doppelganger brand image”—that is, a family of 
munity among brand users (Atkin 2004; Cova and Cova disparaging images and stories about a brand that are circu- 
2002; Fournier 1998; Muniz and Schau 2005). Jated in popular culture by a loosely organized network of 
Among marketing practitioners, this relational, commu- consumers, antibrand activists, bloggers, and opinion lead- 
nal, participatory, sensory, and emotive view of consumer- ers in the news and entertainment media. Over time, these 
brand relationships is increasingly heralded as a central pil- brand-focused parodies and criticisms can coalesce into a 
lar of market differentiation and sustainable competitive coherent set of opposing meanings that plague brands that 
advantage (Atkin 2004; Gobe 2001; Lindstrom 2005; otherwise have attained competitive success through 
Roberts 2004). Similarly, consumer researchers investigat- emotional-branding strategies. In effect, a doppelgänger 
ing brand personalities, brand communities, and the mean- brand image culturally competes with the emotionally reso- 


nant meanings that a brand's management attempts to instill 
through its marketing activities. 
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form of brand management myopia that overlooks strategi- 
cally valuable cultural insights that can be gleaned from an 
examination of these defamatory representations of the 
brand. 

In this article, we argue that a doppelgänger brand 
image functions as a diagnostic tool that can reveal latent 
brand image problems that could eventually blossom into a 
full-fledged brand image crisis. More specifically, the 
analysis of a doppelganger brand image can (1) call atten- 
tion to cultural contradictions that could potentially under- 
mine the perceived authenticity of a firm’s emotional- 
branding strategy, (2) provide early warning signs that an 
emotional-branding story is beginning to lose its value as an 
authenticating narrative for consumers’ identity projects, 
and (3) offer insights into how an emotional-branding strat- 
egy can be reconfigured to fit better with changing cultural 
times and shifting consumer outlooks. We further argue that 
these diagnostic insights are unlikely to be generated by 
conventional brand management frameworks. 

Our theory-building orientation follows an emerging 
tradition of studies that use qualitative data and culturally 
based insights to develop new theoretical knowledge of the 
symbolic and experiential dimensions of consumer—brand 
relationships (e.g., Brown, Kozinets, and Sherry 2003; 
Fournier 1998; Holt 2002; McAlexander, Schouten, and 
Koenig 2002; Muniz and O’Guinn 2001). In pursuing this 
agenda, we begin with a brief review of the tenets of emo- 
tional branding. We propose that this widely used marketing 
strategy harbors a set of unintended (and largely unrecog- 
nized) risks that predispose brands to the formation of dop- 
pelganger brand images. Next, we show how such 
unintended consequences emerge in the context of Star- 
bucks’s emotional-branding strategy and analyze how these 
unflattering and disauthenticating meanings motivate con- 
sumer brand avoidance. We then discuss general managerial 
insights that can be garnered by monitoring and analyzing 
doppelgänger brand images and the broader strategic impli- 
cations of our research. 


The Tenets and Unintended 
Consequences of Emotional 
Branding 


Emotional branding emerged in the late 1990s as a self- 
proclaimed corrective to the shortcomings of the conven- 
tional benefit-driven approach to branding (see Gobe 2001). 
Specifically, emotional-branding gurus challenge the 
benefit-driven approach's fundamental claim that brand 
managers must establish a clear, consistent, and distinctive 
benefit position in the mind of the consumer (see Aaker 
1996; Ries and Trout 1981). Rather, proponents of 
emotional-branding argue that a benefit-driven positioning 
cannot provide an enduring competitive advantage, because 
it is readily emulated, particularly when the benefits are tied 
to technological and product design features. They further 
contend that straightforward benefit appeals are unlikely to 
break through the clutter of a saturated marketing environ- 
ment that a plethora of brands are fighting to claim distinc- 
tive associations. In this vein, emotional-branding propo- 
nents insist that the brand meanings that inspire consumer 


passion and abiding loyalty are seldom, if ever, attribute- 
oriented benefits (e.g., Gobe 2001; Mark and Pearson 2001; 
Roberts 2004). 

From an emotional-branding standpoint, brand strate- 
gists should focus on telling stories that inspire and capti- 
vate consumers. These stories must demonstrate a genuine 
understanding of consumers' lifestyles, dreams, and goals 
and compellingly represent how the brand can enrich their 
lives (Roberts 2004). As Gobe (2002, p. xxi) writes, “con- 
sumers today not only want to be romanced by the brands 
they choose to bring into their lives, they absolutely want to 
establish a multifaceted holistic relationship with that 
brand, and this means they expect the brand to play a posi- 
tive, proactive role in their lives." Thus, the strategic objec- 
tive of emotional branding is to forge strong and meaning- 
ful affective bonds with consumers and, in so doing, 
become part of their life stories, memories, and an impor- 
tant link in their social networks (Atkin 2004). Over the 
past few years, many well-known brands have adopted 
emotional-branding strategies, including Tide, Lexus, 
Apple, Nike, IBM, Cheerios, McDonald's, and Starbucks 
(Atkin 2004; Gobe 2001; Roberts 2004). 

The benefits of emotional-branding strategies are 
increasingly being endorsed by the academic marketing 
community. For example, in a pioneering scholarly article 
on consumer-brand connections, Fournier (1998) details 
the ways that consumers can experience brands as relation- 
ship partners that help them accomplish personal goals and 
resolve dilemmas in their everyday lives. Rather than being 
generated through skillful marketing, Fournier contends 
that these relational meanings (1.e., love/passion, self- 
connection, commitment, trust, and intimacy) emerge only 
when brands become integrated into consumers' lives and 
identity projects. Several subsequent studies (e.g., Brown, 
Kozinets, and Sherry 2003; Holt 2002; Kates 2004; Zaltman 
2003) have supported the identity significance of emotion- 
ally based consumer-brand relationships. 

The social dynamics that underlie the emotional- 
branding paradigm have been most fully explicated in 
scholarly studies of brand communities, such as the Harley 
Owners Group and Apple Macintosh enthusiasts (e.g., 
McAlexander, Schouten, and Koenig 2002; Muniz and 
O'Guinn 2001; Muniz and Schau 2005; O'Guinn and 
Muniz 2005). In these communities, brand meanings are 
collectively forged among a group of consumers. These 
shared meanings enable consumers from diverse walks of 
life and different geographic locales to experience deeply 
satisfying feelings of community and solidarity. It is diffi- 
cult to find a more striking example of "loyalty beyond rea- 
son" (Roberts 2004) than the community of Apple Newton 
devotees, who have religiously stuck by a brand long aban- 
doned by its manufacturer and who go to arduous extremes 
to continue using this increasingly antiquated technology 
(Muniz and Schau 2005). 

At present, the conceptual divide between emotional- 
branding and more traditional benefit-driven branding 
strategies is rapidly narrowing as emotional-branding prin- 
ciples are being integrated into the benefit-driven brand 
management paradigm. For example, Keller (2003a, p. 596) 
commends Zaltman's metaphor elicitation technique, 
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Fournier’s brand relationship quality model, and ethno- 
graphic research on brand communities for highlighting 
“intangible aspects of brand knowledge not related to the 
actual physical product or service specifications per se.” 
The pinnacle of Keller’s (2003b, pp. 92—93) customer-based 
brand equity pyramid is “brand resonance,” which refers to 
“the ultimate relationship and level of attachment that a 
consumer has with a brand.” 


Emotional Branding and the Contested 
Cocreation of Brand Meaning 


The tenets of emotional branding are perfectly aligned with 
the postmodern view that brand meanings are not controlled 
by managers but rather are cocreated through ongoing inter- 
actions among their users (Cova and Cova 2002; Fournier 
1998; Muniz and O’Guinn 2001). In the emotional- 
branding literature, consumers are commonly portrayed as 
enthusiastic partners who engage in intimate dialogues with 
other brand users and brand managers to create mutually 
beneficial, identity-enhancing, community-building, and 
loyalty-sustaining meanings (Atkin 2004; Gobe 2001; 
Roberts 2004). The Apple Macintosh and Harley-Davidson 
brand communities exemplify this model of shared brand 
ownership. By participating in these collaborative dia- 
logues, brand managers can gain valuable insights into 
product design improvements and innovations, new product 
development opportunities, and resonant meanings that can 
be leveraged in advertising campaigns (Lindstrom 2005; 
Roberts 2004). In addition, proponents of emotional brand- 
ing argue that these passionate consumers/owners will often 
act as brand missionaries, promoting the brand through 
their own invocative, personalized brand stories (Gladwell 
2000; McAlexander, Schouten, and Koenig 2002; Muniz 
and O’Guinn 2001). 

Yet this process of brand-meaning cocreation harbors 
some infrequently discussed but consequential risks. When 
a brand is viewed as transgressing or abandoning its 
emotional-branding promises, formerly loyal customers can 
create a significant backlash. This pernicious consequence 
of passionate consumer-brand relationships is vividly illus- 
trated by the travails of Snapple, which was brought to the 
brink of collapse by a widespread rebellion among its most 
loyal consumers—energized by the spurned former 
spokesman Howard Stern, who angrily rechristened the 
brand "Crapple"—after its new corporate owner, Quaker, 
attempted to mainstream the brand (Deighton 2002; Holt 
2003). 

In addition to estranged customers, the media can also 
circulate brand meanings that are maligning, worrisome, or 
opprobrious. For example, as Martha Stewart gained 
celebrity status, she also became the target of an escalating 
litany of scandalous exposés, unflattering media construc- 
tions, and pop culture satires, all of which generated serious 
brand management headaches for Martha Stewart Living 
Omnimedia, even before the ImClone scandal (Fournier and 
Herman 2004). In these alternative cultural constructions, a 
brand may be used as fodder for humorous parody, caution- 
ary tales about latent lifestyle dangers (e.g., McDonald’s 
vilification in the muckraking documentary Super Size Me), 
or derisive critiques of corporate capitalism (e.g., Nike as 
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the poster child for sweatshop labor). These uncharitable 
brand meanings and images can quickly spread through 
multiple channels, such as the news media, entertainment 
media, e-mail networks, and bloggers (see Klein 1999; Lasn 
1999). Consider Procter & Gamble's initial introduction of 
the fat substitute Olestra, which was effectively sunk by the 
deadly combination of sensationalized press reports about 
unpleasant gastrointestinal side effects and the incessant 
lampooning of this supposed side effect by popular comedi- 
ans, such as Jay Leno (see Gladwell 2001; Kozinets 2004). 

Antibrand activists constitute another vocal group of 
adversarial brand-meaning creators (see Kozinets and Han- 
delman 2004). These activists, such as Adbusters 
(www.adbusters.com), are often well-organized entities that 
try to raise consumer consciousness about what they deem 
to be the excesses and exploitations of corporate capitalism. 
When these activists are successful in garnering attention 
for their antibrand campaigns, a brand’s image can take a 
drastic turn. For example, in the mid-1990s, the cosmetic 
retailer The Body Shop experienced a public relations crisis 
and lowering of its investment ratings when the popular 
press brought attention to activists’ claims that the com- 
pany’s socially responsible identity was more hype than 
reality (Atkin 2004; Entine 2003). In effect, these activists 
were successful in creating a doppelgünger brand image 
that directly competed with the corporate brand image 
advanced by The Body Shop. A more recent example of 
potent antibrand activism is offered by Cary and Van Neis- 
tat’s viral video “iPod’s Dirty Secret,” which portrays Apple 
as intentionally exploiting customers by not offering a 
replacement battery for this widely popular product. Their 
viral video spread like wildfire across the Internet, and the 
story was picked up by several mainstream news organiza- 
tions. The resulting storm of controversy led Apple to offer 
a replacement battery and, thereby, dramatically alter its 
marketing strategy (Stuever 2003). 

Antibrand activism comes in many forms, including 
boycotts, demonstrations, organized public relations and 
informational campaigns, and what has come to be known 
as “culture jamming” (Kozinets and Handelman 2004; Lasn 
1999). Culture jamming refers to a practice of using adver- 
tising tools and techniques subversively to cast a critical 
and often-demonizing light on marketing practices, such as 
Adbusters’ “Joe Chemo” parodies of R.J. Reynolds’ contro- 
versial Joe Camel campaign and their spoof advertisements 
featuring jarring juxtapositions between Nike’s inspira- 
tional advertising rhetoric and stark images of sweatshop 
laborers (http://www.adbusters.org/spoofads/). Culture jam- 
mers tend to target successful iconic brands, such as Nike or 
Starbucks, because these brands are considered corporate 
capitalism’s most valued (and vulnerable) symbolic assets 
(Holt 2002; Klein 1999). 

Adbusters and like-minded activists characterize them- 
selves as antimarketers who use guerilla tactics to “uncool” 
iconic brands (Lasn 1999) or, in our terminology, to create 
doppelgänger brand images that resonate with consumers. 
As the Internet and e-mail have become ubiquitous aspects 
of everyday life and as digital-imaging software gains wide- 
spread diffusion, the number of brand parodies and biting 
culture-jamming satires that circulate in cyberspace has 


mushroomed and are now being recognized as a potentially 
serious brand image threat (Earle 2002; Walker 2003). 
Similar to a virulent virus, these alternative, often- 
unflattering constructions of a brand can move through pop- 
ular culture with lightening speed. 

After a doppelgiinger brand image is successfully dif- 
fused into popular culture, it presents a competing set of 
meanings that have the potential to influence consumer 
beliefs and behavior. We suggest that a doppelgánger brand 
image can provoke a sense of incongruity between the inti- 
mate and inspirational tenor of an emotional-branding strat- 
egy and the profit-driven motives of its corporate sponsor. 
Under such conditions, the authenticity of an emotional- 
branding story can become suspect, and the identity value 
that the brand offers to consumers can become greatly 
reduced (Holt 2002). 

The literature on the psychological and cultural dimen- 
sions of authenticity is extensive (for more comprehensive 
reviews, see Campbell 1987; Cross 2000; Grayson and 
Martinec 2004; MacCannell 1989). A recurrent conclusion 
of these investigations is that the symbolic divide between 
the authentic and the inauthentic is a cultural construction 
rather than an inherent property of objects or actions. Sev- 
eral influential scholars have also noted an enduring cul- 
tural tension between the ideal of authenticity and popular 
conceptions of commercialism, as evinced by conventional 
laments about artists who compromise their integrity by 
“selling out’ (Lears 1994). Underlying this contrast 
between the authentic and the commercial is a quasi- 
religious belief that certain spheres of human existence 
should remain sacred and distinct from the profane spheres 
of commerce and profit seeking (Belk, Wallendorf, and 
Sherry 1989). 

The significance that consumers attribute to this 
ephemeral ideal of authenticity has been documented in 
several studies (e.g., Arnould and Price 1993; Fournier 
1998; Grayson and Martinec 2004; Kozinets 2001; 
Pefíaloza 2000; Price and Arnould 1999; Thompson and 
Tambyah 1999). Brands that successfully shroud them- 
selves in the cloak of authenticity are able to convey desir- 
able consumer meanings of inner directedness, lack of pre- 
tense, and genuine commitment to brand-related activities 
(see Lewis and Bridger 2000). In pursuit of the authenticity 
grail, leading brands, such Miller, Burger King, Adidas, and 
Coke, now routinely use stealth marketing campaigns that 
are designed to give their brands a more authentic persona 
through tactics such as the co-option of urban vernaculars 
and styles and the seamless integration of their brands into 
the worlds of art and entertainment (Frank 1997; Holt 
2002). 

However, these tactics are continuously monitored and 
widely disparaged by antibrand activists as another insidi- 
ous tactic used by Machiavellian marketers. In a consumer 
culture marked by pervasive skepticism toward marketing 
and advertising practices (Walker 2003; Wipperfürth 2005), 
the authenticity of a brand can be extremely difficult for 
brand managers to secure and disconcertingly easy for 
antibrand activists to usurp. When activists propagate a 
doppelgünger brand image, they are expressing a stinging 
rebuke to the idea that a corporate-sponsored brand could 


be a sincere relationship partner or offer genuine emotional 
sustenance. In the following section, we highlight the 
emerging doppelgiinger brand imagery that is currently 
challenging the authenticity of Starbucks’s emotional- 
branding claims. 


Attacks on Starbucks’s Perceived 
Authenticity 


Currently, few brands are riding higher than Starbucks. 
From its humble origins as a single coffee shop in Seattle a 
mere 20 years ago, Starbucks has climbed the Fortune 500 
ladder; 1t has approximately 7000 stores worldwide and a 
projected total of 10,000 by the end of 2005 (www. 
starbucks.com). Starbucks’s revenue has climbed annually 
at an average of 20% per year, and it generated more than 
$4 billion in gross revenue and nearly $300 million in profit 
during 2003 (Starbucks Corporation 2003, p. 12), though 
the chain’s same-store sales growth began to slow during 
2004 (Ouchi 2004). However, these impressive financial 
numbers do not fully tell the story of Starbucks’s transfor- 
mative role in popular culture. Starbucks has sparked a dra- 
matic resurgence in coffee consumption in the United States 
and has changed the leisure practices and daily routines for 
a broad swath of consumers. The aesthetic look, feel, and 
sound of Starbucks has influenced the design of retail ser- 
vicescapes in general (Postrel 2003). Thus, Starbucks has 
become an important cultural icon, and the “Starbucks revo- 
lution” has, in large part, created the burgeoning market for 
coffee shops and premium-priced gourmet coffee beans 
(Helliker and Leung 2002). 

Given its phenomenal rise, it is no surprise that Star- 
bucks is widely heralded as a shining exemplar of emo- 
tional branding and successful brand positioning (e.g., 
Gobe 2002; Keller 2000; Lewis and Bridger 2000; Roberts 
2004; Schmitt 1999; Schmitt and Simonson 1996; Shields 
2002). However, not everyone shares this unbridled enthusi- 
asm for the brand. A Web search reveals a staggering num- 
ber of anti-Starbucks Web sites that present extensively 
elaborated criticisms of its business practices, scathing dia- 
tribes, and manifold culture-jamming ploys. One particular 
Web posting from an antibrand activist organization in the 
United Kingdom illustrates the spirit of these anti-Starbucks 
sentiments (www2.spacehijackers.org/starbucks): 


Starbucks make us feel ill, [w]ith their carefully chosen, 
subtle (but not too bland) colour scheme. They try to sug- 
gest some kind of ethnic, liberal charm. Even the back- 
ground music has been Starbucked: watered-down, instru- 
mental versions of Hendrix; after all, we wouldn't want to 
offend. The quirky living room sofas and tables are not so 
quirky when you realize they are identical to the other 
thousands of stores, [a]nd those thousands of stores, like a 
plague infesting our highstreets, are a McDonald's for the 
new media generation.. . Most of all, we dislike their ver- 
minlike spread, subsidizing new stores in order to outprice 
and close down local rivals, before turning entire towns 
and cities that horrid green, [b]landing out any local cul- 
ture and diversity worldwide. 


Culture jammers routinely spread anti-Starbucks mean- 
ings through cyberspace, using attention-grabbing blends of 
narrative, satirical images, and animation. One animated 
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Web parody belittles the pretentiousness of the counter talk 
that unfolds when a Starbucks’ customer orders a drink. 
Behind the barista is a sign proclaiming “We proudly propa- 
gandize Star-Schmucks coffee” and listing ridiculously high 
prices for standard coffee shop fare. When a trendily 
dressed customer orders a “vente, white chocolate, mocha 
thingy,” a foul-mouthed cartoon chipmunk (imagine Alvin 
crossed with Chris Rock) pops up and mercilessly ridicules 
Starbucks’s latte-lingo pretensions and everything else 
about its customers and marketing tactics (www. 
illwillpress.com/sml.html). A more scathing viral video 
likens Starbucks to a Nazi-like empire, steadily and ever- 
more aggressively pursuing world domination (www. 
albinoblacksheep.com/flash/sbux.php). 

Through this gamut of critical parodies, anti-Starbucks 
activists use humor and satire to assail this brand for many 
perceived misdeeds, including crushing local competition, 
degrading the environment, exploiting third-world workers, 
homogenizing local cultures, and perpetuating the worst 
excesses of status-conscious consumerism, as evinced by 
the cultural-jamming takes on its iconic mermaid logo (see 
Figure 1). Although these brand disparagements may be sta- 
tistically and factually contestable, they put meanings into 
cultural circulation that challenge the authenticity of Star- 
bucks’s emotional-branding promises. 

Of course, most consumers are not antibrand activists. 
However, these culture-jamming criticisms regularly enter 
into mainstream cultural discourse, such as media satires of 
Starbucks’s breakneck expansion, as exemplified by the 
Onion’s (a satirical newspaper) mock headline “New Star- 


bucks Opens in Rest Room of Existing Starbucks” that has 
been widely circulated on the Web or the lampooning car- 
toon presented in Figure 2. Although more sedate in tone, 
these parodies nonetheless reinforce the doppelgänger 
image that Starbucks is little more than a profit-driven, mar- 
keting affectation and that its customers are either dupes or 
shameless trend followers. 

Much like a conventional brand image, the ultimate 
meanings of a doppelgánger brand image are forged by the 
consumers who interpret and use them in the context of 
their everyday lives. In the next section, we (1) highlight the 
aspects of Starbucks’s doppelganger brand image that most 
strongly resonate with our sample of consumers and (2) 
analyze how these meanings are incorporated into these 
consumers' preferences, lifestyles, belief systems, and iden- 
tities. Drawing from these qualitative insights, we then dis- 
cuss how these seemingly adversarial meanings can be used 
in a diagnostic fashion to understand, monitor, and proac- 
tively manage an emotional-branding strategy more 
effectively. 


An Analysis of Starbucks's 
Doppelganger Brand Image 


During the course of this study, we collected data over a 
two-year period, using a multisite approach. Our primary 
data consist of tape-recorded phenomenological interviews 
with 36 coffee shop patrons across two locations: a large 
city and a quintessential “latte town” (see Brooks 2000, pp. 
103-109). The interviews totaled more than 60 hours of 


FIGURE 1 
Culture-Jamming Starbucks’s Mermaid Logo 





Notes: The image in the lower left-hand corner has been altered to remove profanity. 


54 / Journal of Marketing, January 2006 


FIGURE 2 
A Popular Culture Parody of Starbucks 
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conversation, which we tape-recorded to produce 600 pages 
of single-spaced text. These interviews were supplemented 
by extensive field notes and numerous photographs from 
on-site observations of patrons across six different local 
coffee shops. 

Interviews were conducted in six different coffee shops, 
ranging from those that were militantly anti-Starbucks and 
fashioned around a countercultural motif to those that 
exhibited a polished aesthetic much like that of Starbucks. 
This diversity of locales enabled us to accumulate narratives 
from respondents with a broad range of opinions about the 
Starbucks brand. Many of our participants also patronized a 
wide variety of coffee shops, and a few reported that they 
periodically frequented Starbucks. Thus, our interviews also 
elicited participants’ perceptions and evaluations of the 
broader coffee shop scene in relation to Starbucks. 

Participants were informed that the interviews pertained 
to their experiences and perceptions of coffee shops and 
were ensured that their identity would remain confidential. 
Each interview began with a set of “grand tour" questions 
(McCracken 1988) about participants’ personal back- 
grounds, interests, and life goals and then focused on their 
experiences and perceptions of local coffee shops. Inter- 
viewers were provided with a small set of predetermined 
topics to cover (e.g., what participants liked and disliked 
about their local coffee shop, their perceptions of the coffee 
shop’s regular customers). In keeping with depth interview 
protocols (e.g., McCracken 1988; Thompson, Locander, 
and Pollio 1989), however, we allowed participants to guide 
the flow and content of the discussion to minimize the risk 
of interviewer-induced biases. These interviews had a con- 
versational quality; interviewers interjected predetermined 
questions only when breaks in the dialogue arose. Impor- 
tantly, the interviewers did not introduce the topic of Star- 
bucks. However, in all of our 36 cases, the interviewees 
raised the distinction between their preferred coffee shop 
and contrasting doppelgiinger meanings attributed to Star- 
bucks. Further probes were then used to elucidate how these 


consumers understood Starbucks and the kind of identity 
projected by its brand image. 

We interpreted this body of qualitative data using a 
hermeneutic approach (Thompson 1997). In this approach, 
provisional understandings are formed, challenged, revised, 
and further developed through an iterative movement 
between individual transcripts and the emerging under- 
standing of the entire set of textual data. Thus, each inter- 
view was initially treated as a separate idiographic case in 
which we attempted to uncover the salient meanings and 
identity projects (e.g., Mick and Buh] 1992) that each par- 
ticipant pursued. As our interpretation unfolded, we 
attempted to recognize thematic and narrative commonali- 
ties that emerged across the data set. Our aim was to iden- 
tify the most recurrent and robust patterns of underlying 
cultural meanings that engendered these identified com- 
monalities. This hermeneutic mode of interpretation is 
premised on the idea that a given consumer is not express- 
ing a strictly subjective viewpoint. Instead, he or she is 
articulating a culturally shared system of meanings and 
beliefs, personalized to fit his or her specific life goals and 
circumstances (Thompson 1997). 


Overview of Emergent Themes 


Jack: It’s [Starbucks] just a huge corporation. J mean, just 
a gigantic corporation. They’re all over the world. Thou- 
sands of outlets, and they come in, run specials, cut prices, 
and break other coffee shops—just drive them out of busi- 
ness. Because, unfortunately, there’s not a lot of loyalty 
out there as far as consumers are concerned. They’re usu- 
ally going to go where they spend the least amount of 
money; they’re going for the bargains rather than loyalty 
to local businesspeople. 


The doppelganger meanings plaguing the Starbucks 
brand are culturally grounded in a particular form of market 
populism that portrays large corporations as exploitive and 
rapacious agents whose actions run counter to the best 
interests of ordinary consumers and small business owners 
(Cohen 2003; Kelly 2001). However, the accuracy of this 
populist anticorporatism is widely contested. For example, 
the two firms most commonly accused of predatory prac- 
tices, Wal-Mart and Starbucks, counter that they actually 
improve local economies and create greater opportunities 
for small businesses. In the case of Starbucks, this anticom- 
petitive charge is particularly ironic because the so-called 
Starbucks revolution has corresponded to a dramatic rise in 
the number of locally owned coffee shops (Helliker and 
Leung 2002). 

However, brand image is much more a matter of per- 
ceived meaning and cultural mythology (Holt 2003; Mark 
and Pearson 2001) than an aggregation of verified evidence. 
Regarding Jack's vignette, for example, there is little evi- 
dence that Starbucks, which maintains a premium market 
position, underprices local competitors. However, Jack's 
accusation is consistent with the populist idea that large 
corporations are modern-day robber barons that, as a matter 
of course, aggressively wield their clout to crush relatively 
powerless mom-and-pop operations. 

The David versus Goliath portrayal of competition 
between a small, locally owned establishment and a global 
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chain, such as Starbucks, makes for a particularly com- 
pelling mythic formulation in which the consumer can also 
assume heroic qualities. Through patronage of local alterna- 
tives, these consumers construe themselves as participating 
in a dramatic populist struggle against corporate titans, such 
as Starbucks. Their heroic self-constructions are further but- 
tressed by historically recurrent examples of corporations 
abusing their economic and political power and putting 
immediate profits over long-term societal and environmen- 
tal interests, as exemplified by the well-publicized corpo- 
rate accounting scandals of 2002. 

Among our sample of local coffee shop patrons, the 
identity value of avoiding the Starbucks brand and the ven- 
eration of their preferred local alternatives hinged on two 
primary motifs, both of which have underlying connections 
to this brand’s lack of perceived authenticity: (1) the cos- 
mopolitan motif and (2) the artisan/owner motif. Through 
the cosmopolitan motif, local coffee shops are constructed 
as sites for acquiring authentically distinctive social and aes- 
thetic experiences that are not readily available in corporate- 
run servicescapes and commercial settings. Through the 
artisan/owner motif, our participants view local coffee shops 
as atavistic throwbacks to a bygone era that existed before 
the marketplace dominance of national chains, when dedi- 
cated proprietors put their personal stamps on all facets of 
the business. Importantly, our data indicate that these 
authenticating narratives are forged through a critical com- 
parison to Starbucks, which is summarily constructed as 
little more than a commercialized facsimile of a genuine 
coffee shop and is perceived as lacking any authentic 
expression of the cosmopolitan or artisan/owner motifs. 

Notably, these two motifs closely mirror Starbucks’s 
highly publicized homage to European cafés (including the 
attendant ideals of cultural sophistication, worldly engage- 
ment, and intellectual conversations) and its related market 
positioning as a comfortable communal space in which con- 
sumers can relax and socialize (Schultz 1997; Shields 
2002). Our participants embrace these cosmopolitan and 
communal motifs as defining features of an authentic coffee 
shop, and they perceive Starbucks as being unable to deliver 
on these emotional-branding promises. 


A Distinctive Cultural Experience (the 
Cosmopolitan Motif) 


Scott: I don’t like Starbucks, because it seemed like every 
time I was there, the people have their laptop and then they 
have their New York Times or whatever, their espresso, and 
it just seems sterile. There was nothing, like here it's a lit- 
tle gothic and a little different. So it seemed too sterile. 


Distaste for Starbucks's sterility and boring uniformity 
is a prominent refrain in our interviews. Our participants 
widely condemned Starbucks for offering a bland, prepack- 
aged, focus-group-tested consumption experience that 
caters to bourgeois tastes. In diametric contrast, our partici- 
pants view local coffee shops as locales for authentic 
expression of cultural uniqueness and alternative (i.e., non- 
corporate) sensibilities. 

The quest for authentic cultural experiences has been 
_ discussed as a prominent motivator of global travels (Mac- 
Cannell 1989) and excursions to historic sites, museums, 
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and reenactments (Grayson and Martinec 2004; Pefialoza 
2000). In an era marked by cultural concerns over market- 
place standardization, homogenization, and a growing 
antipathy toward generic suburban landscapes populated by 
cookie-cutter homes and strip malls (Oldenburg 1997; 
Ritzer 1998), the search for authenticity is often coupled 
with a desire to experience distinctive cultural worlds (Holt 
1998; Thompson and Tambyah 1999). This cosmopolitan 
motif is prominent among our sample of local coffee shop 
patrons, who consistently characterized an authentic coffee 
shop as one that provides a spontaneous experience of local 
color that cannot be attained elsewhere. Importantly, these 
coveted cosmopolitan experiences are understood through 
the critical contrast to Starbucks's doppelgänger meaning of 
being a highly standardized, calculated, and status- 
conscious servicescape. Consider the following passage: 


Brian: I mean, if you just think of coffee culture, you're 
thinking about, you know, struggling artists. I hate to 
stereotype like that, but, I mean, with the local coffee 
places, you're going to have a little more idiosyncrasy, a 
little local flavor, color to the place. They might have a 
specialty kind of menu or something, or they might make 
[or] do something unique. But, say you're in San Fran- 
cisco or Seattle, and you see a Starbucks; you more or less 
know what you're going to get when you walk in the door. 
This place is kind of an eclectic nature. Yeah, I just like 
the area. It's kind of a bohemian area of town. It's not like 
cookie cutter like Starbucks. You don't have to be a certain 
class to go there; more or less, you can be of any class 
really. I mean you just, what I like about it 1s, you never 
know what kind of character 1s going to show up here 
when you're having your coffee. 


Through the mundane act of patronizing local coffee 
shops, our participants can experience themselves as rebels 
who are consuming against the grain of corporate- 
dominated mainstream conformity. Through this formula- 
tion, their experiences of authentic cultural difference and 
social diversity afforded by their favorite coffee shops 
become a resource for cultivating a sense of personal dis- 
tinctiveness and defiant individuality. Conversely, their dia- 
metric contrast to Starbucks's doppelganger meanings (i.e., 
corporate conformity and mass-market standardization) 
functions as a frame of reference that enhances these expe- 
riences of authenticity: 


Frank: Well, yeah, a place like this 1s unique, which I like. 
It very much caters to, you know, the population around 
here. It’s very, you know, it's comforting to somebody 
who lives in the suburbs. It’s unique, it’s got its artsy sort 
of thing, but as opposed to Starbucks, you know, every- 
where it’s still the same corporate kind of cookie-cutter 
type of place that 1s more, that I don’t see the same 
uniqueness. You know, you can come in here and get a cup 
of coffee, and it’s going to be different than what you get 
in [name of another local establishment]; it’s going to taste 
a little bit different, but you know, [in] every Starbucks, 
the food and drinks are pretty much the same. So, I mean, 
just a place like this has more personality. I wouldn’t go to 
Starbucks. I'll go out of my way to go all the way to the 
East Side—for me, a drive from here is about six miles. I’d 
rather do that, and it’s just more of a satisfying experience. 


Similarly, Ellen echoes Adbusters’ critique of mass con- 
sumer culture as a stultifying force (Lasn 1999) that incul- 


cates preferences for standardized experiences that are 
familiar and comforting but ultimately inauthentic: 


Ellen: Well, I actually don’t go into coffee shops that are 
chains, or I avoid them as much as possible. I don’t do 
stores that are chains. 


Interviewer. Tell me about that. i 


Ellen: Well, I don’t want to participate in sort of the cor- 
porate spread into the world. I have the feeling that people 
that go to chain places have some sort of sense of their 
safety in doing that. Like, it’s always the same no matter 
where they go. It actually becomes the culture. 


Interviewer : Can you explain that a little more? 


Ellen: When you go to Paris or you go someplace out in 
the world, the tendency is to want to find something that’s 
home, something that you know, and if you walk down the 
street and find a McDonald’s or a Starbucks, you know, 
this 1s your home, because these things have become so 
entrenched in the culture that it’s not even conscious. If 
you go to look for restaurants, to look for places that are 
locally owned and run, they are not the same all over 
everywhere and that takes, it feels as if it takes effort. Like 
people, it takes effort and daring to try something that 
isn’t the same. If you’re somebody who’s used to going 
into the same all the time, I think that’s a sad comment on 
our culture. 


Interviewer: What are some of the things you think about 
when choosing a coffee shop? 


Ellen: Okay, I check out whether it’s a place that has inter- 
esting people in it that look like I might want to get to 
know them. If it offers soy milk (laugh), that’s peculiar! If 
it embraces variety, isn’t rushed.... I think that’s important 
too. I like being in a place where people aren't, you know, 
pounding a cup of coffee and zooming out somewhere. A 
certain kind of vitality I like as well. I look for where 
people are alive and active and doing something with their 
lives. 


In this passage, Ellen forges a symbolic contrast 
between a standardized and mass-marketed consumer cul- 
ture that caters to a risk-averse and staid clientele and the 
authentic experiences of local cultural charm that exist out- 
side the sphere of corporate chains. Through her avoidance 
of chains, Ellen venerates her consumption preferences and 
lifestyle by pejoratively framing consumers as people who 
are trapped in the proverbial rat race and who lack time or 
inclination to live life to its fullest. In sharp symbolic con- 
trast, she aligns herself with an enlightened community that 
leads authentic and purposeful lives, exhibiting a relaxed 
sense of joie de vivre and an enthusiasm for new experi- 
ences and cosmopolitan pleasures. Much like the value- 
laden distinction between tourists and travelers (see Mac- 
Cannell 1989; i.e., “Everyone else is a tourist, but I am a 
traveler", a devout preference for local coffee shops and 
publicly expressed antipathy toward corporate chains, such 
as Starbucks, can be lionized as a sign of personal virtue 
and self-directed (rather than marketing-influenced) cos- 
mopolitan tastes. 

Beyond the personal significance of these identity- 
enhancing symbolic distinctions, these consumer narratives 
also highlight an important cultural aspect about percep- 
tions of authenticity. Prior research has suggested that expe- 


riences of servicescape authenticity are steeped in a per- 
ceived fidelity to historical circumstances, cultural tradi- 
tions, or the lives of famous people (Grayson and Martinec 
2004; Pefialoza 2000). In the case of the American coffee 
shop scene, however, these cultural pillars of authenticity 
carry less symbolic weight. For example, Ellen's preferred 
coffee shop had been in operation for only three years, so it 
could not lay claim to some long-standing tradition. In gen- 
eral, local coffee shops tend to be simulations of a romanti- 
cized image of the Greenwich Village beatnik/bohemian 
coffeehouses (circa the 1950s) or left-bank French cafés 
frequented by beret-wearing, existential-spouting intellectu- 
als. In this context, compelling claims to authenticity must 
be actively constructed, and the symbolic distinctions 


^ forged through Starbucks’s doppelgänger brand meanings 


loom large in this regard. Particularly, the authenticity of 
local coffee shops' cultural ambiance is contingent on the 


' idea that Starbucks's trademark servicescape is a marketing 


affectation designed to maximize profitability rather than an 
authentic, spontaneous expression of local culture: 


Anne: My image of Starbucks is pretty bad. Even though I 
haven't been there that many times, I think of [1t] as more 
commercial, um, selling out in a way because you think 
of, like, Starbucks, they find out what people want and the 
atmosphere and the CDs and the merchandise, and yop 
know, they're doing a really good job with all that, but 
something about that whole idea.... I just want to rebel 
against it. It's almost too analytical, where, you know, this 
place is just more "spiritual, you know, it's more that 
feeling; it’s not as analytical. It’s just more emotional or 
something 


Our participants' sharp distinction between authentic 
coffee shops and crass commercial interests reveals a glar- 
ing historical misconception. As McFall (2004) discusses, 
eighteenth- and nineteenth-century European coffeehouses 
were protomarketing sites. Not only were they segmented 
according to different consumer interests, emphasizing 
poetry/literature, news, storytelling, and gossip, but they 
also provided an important means to distribute advertise- 
ments (with several advertising agencies actually originat- 
ing from the coffeehouse trade) and commercial gathering 
points for auctions, sales, and business meetings. However, 
this historical intertwining of the commercial and the cul- 
tural does not easily align with romanticized desires for 
spontaneous cultural experiences that reveal the defining 
essence of a community. Through these invidious contrasts 
to Starbucks, however, local coffee shops can appear to be 
less driven by marketing considerations and more closely 
linked to their local community. In other words, our con- 
sumers' perceptions of authenticity emerge in relation to 
their critical constructions of Starbucks and the corporate 
staging of its servicescape. 


An Intimate Commercial Experience (the Artisan/ 
Owner Motif) 


Patrick: Stammcafé is a German word for your neighbor- 
hood café, where you're a regular, and it's great because it 
totally says that idea, which is you have your local, local 
place, you know, your stammcafé, the place that you 
always go to, and then there are "cafés," where you just 
kind of go and, you know, when you're out with friends or 
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if you’re in another part of town for a movie or things like 
that; so sure, why not go there, you know? But the stamm- 
café, you have a special sort of relationship. 


Patrick’s musings about his stammcafé bear a close rela- 
tion to Oldenburg's (1997) concept of “third places" (i.e., 
public spaces that function as social gathering points and 
provide a proverbial home away from home, where a person 
can experience camaraderie and community). Oldenburg 
suggests that third places are characterized by a regular 
clientele who gather to experience a particular form of inti- 
macy in these commercial spaces. This form of intimacy 
typically does not arise in the formality of the workplace, 
and it is more relaxing and convivial than the private inti- 
macy of family relationships because it carries fewer inter- 
personal demands, responsibilities, and obligations. Impor- 
tantly, the heart and soul of this commercial intimacy is the 
establishment’s proprietor. In the ideal case, patrons believe 
that they are being personally enriched by their relationship 
to a unique character (see Oldenburg 1997; Price and 
Arnould 1999). 

This sense of having an authentic connection to an indi- 
vidual proprietor is quite salient to our local coffee shop 
patrons. However, our interviews and observations suggest 
that, in general, the servicescapes of American coffee shops 
do not exhibit the kind of public intimacy and collectively 
shared conversations that Oldenburg argues are a defining 
cultural characteristic of third places. The social milieu of 
the contemporary coffee shop simply does not enjoin such 
spirited expressions of camaraderie and social bonding. 
Rather, it tends to foster more individualized experiences of 
a common connection to the owner/proprietor, who is 
viewed as the embodiment of the establishment's values 
and its quirky, endearing qualities. 

In this sense, our participants' narratives parallel some 
findings from Price and Arnould's (1999) analysis of com- 
mercial friendships, particularly in regard to the role of an 
owner's self-revelations in generating a sense of shared inti- 
macy. However, the range of personal information our par- 
ticipants desired is fairly narrow. These consumers want to 
know just enough about the proprietor to draw inferences 
about how the coffee shop reflects his or her views and val- 
ues. Thus, this bounded intimacy enables customers to craft 
a compelling story about why their preferred coffee shop is 
truly different and, thus, fundamentally and irrevocably dis- 
tinct from the corporate-run and standardized Starbucks: 


Interviewer: What is a local coffee shop for you? 


Sandra: For me, it would be a coffee shop that isn't 
national, something that is not a chain. Like this coffee 
shop, the owner is here most of the time I am here, and I 
just find that I really respect that, and I have had conversa- 
tions with the owner before. I know that he used to be, 
like, in the jewelry business, he left that and opened a cof- 
fee shop. So I just find that pretty interesting that the 
owner frequents the place and works there and has a con- 
nection to the customer. 


Interviewer: So what is it about knowing the owner or 
having the owner living in the area that's important to 
you? 


Sandra: I like to support local business. I think that Star- 
bucks, in a way, is very sterile. I mean it doesn't have 
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much character and personality, because it is this mass- 
produced business, and I think that when you have a situ- 
ation where there are so many, it becomes more imper- 
sonal, and the people who work there don't care as much I 
would guess. I would think that if it's locally owned, the 
owner is more involved. He or she would have more con- 
nection to the employees, and the employees would prob- 
ably feel better about working for this person that you see 
regularly. I mean if you look at this place, it's unique. 
There is no other place like it in town, there is no other 
place in the world just like it. But if you look at Starbucks, 
it's pretty common. It’s just not as interesting, I think. 


Much like our other participants, Sandra praises local 
coffee shops for having a one-of-kind quality that emanates 
directly from the owner's personal touches and that are con- 
spicuously absent from Starbucks's mass-marketed ser- 
vicescape. Our participants tend to read considerable mean- 
ing into their experiences of local charm. They view these 
distinctive features of the local coffee shops as a kind of 
self-revelation on the part of the proprietor, who is express- 
ing his or her own personal tastes and values. Furthermore, 
these various signs of uniqueness are taken as compelling 
signals that these proprietors are following their own muses 
rather than calculating what offerings would be most popu- 
lar to a particular customer segment. Paradoxically, this 
apparent "take-it-or-leave-it" indifference to customer 
whims provides these establishments with a particular 
appeal, particularly in contrast to Starbucks, which is con- 
sidered a conservative and boring servicescape designed not 
to offend anyone's sensibilities: 


Kevin: I visited my sister in Oakland; there are three cof- 
fee shops on Piedmont Avenue. I've been to all three. One 
is a place called Peet's coffee; it's like a local that's gone 
national. Then there's the Starbucks of course, and then 
there's another little place that I went to that I started to 
frequent and just loved. It was a little left leaning, kind of 
different. Their coffee was high quality and stuff, but this 
little ma-and-pa place served ice cream; they had ice 
cream as part of their coffee shop—a little different. They 
had a Ms. Pac-Man video game there, and they also put on 
the walls photographs of homeless people. They pay 
homeless people to have Polaroid pictures taken—they 
would ask for permission of course—and then they gave 
them, like, $20 or something, and it becomes art on the 
wall. They weren't afraid to be nontraditional; that's what 
I loved. Someplace like Starbucks wouldn't dare do that. 


In these consumer narratives, local coffee shop propri- 
etors are venerated for being sincerely and deeply commit- 
ted to their clientele, employees, and the neighborhood at 
large. In contrast, Starbucks is described as a corporate 
interloper that wants to reap profits through a mass- 
produced servicescape and that conspicuously lacks an 
authentic connection to the local community. Sandra's pas- 
sage also reiterates the previously discussed heroic framing 
of the local coffee shop. The proprietor is celebrated for 
fighting to maintain an invaluable and distinctive commu- 
nity asset against the powerful forces of an impersonal cor- 
porate chain. Through this heroic interpretation, Sandra's 
own patronage of local coffee shops takes on a heightened 
moral and social significance, representing a means through 
which she can help ameliorate the blight of standardized 
corporate servicescapes. 


Beyond this immediate sense of personal connection 
and reciprocal commitment, our participants also portray 
the proprietors of their favorite coffee shops as devoted arti- 
sans, who enact laudable virtues through their business 
activity, such as educating the palettes of their clientele (in 
the manner of a wine connoisseur): 


Carie: I love Tina [the owner]. I think, originally, people 
came into [name of local establishment] for a hit of coffee 
and a hit of Tina. You know, she’s got such great energy. I 
love to see Tina, and I love the coffee. I love the freshness 
of their coffee. You know, and I buy their beans. It was 
really fun when she was first getting into roasting coffee. 
She had a popcorn popper, like an air popper. Which is 
very similar to the big roaster that they use. The same 
technology, so she was experimenting with different 
beans. She’d bring over, like, little tiny Ziplock bags full 
of different coffees. So she’d have us try it all the time. At 
first, it was like a little bit, it was so intense, you’re not 
used to drinking that fresh of a coffee, and then you sort 
of develop like this, it’s like drinking wine. You know, you 
don’t really appreciate, like, a really good wine at first, 
and then you start to appreciate it. .. She is real connected 
to the Kaladi Brothers up in Alaska, up 1n Anchorage, and 
so it’s really fun, because we were up in Anchorage, and 
you know, Kaladi Brothers was all over the place. Like 
you'd see a sign, “We serve Kaladi Brothers Coffee.” It's 
the same awesome quality, really consistent, you know, 
roasting, and everything. My guess is that’s her inspira- 
tion. And Kaladi Brothers, their main shop is off this road 
that's difficult to get to; it has no parking, and you know, 
so the same kind of thing as [her favorite coffee shop's 
location]. It's not like in the perfect shopping mall and 
real accessible, so. 


. This passage exemplifies the shared experience between 
a proprietor and customers that imbues local coffee shops 
with an undeniable sense of authenticity. In this case, Carie 
views herself as intimately involved in the proprietor's 
(Tina) efforts to perfect a time-honored craft and to culti- 
vate greater consumer appreciation for skillfully roasted 
coffee. In keeping with the artisan ideal, Tina (the propri- 
etor) has earned a further claim to authenticity through her 
apprenticeship at Kaladi Brothers Coffee and her choice of 
an out-of-the-way location for her shop, which contributes 
to the sense that she is not catering to the masses (a la Star- 
bucks) but only to those who can really appreciate a great 
cup of coffee (and are willing to make some effort to attain 
it). 

As Brown (2003) suggests, under certain market condi- 
tions, inconvenience—the bane of conventional marketing 
management thought—can actually increase customer loy- 
alty and commitment. Through this nexus of meanings, 
Carie's patronage of her favorite local coffee shop facili- 
tates an identity project that is particularly salient to con- 
sumers with higher levels of cultural capital (Holt 1998). 
Such consumers strive to demonstrate the distinctiveness 
and sophistication of their aesthetic tastes by consciously 
seeking out unique consumer experiences that exist off the 
proverbial beaten path (Thompson and Tambyah 1999). 
Starbucks's mass-marketed coffee and its ubiquitous 
convenience-oriented servicescapes provide a readily recog- 
nizable symbol of the consumer mainstream, and thus the 
brand serves as a symbolic foil for consumers who pursue 
this sense of cosmopolitan distinctiveness. 


Conventional brand management theory holds that, in 
general, consumers avoid brands whenever negative associ- 
ations, such as those dogging Starbucks, become strongly 
established in their minds. Through brand avoidance, con- 
sumers circumvent the risk that these stigmatized meanings 
will sully their public image or self-concept (see Keller 
2003b). However, our research provides evidence that con- 
sumers do not just avoid doppelgänger brand meanings but 
rather consciously leverage these disparaging meanings in 
ways that reinforce an identity-affirming belief that they are 
more daring and discriminating than the average, main- 
stream consumer. 


Summary 


Our cultural analysis suggests that the widespread critiques 
(by both mass media and antibrand activists) of Starbucks 
as a rapacious corporate titan are reworked by consumers to 
create an identity-enhancing morality tale, premised on the 
consumption ideal of authenticity. The subtext of this 
authenticating narrative is the David versus Goliath myth, 
which imbues our participants' experiences of oppositional 
brand loyalty (see Muniz and O'Guinn 2001) with a partic- 
ularly strong emotional charge. Our participants perceive 
local coffee shops as loci of authentic consumption experi- 
ences and social relationships. These two themes map onto 
contradictions that exist between Starbucks's emotional- 
branding promises and its highly public presence as a 
global corporate chain. Starbucks's emotional-branding 
strategy encourages consumers to desire a sense of intimacy 
and cultural distinctiveness, but paradoxically, it is not as 
well suited to deliver on these experiential promises as are 
the myriad local coffee shops that have followed in its 
frothy corporate wake. 

In the following section, we offer some culturally 
grounded extrapolations beyond our interview data to spec- 
ify which aspects of Starbucks's emotional-branding strat- 
egy contribute to its doppelgänger brand image and render 
it susceptible to the stigma of inauthenticity. Drawing from 
this cultural reading, we then illustrate how an analysis of a 
brand's doppelgánger meanings can be used (1) to diagnose 
emerging cultural changes that could undermine the authen- 
ticity of its emotional-branding story and (2) to gain 
insights into how this brand story could be altered to avoid 
this undesirable outcome. This kind of interpretive move is 
consistent with theoretical arguments that the analyses of 
consumer narratives gain explanatory power and enhanced 
managerial relevance when they are supplemented by 
broader considerations of their supporting social, cultural, 
and historical contexts (see Arnould and Wallendorf 1994; 
Holt 2003; Thompson 1997; Zaltman 2003). 


Diagnosing Starbucks's 
Authenticity Dilemma 


Starbucks's success is widely attributed to its ability to 
deliver consistently the communal ambiance, conviviality, 
and emotional warmth of a quaint European café (Schultz 
1997; Shields 2002). According to this explanation, Star- 
bucks has attained market dominance because its carefully 
designed servicescape is an inviting and comfortable place 
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for consumers to meet, socialize, and enjoy a memorable 
coffee-drinking experience (Aaker 2004; Keller 2000; 
Roberts 2004; Schmitt 1999; Schultz 1997). The putative 
importance of the third-place experiences sits uneasily with 
tracking studies that indicate that more than 70% of Star- 
bucks’s customers are “grab-and-go” types, who are most 
concerned with speed of service (Moon and Quelch 2004; 
Stone 2004) and, thus, spend little time imbibing its quaint- 
ness and warmth. 

This marketplace disparity raises the possibility that this 
third-place appeal is a peripheral aspect of Starbucks’s iden- 
tity value and emotional appeal. We also note that the dis- 
paraging portrayals of the Starbucks brand, whether 
expressed by our participants or by the gamut of anti- 
Starbucks Web sites, never question the basic premise that 
Starbucks is a good place to meet friends and have casual 
conversations. Rather, challenges to Starbucks's authentic- 
ity appear to target a deeper emotional level that cuts to the 
heart of the brand's identity value. 

Taking a cue from Holt's (2004) thesis that brands attain 
iconic standing when they provide myths that consumers 
can use to assuage salient threats to their identities, we 
begin by noting that Starbucks became a cultural icon dur- 
ing this heyday of the mid-1990s new economy and the dot- 
com boom. These socioeconomic conditions gave rise to a 
professional class of symbolic workers who were well 
rewarded for their creative skills. These professionals 
inhabited organizations that were increasingly decentralized 
and that reveled in having an informal and playful corporate 
culture that sharply deviated from bureaucratic norms 
(Florida 2002). 

The social corhmentator David Brooks (2000) cleverly 
christened this new professional class as the Bobos, that is, 
a new breed of pojsionals who attempted to meld the 
bourgeois values of hard work, career success, and material 
affluence with the bohemian values of creativity, expres- 
Siveness, anticonformism, and antimaterialism (for a related 
discussion of how Volkswagen became the automotive 
brand of choice among the Bobo class, see Holt 2004). 
Through this postmodern identity, people could be career 
driven and yet define themselves as anticonformists who 
live by an edict of creativity and self-expressiveness, and 
they could lead affluent, brand-conscious lifestyles while 
viewing themselves as committed antimaterialists. 

To maintain a coherent self-identity, Bobos needed a 
means to assuage the many cultural conflicts posed by these 
juxtapositions. Thus, for members (and aspirants) of this 
new class, Starbucks delivered exactly the right kind of 
mythic story. Through its embodiment of a hip, creative, 
caring form of capitalism, this brand image erased any 
sense of cultural incompatibilities between bourgeois and 
bohemian value systems. Starbucks catered to these new 
class sensibilities in every aspect of its brand delivery, most 
particularly its patented decor that emulates not a European 
café but rather the decor of creative-class office spaces 
(Florida 2002; Schmitt and Simonson 1996). The symbolic 
message is that Starbucks is an appropriately informal, cre- 
ativity conducive space in which the creative class can meld 
their professional and bohemian identities and directly 
experience material and symbolic identity validation. 
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Through their patronage of Starbucks, corporate bohemians 
can reinforce the authenticity of their delicate ideological 
balancing act and thus allay concerns that they are corporate 
clones who, in some way, may be culpable for the social 
and environmental problems widely attributed to corporate 
capitalism. 

The doppelgánger brand meanings we have identified in 
this study provide some evidence that Starbucks's rapid 
(and much parodied) market expansion is beginning to 
undermine this brand's ability to assuage the identity con- 
flicts facing the corporate bohemian class. In effect, Star- 
bucks’s doppelganger brand image unravels its corporate 
bohemian synthesis and suggests that its bohemian image is 
a marketing gambit designed to mask its real identity as a 
corporate titan. This is a potentially severe threat to Star- 
bucks’s continued marketing success, because if it loses its 
aura of authenticity, 1t can no longer symbolically affirm the 
Bobo-ish identities of its core customers. In addition, more 
casual customers may also be more predisposed to question 
the brand’s commitment to social responsibility and the 
artisan craft of coffee roasting. Such a development would 
likely make Starbucks vulnerable to competitors in the form 
of small independents, regional chains, and national com- 
petitors, such as Peet’s. 

The franchise of an iconic brand, such as Starbucks, is 
typically built around a relatively small segment of 
extremely loyal customers who validate the authenticity of 
its emotional-branding story and act as brand evangelists. 
Through the actions of this devoted core, the brand acquires 
a heightened cultural cachet that, in turn, attracts a broader 
segment of peripheral consumers who draw from the 
brand’s symbolic value (Holt 2004; McAlexander, 
Schouten, and Koenig 2002). As exemplified by the travails 
of brands such as Snapple and Volkswagen (circa the 1970s 
and 1980s), when these core customers begin to abandon an 
iconic brand, it precipitates a downward cultural spiral, 
which can erode brand equity and market share (see Holt 
2004). Thus, a relatively small number of brand avoiders 
could be the impetus to a cultural tipping point (Gladwell 
2000) that weakens Starbucks’s brand franchise. 

In summary, our analysis indicates that Starbucks’s 
aggressive growth strategy, highly publicized market domi- 
nation, and omnipresence no longer mesh with its appeal to 
a countercultural, bohemian sensibility and that this strat- 
egy may be compromising the perceived authenticity of its 
emotional-branding story. However, an analysis of doppel- 
ginger brand meanings can only take brand strategists so 
far in deciding whether a brand’s image should be reconfig- 
ured and in identifying what new cultural contradictions 
should be targeted in an attempt to revise its brand position- 
ing. To accomplish this strategic task, brand strategists need 
to analyze thoroughly the socioeconomic and cultural con- 
ditions currently facing their core customers. For example, 
the conditions that characterized the “new economy” during 
Starbucks’s rise to iconic status have been fundamentally 
altered by the puncturing of the dot-com bubble. The cur- 
rent economic climate is colored by the events of Septem- 
ber 11 and a heightened sense of economic vulnerability 
that has arisen from dramatic declines in new job produc- 
tion and other related factors, such as corporations’ out- 


sourcing of the new class’s “creative work” to lower-wage 
countries (Pink 2005). By analyzing how these insecurities 
are represented in popular culture (through books, maga- 
zines, films, music, blogs, and other media) and by studying 
how consumers understand their own lives in relation to 
them, insights can be gained into resonant life concerns that 
the Starbucks brand can credibly speak to through its 
emotional-branding efforts. 

As a hypothetical illustration, imagine that follow-up 
research reveals a growing cultural tension between Ameri- 
cans’ historically optimistic view of the future (and the 
corollary belief that everyone can succeed if they work hard 
and apply their talents) and a more pessimistic view that 
economic fate is controlled by forces outside of personal 
control (and that fate is not always kind or just). This hypo- 
thetical cultural contradiction could support a brand 
mythology that represents Starbucks as a pillar of stability 
and security in consumers’ lives (thus reframing the doppel- 
giinger meanings about Starbucks’s “boring standardiza- 
tion” as a positive). Going further, Starbucks could assume 
the role of an ally or partner that helps consumers take 
charge of their professional lives and accomplish life goals 
in the face of random and arbitrary obstacles. Starbucks’s 
professional, corporate aura (again, currently viewed as a 
negative) and the stimulating properties of its product could 
lend themselves to this reformulated emotional-branding 
story. 


Implications for Strategic Brand 
Management 


In general, the emotional-branding literature reads like a 
heroic tale in which intrepid brand managers break with 
orthodox branding strategies and are rewarded with 
intensely loyal customers, heightened profits, and a nearly 
impervious foundation of competitive advantage. These 
heroic testaments to the power of emotional branding 
invariably gloss over the potential risks associated with this 
strategy. Through a detailed examination of the case of Star- 
bucks’s emerging doppelgánger brand imagery, we have 
attempted to address this important issue. In this section, we 
take a step back from the details of this case and develop a 
set of initial recommendations that could help brand man- 
agers diagnosis pending threats to their emotional-branding 
investments. 

History has shown that dominant brands routinely fall 
from their stellar heights and struggle to regain their former 
glory. Disney, McDonald’s, AT&T, Levi’s, Kodak, and 
Coca-Cola are just a few examples of market-driven brands 
that rather suddenly went from paragons of marketing 
excellence to besieged icons losing ground to competitors 
(Haig 2003; Holt 2002; Stewart 2002). However, the con- 
ventional brand management literature offers little concrete 
advice on how brand strategists can proactively diagnose 
the cultural vulnerabilities that could eventually erode their 
customer-based brand equity. Rather, brand image-related 
problems are attributed in general to companies failing to 
follow the tried-and-true principles of sound brand manage- 
ment, such as maintaining consistency and relevance 
(Keller 2000, 2003a). 


Yet the diagnostic value of these normative branding 
principles is actually quite limited. The consistency princi- 
ple instructs managers to specify their brand's essence, core 
personality, and primary benefit carefully and then commu- 
nicate consistent messages about these essential dimen- 
sions. The relevance principle instructs managers to update 
continuously their brand's user/usage imagery, brand per- 
sonalities, types of relationships, and promotional themes to 
keep a brand image fresh, energized, and contemporary 
(Aaker 2004; Keller 2003a). The somewhat contradictory 
nature of these two normative principles leaves little doubt 
that brand managers must undertake a very delicate balanc- 


ing act. However, these principles beg the important ques- 


tion, How can this balance be effectively attained? To 
redress this gap, we propose a three-step action plan that 
can help marketers protect their emotional-branding 
investments. 


Monttor Cultural Cues 


First, brand managers should closely monitor popular cul- 
ture for signs that their brand is beginning to develop a dop- 
pelgánger brand image. Here, the critical issue is not how 
many people embrace these unflattering brand meanings or 
if they pose an immediate threat to the brand. Indeed, a 
best-case scenario would be to catch these meanings during 
their underground phase, analogous to the way that “cool 
hunters” attempt to discover nascent consumer trends 
(Gladwell 2000). The relevant concerns at this stage are 
which aspects of the emotional-branding story are being 
subverted and what alternative symbolic brand meanings 
are being created. For example, our analysis of Starbucks 
suggests that its mass-market ubiquity has undercut an 
essential component of its emotional-branding story, 
namely, its ability to act as an authenticating symbol of hip 
consumerism (see Frank 1997). These cues suggest that 
Starbucks should consider developing a new emotional- 
branding story that reframes its corporate connotations in 
ways that function as a symbolic asset rather than as a 
source of inauthenticity. 

The same medium that facilitates the rapid cultural dif- 
fusion of a doppelgänger brand image (1.e., the Internet) can 
also facilitate the task of cultural monitoring. For example, 
brand managers should routinely monitor consumer chat 
rooms, blogs, and Web-based media to gauge the cultural 
buzz surrounding their marketing campaigns. A more 
sophisticated approach would be to use systematic netnog- 
raphy techniques (see Kozinets 2002) to assess when an 
emotional-branding story is beginning to inspire parodies, 
critics, and other signs of a cultural backlash and to analyze 
the cultural content of these cultural-jamming efforts. Brand 
managers should be sensitive to both the frequency and 
level of intensity of these antibrand sentiments and the spe- 
cific aspects of their emotional-branding story that are 
being subverted (and the adversarial meanings that result). 


Identify and Track Brand Avolders 


Second, brand managers should identify and study people 
for whom these doppelganger meanings resonate and moti- 
vate various kinds of brand avoidance. Blogs and antibrand 
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Web sites may be a useful starting point for locating these 
people. In addition, customer relationship management sys- 
tems could be used to identify lapsed customers, who may 
turn out to be intentionally avoiding a brand because of its 
doppelgünger imagery. By studying these consumers’ view- 
points (we recommend using qualitative methods, such as 
depth interviews, laddering, ZMET-type collage studies, or 
even focus groups), managers can assess which doppel- 
günger brand meanings are resonating with consumers and 
plumb the identity benefits they gain from this negative 
imagery. 

This second step is a necessary complement to the 
broader monitoring of cultural cues we outlined in the first 
step. The ideological concerns that inspire antibrand 
activists to create and circulate a doppelgünger brand image 
(and that tend to dominate those representations) may not 
be that relevant to consumers or that central to their brand 
avoidance motivations. In the case of Starbucks, for exam- 
ple, antibrand activists widely accuse Starbucks of engaging 
in a host of predatory business practices. Although the con- 
sumers in our study made passing reference to these 
charges, their primary motivations for avoiding the brand 
ensued from their perceptions that Starbucks’s commercial- 
ized servicescape lacked cultural distinctiveness and an 
authentic personal touch. Thus, brand management deci- 
sions based only on an analysis of cultural cues could lead 
managers to erroneous conclusions about the doppelganger 
meanings that pose the greatest threat to their emotional- 
branding strategies. 


Develop and Test a New Emotlonal-Branding 
Story 


Third, to manage the challenges posed by the emergence of 


a doppelgänger brand image, we advocate a decidedly 
proactive approach. Specifically, we believe that brand 
managers should make adaptations to their emotional- 
branding story before it goes stale and precipitates a full- 
blown branding crisis. This proactive orientation carries an 
unnerving implication: Managers should begin modifying 
their emotional-branding strategy while it is still accom- 
plishing its marketing objectives. In a dynamic and rapidly 
changing marketing environment, we believe that it 1s much 
better to stay ahead of the cultural curve than to fall behind 
it and suddenly need to play catch-up with competitors that 
have more resonant emotional-branding stories. 

When confronted by an emotional-branding strategy 
that is showing blatant signs of diminished marketplace 
appeal, managers can easily fall into a crisis management 
panic mode as they grapple with declining sales and lost 
share. Under such conditions, they may be tempted to dele- 
gate much of the strategy-creation process to outside brand- 
ing consultants in the quest for quick, magic-bullet solu- 
tions (Haig 2003). However, this crisis mode can also lead 
to arbitrary and abrupt shifts in an emotional-branding story 
that often compound the problem (Keller 2000). For exam- 
ple, if a brand has lost its cultural claim to authenticity, a 
dramatic shift in the symbolic terms of its emotional- 
branding strategy could easily be interpreted as further evi- 
dence that the brand has no authentic identity other than 
that dictated by the whims of fashionability. 
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Accordingly, we suggest that a successful brand image 
revitalization strategy must fit with prevailing cultural con- 
ditions while displaying a clear and comprehensible con- 
nection to the emotional-branding story through which the 
brand made its cultural mark. Because of the amount of cor- 
porate transparency now available on the Internet, 
emotional-branding stories that lack this kind of historical 
continuity are likely to be perceived as inauthentic (see Holt 
2004). A cultural analysis of a doppelgdnger brand image 
can offer valuable insights into how a resonant new 
emotional-branding story can be culled from the narrative 
threads of an existing one in a way that maintains this vital 
sense of historical continuity. 

By engaging in the cultural monitoring and brand 
avoider tracking we discussed previously, brand managers 
(and their creative consultants) should have a wealth of 
insights (both cultural and individual) to develop a set of 
alternative emotional stories that can be tested among con- 
sumers. Although most consumers are not particularly good 
at generating new campaign ideas, particularly ones that tap 
into emerging cultural trends (see Gladwell 2005; Holt 
2003; Zaltman 2003), they are quite astute at assessing the 
relative degree of emotional connection they experience 
with different story lines. Again, we recommend the use of 
qualitative methods, especially rich projective techniques 
(e.g., Z-MET; Zaltman 2003), as a means to evaluate the 
emotional resonance (and authenticity value) of alternative 
story lines. These qualitative insights could also be used to 
develop quantitative measures of emotional resonance and 
perceived authenticity (see, e.g., Grayson and Martinec 
2004). Furthermore, this approach offers a means to 
develop a culturally based brand-shielding strategy (see 
Park, Jaworski, and MacInnis 1986), designed to insulate a 
brand's symbolic value (e.g., its perceived authenticity) 
from the devaluation among core customers that can arise 
when a brand attains broader market appeal. 

The implementation of these three steps requires a thor- 
ough understanding of a brand's history as well as the cul- 
tural meanings and identity projects most salient to its core 
customers. This type of knowledge 1s difficult to codify and 
emerges through a process akin to dynamic puzzle solving. 
In this puzzle-solving activity, brand strategists must iden- 
tify a gestalt fit among the target customers' cultural worlds, 
the properties of the brand, and a credible emotional- 
branding story that links consumers to the brand. Unlike a 
conventional puzzle, however, the emotional-branding 
conundrum keeps changing, and accordingly, emotional- 
branding strategies must be periodically revised and trans- 
formed. In this sense, our proposed guidelines align with 
Dickson's (1992) competitive rationality thesis that market- 
ing success is dependent on constant monitoring of the 
dynamic competitive environment and the continual genera- 
tion of innovative adaptations to these changes. 


Conclusions and Future Directions 


In this research, we attempted to demonstrate that (1) 
emotional-branding strategies may unintentionally foster 
the cultural development of a doppelgänger brand image 
and (2) careful analysis of this imagery can provide insights 


into latent branding problems that might not be detected by 
conventional branding strategy prescriptions. Although we 
believe that our analysis of Starbucks’s emerging doppel- 
ginger imagery provides support for our thesis, we also 
acknowledge that our research is limited by important 
boundary conditions and that these conditions represent 
potential research opportunities. 

Our analysis suggests that consumers avoid brands 
when their emotional-branding promises are viewed as 
inauthentic and, conversely, that emotional-branding strate- 
gies succeed when they can function as an authenticating 
narrative for consumers’ identity projects. These ideas are 
congruent with research indicating that consumers’ most 
valued brands are those whose symbolic meanings play an 


important role in their self-conceptions (Fournier 1998; 
Holt 2002) and with findings that consumers can be unfor- 
giving when trusted brands seem to violate their emotional- 
branding promises (Aaker, Fournier, and Brasel 2004). 
However, a lack of perceived authenticity may not be a 
major stumbling block for brands that do not seek competi- 
tive advantage through emotional branding. In addition, 
brands that lack iconic status (i.e., market challengers or | 
followers) may be less susceptible to attacks on their 
authenticity because such brands may fly under the radar of 
antibrand activists. Thus, further research that considers a 
broader spectrum of brands, consumers, and contexts is 
necessary to evaluate the robustness of our theory and 
findings. 
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Does the Internet Promote Better 
Consumer Decisions? The Case of 
Name-Your-Own-Price Auctions 


The Internet has enabled consumers to make more informed decisions more conveniently with apparently more 
efficient price-clearing mechanisms than was available before its advent. One such mechanism is the name-your- 
own-price auction. The authors study the extent to which decisions made in such an auction are rational in relation 
to an economic model. The results from two independent markets show that a large number of consumers do not 


exhibit rational decision making. 


onsumers of their offers and for consumers to 
respond to those offers than has been possible in tra- 
ditional retail channels (Bakos 1997; Brynjolfsson and 
Smith 2001; Geoffrion and Krishnan 2003). Several retail 
forms have evolved on the Internet, such as single-store 
offers, comparison sites with offers across multiple stores, 
and various types of auctions. One type of auction is the 
name-your-own-price mechanism (e.g., www.priceline. 
com), in which consumers bid for a product on sale at an 
unrevealed retailer's price. At Priceline, consumers are 
allowed to bid only once within seven days for a specific 
product (e.g., a flight); however, they can partially circum- 
vent the time constraint (Fay 2004). Other name-your-own- 
price retailers have extended the model to allow multiple 
bidding for the same product. 

Ideally, auctions enable a better market-clearing price 
between a buyer and a seller, assuming that both are ratio- 
nal. Sellers hope to extract the maximum buyer surplus by 
completing a transaction at the highest price that a buyer is 
willing to pay above their own marginal cost for that item. 
Buyers hope that they can obtain the lowest price a mar- 
keter is willing to take below their own reservation price. 
However, because buyers do not know the sellers' marginal 
cost, the only way a buyer can obtain such a good deal is to 
begin with a low price and raise it in increments up to his or 
her reservation price until the seller agrees to the price. 
Economic theory suggests the specific sequence of prices 
that a rational, price-minimizing consumer should adopt. 
However, to do so, buyers need time, thoughtfulness, and 
planning, all features that the Internet is supposed to pro- 
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vide. The specific issue we address in this article is whether 
consumers’ actual bids fit the pattern suggested by eco- 
nomic theory in the case of name-your-own-price auctions. 
In general, we ascertain the extent to which shopping on the 
Internet facilitates consumers’ rational, price-minimizing 
decisions. 

The literature on name-your-own-price auctions is 
sparse. Some studies focus on the specific design of Price- 
line but do not provide empirical data (Chernev 2003; Ding 
et al. 2005; Fay 2004). A few studies analyze consumer 
characteristics on the basis of data on such auctions, but 
they do not examine whether consumer behavior is rational 
(Hann and Terwiesch 2003; Spann, Skiera, and Schifers 
2004). The current study analyzes the empirical behavior of 
consumers and compares it with the economic norm to 
ascertain the extent to which it is rational. 

We conduct this analysis in three steps. First, we ana- 
lyze whether observed bidding patterns follow the norma- 
tive patterns. Second, we analyze what factors explain devi- 
ations from rational behavior. Third, we analyze the effect 
of these possible influencers of rational behavior on the 
value of consumers’ bids. We apply our analysis to two 
large data sets of consumers’ bids using a name-your-own- 
price auction: (1) a retailer selling flights from one country 
to various international and domestic destinations and (2) a 
low-cost airline selling its own flights within Europe. 

We organized the remainder of this article as follows: In 
the next section, we describe the expected behavior of con- 
sumers at the name-your-own-price auction. Then, we 
describe the data and our analysis and present the results. 
Finally, we discuss the implications of our findings. 


Expected Behavior at Name-Your- 
Own-Price Auctions 


Name-Your-Own-Price Auction 

In a name-your-own-price auction, any consumer who bids 
above a seller's unrevealed threshold price receives the 
product at the price of his or her bid. In case of limited 
availability, consumers who are the first to bid above the 
threshold are served first. In contrast, a standard auction 
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determines the winning bidder as the one who places the 
highest bid (if bidding to buy) or the lowest bid Gf bidding 
to sell) among rival bids. 

Two broad types of name-your-own-price auctions are 
prevalent. In one type, firms more or less perfectly limit 
consumers to a single bid for a specific product. An exam- 
ple of such an auction is that of Priceline (Fay 2004). In the 
other type, firms allow consumers to bid again if their pre- 
vious bid has been rejected. Several examples of the latter 
are available in Europe. In the latter type, if the seller 
accepts a buyer’s bid, a purchase occurs because con- 
sumers’ bids are binding (e.g., Priceline requires consumers 
to provide their credit card information before bidding). If a 
consumer’s bid is unsuccessful, he or she can either with- 
draw or submit another bid (see Figure 1). Rejected bids 
provide some information about buyers and sellers. By bid- 
ding, consumers provide the lower limit of the range of 
their reservation price at that moment. In rejecting the bid, 
sellers indicate the lower limit of the range of their thresh- 
old price at that moment. According to economic theory, the 
threshold price should represent the sellers marginal cost 
for the product unless the seller hopes to attract consumers 
with higher prices. 


Buyers’ Optimal Behavior 


How should buyers bid in a name-your-own-price auction 
such as that which we previously described? Economic 
theory provides a framework to answer this question, 
assuming that consumers are rational and want to minimize 
the prices they pay. Spann, Skiera, and Schüfers (2004) 
develop a model that arrives at the consumer’s optimal strat- 
egy in terms of the optimal number of bids and the optimal 
prices of those bids. They apply the model to impute con- 
sumers’ willingness to pay and bidding costs. We used their 
model to derive the optimal pattern of bid prices. Here, we 
describe the economic intuition of the model and derive the 
optimal pattern of bid prices. (Readers may go to the origi- 
nal article for details of the model development.) 

In this model, consumers maximize their surplus, which 
is the difference between their reservation price and the 
price they pay. The latter is the amount of their bid if it is 
successful (i.e., above the firm’s threshold price, which is 
unknown to consumers). Because this threshold price, and 
thus the success of a bid, is uncertain to consumers, they 


maximize their expected surplus (1.e., surplus x probability 
of success). Therefore, consumers need to form a belief 
about the probability distribution of the unknown threshold 
price. By submitting a bid, consumers accrue costs that 
reduce their surplus. The costs of a bid include the cost of 
thinking to determine the optimal amount of the bid, the 
transaction costs for the time and trouble of bidding, and 
the costs of waiting for the acceptance or rejection of a bid 
(Shugan 1980; Tellis 1986). 

Consumers influence their surplus and the probability 
that a bid is successful by the size of their bid. On the one 
hand, the consumer can increase the probability that the bid 
is successful (i.e., above the seller's unrevealed threshold 
price) by increasing the amount of the bid. On the other 
hand, a higher bid leads to a lower consumer surplus in the 
case of a successful bid. Thus, the consumer must solve this 
trade-off between increasing consumer surplus and decreas- 
ing the probability of a successful bid. Consumers do not 
bid if they expect a negative surplus. 

The consumer has the possibility of rebidding if a bid is 
not successful. In this case, consumers update their belief 
about the distribution of the threshold price because an 
unsuccessful bid signals a threshold price that is greater 
than the bid. Thus, the updated distribution for the threshold 
price is left-truncated at the value of the unsuccessful previ- 
ous bid. The optimal bid price of a repeated bid is then 
determined on the basis of this updated belief (for the for- 
mal model, see Appendix A). 

The optimal pattern of bid prices has three norms (for 
the proof, see Appendix B). First, consumers should begin 
with the lowest price that does not seem completely unrea- 
sonable given their reservation price and belief about the 
threshold price. The reason is that consumers do not know 
how many seats are left, under what duress the seller oper- 
ates, and what the seller’s marginal cost is. A reasonably 
low price can gather information about these factors. 

Second, if their bids are rejected, buyers should increase 
them up to their reservation price, until the seller accepts 
the bid. The reason for increasing the bid is to offer a price 
that 1s more acceptable to sellers, as long as it is below the 
buyer's reservation price. 

Third, if not under pressure of time, buyers should 
increase their bids in decreasing size of increments. The 
reasoning behind this third norm is the most difficult to 


. FIGURE 1 
Multiple Bidding at à Name-Your-Own-Price Retaller 
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submitting the Ith bid 





eSeller decides about 
accepting the ith. bid 
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appreciate. The logic is as follows: Costs accruing with 
each bid reduce consumers’ surplus. Given higher costs, the 
optimal strategy is to reduce the number of bids necessary 
to be successful. Consumers increase the probability of 
each bid being successful by bidding higher than their prior 
bid. However, at the same time, as they bid closer to their 
reservation price and the seller’s unknown threshold price, 
they do not want to bid higher than is necessary to avoid 
giving up too much of their surplus to the seller. Therefore, 
they should increase their bid in smaller increments. 

If we assume rational, price-minimizing buyers, the 
optimal bidding strategy is characterized with monotoni- 
cally increasing bids, which have decreasing increments as 
long as consumers accrue bidding costs. We refer to such 
bidding sequences as being "strongly rational." If bidding 
costs are zero, consumers' bids monotonically increase at a 
constant rate. However, such costs are unlikely to be zero 
(e.g., Hann and Terwiesch 2003; Spann, Skiera, and 
Schüfers 2004). If consumers are under pressure or if they 
panic about not winning a bid, they may bid with increasing 
increments. We refer to this case as being "weakly rational.” 

Supply constraints, such as limited seats, can affect the 
probability of whether a consumer's bid 1s successful. 
Rational consumers would make estimates of these supply 
constraints as best they can before their first bid. They 
incorporate this risk of a supply constraint in their belief 
about the probability of winning the bid and the distribution 
of the unknown threshold price. They estimate the threshold 


price to be higher than when there are no constraints. 
Because rational consumers determine ex ante their optimal 
bidding behavior by simultaneously determining the opti- 
mal number of bids and optimal bid prices, the belief about 
supply constraints is already incorporated into their optimal 
bidding strategy. Next, we examine real markets to deter- 
mine whether and to what extent observed buyer behavior 
conforms to the normative behavior we described previ- 
ously (again, assuming rational, price-minimizing buyers). 


Data 


Study 1: Name-Your-Own-Price Retailer 


Our first study examines the data of a name-your-own-price 
retailer based in Germany that sells airline tickets for vari- 
ous airlines and allows multiple bidding. Our data set con- 
sists of all bids at the name-your-own-price retailer for a 
period of 11 months between February 2000 and December 
2000. We have 6539 bidding sequences and a total of 
12,999 individual bids for flights from Germany to 86 dif- 
ferent international and domestic destinations. A bidding 
sequence consists of the bids of a specific consumer for a 
specific destination. Table 1 shows the distribution of bid- 
ding sequences by bid length and the distribution of con- 
sumers by destinations for which they bid: 7896 of all bid- 
ding sequences consist of only one or two bids, and 8896 of 
all consumers bid for only one destination. Table 2 shows 


TABLE 1 
Description of Data from Name-Your-Own-Price Retailer 


Distribution of Bidding 


Distribution of Consumers 


Sequences by Bid Length by Destinations 

Number of Destinations Number of 
Bid Length Sequences Bida Consumers 
1 3647 1 4965 
2 1477 2 492 
3 634 3 110 
4 347 4 25 
5 173 5 12 
6 89 6 12 
>6 172 7 4 
All 6539 All 5620 
aDestinations for which a consumer bids. 

TABLE 2 
Number of Destinations by Geographic Region (Name-Your-Own-Price Retaller) 

Geographic Region Destinations Sequences Proportion (%)# 
Germany (domestic) 7 1160 18 
Europe 32 3397 52 
North America 21 1240 19 
Rest of world 26 742 11 
All 86 6539 100 


«Proportion of sequences per geographic region according to all sequences. 
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the number of destinations by geographic region: Fewer 
than one fifth of all bidding sequences are for domestic 
flights, but a majority of sequences are for flights within 
Europe. 


Study 2: Name-Your-Own-Price Low-Cost Airline 


Our second data set consists of the application of the name- 
your-own-price auction during a promotional offering by a 
European low-cost airline. Low-cost airlines (e.g., South- 
west, easyJet, JetBlue) differ from traditional airlines in 
several ways. First, they offer “no-frills” service at low 
prices. Second, they treat each flight as a separate product; 
that is, they do not apply any booking restrictions, such as a 
maximum stay restriction between the outward and the 
return flight. Third, they usually do not serve transcontinen- 
tal destinations. Our data set consists of all bids for flights 
on this airline from a name-your-own-price auction during a 
promotional offering (from Friday, September 17, to Sun- 
day, September 19, 2004). Consumers could bid for 14 dif- 
ferent flight destinations in Europe on 30 different dates 
(between the end of September 2004 and the end of October 
2004). We have 3598 bidding sequences and 12,280 indi- 
vidual bids for these destinations. Table 3 shows the dis- 
tribution of bidding sequence by bid length and the distribu- 
tion of consumers by destinations for which they bid: 53% 
of all bidding sequences consist of only one or two bids, 
and 80% of all consumers bid for only one destination. 


Modeling Bidding Behavior 

We analyze whether empirically observed patterns match 
the normative pattern of behavior (assuming rational, price- 
minimizing buyers). We do so through a hierarchical model 
that consists of two stages (e.g., Chandy et al. 2001). In the 
first stage, we estimate each consumer’s pattern of bidding. 
In the second stage, we ascertain whether the patterns of 
bidding estimated in the first stage vary by characteristics of 
consumers and their bids. 


First-Stage Analysis 


Classification of bidding patterns. We consider only 
bidding sequences that are longer than two bids for a single 


itinerary. To capture various bidding patterns parsimo- 
niously, we fit the following quadratic function to each 
sequence of bids: 


(1) Bpa =Q XIa +O, xi, +O, +8, — (4, > 0), 
where B,, is the bid value; I, , is the number of a bid in a 
kth bidding sequence; &,, 0»,, and 0, are parameters that 
capture the shape of the quadratic function; and &,, are 
residuals, assumed to follow a normal distribution identi- 
cally and independently. We estimate Equation 1 by mini- 
mizing the sum of squared residuals for each individual 
sequence to capture consumer heterogeneity. 

On the basis of the estimated parameter values 0), and 
O2% the first couple of bids, and the last bid number I, max 
of each (kth) bidding sequence, we classified individual 
sequences into six different bidding patterns (see Figure 2). 
The constant term (3, is irrelevant for classification. We 
begin by explaining the identification of the two simplest 
patterns (Pattern 2 and 6) and then move on to the more 
complex patterns. 

Pattern 2 reflects linearly increasing bids. A sequence 
follows Pattern 2 if the quadratic term has a parameter of 
Zero (Qt = 0) and the linear term has a parameter of posi- 
tive value (0 > 0). 

Pattern 6 reflects constant or linearly decreasing bids. A 
sequence follows Pattern 6 if the quadratic term has a 
parameter of zero (0); = 0) and the linear term has a param- 
eter of negative value or zero (0 S 0). 

Pattern 1 reflects increasing bids at a decreasing rate; 
the last bid still is the highest in the sequence (i.e., only the 
upward portion of an inverted U shape). À sequence follows 
Pattern 1 if the parameter of the linear term is positive 
(OQ, > 0), the parameter of the quadratic term is negative 
(Gy < 0), and the value of the last bid of the sequence is the 
highest (By Imax > By poa. 1). Appendix C shows that these 
conditions are met when 


2 0»0,^» . 
a i 2X lema 


Pattern 4 reflects bids whose values follow a full 
inverted U shape. A sequence follows Pattern 4 if the 


TABLE 3 
Description of Data from Low-Cost Alriine 





Distribution of Bidding 
Sequences by Bid Length 


Number of 
Sequences 


OD 
A 
r- 
© 
5 
a 
5 


922 
981 
600 
322 
227 
156 
390 
3598 


oh a TU ee 


aDestinations for which a consumer bids. 
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Distribution of Consumers 
by Destinations 


Destinations Number of 
Bida Consumers 
1 1937 
2 377 
3 110 
4 22 
5 IL 
6 1 
7 — 
All 2389 


FIGURE 2 
idealized Patterns of Quadratic Function 
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of Bids 
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Number 
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parameter of the quadratic term is negative (oy < 0) and the 
value of the last bid is less than or equal to the value of the 
previous one (By ymax S Byjgax — 1); that is, the curve 
includes the right, decreasing part of the inverted U shape. 
The parameter of the linear term (Q5) can be positive, zero, 
or negative. Appendix C shows that these conditions are 
met when f 


3 0O>a, 8 
2 2x Ty max — 1 


Pattern 3 reflects bids whose values follow an increas- 
ing curve at an increasing rate (1.e., exponential increase or 
only the increasing part of a U-shaped curve). À sequence 
follows Pattern 3 if the quadratic term has a parameter of 
positive value (a, > 0) and the value of the second bid of 
the sequence is greater than the value of the very first one 
(By 2 > By). The parameter of the linear term (02%) can be 
positive, zero, or negative. Appendix C shows that these 
conditions are met when 


—QL 
(4) 0< ory, > 2k 


Pattern 5 reflects bids whose values follow a U-shaped 
curve. A sequence follows Pattern 5 if the parameter of the 
linear term is negative (Qo, < 0), the parameter of the qua- 
dratic term is positive (0, > 0), and the value of the second 
bid of the sequence is less than or equal to the value of the 
first one (By? S B, 4); that is, the curve includes the left, 
decreasing part of a U-shaped curve. Appendix C shows 
that these conditions are met when 


Pattern 2 





(Pattern 6) . 


$ 
Number Number 
of Bids of Bids 
Pattern 3 
$ 
Number 
of Bids 
Pattern 5 
(5) 0<a, < ik, 


Table 4 summarizes these criteria for classification of the 
patterns. 

Pattern 1 is consistent with strongly rational behavior 
(i.e; a monotonically increasing pattern with decreasing 
increments, as expected by the normative predictions), Pat- 
tern 3 is weakly rational (i.e., a monotonically increasing 
pattern with increasing increments). Pattern 2 is rational if 
bidding costs are zero, a condition that is unlikely to be met 
in real markets. Patterns 4 and 5 are clearly irrational, show- 
ing both increasing and decreasing trends in the same 
sequence. Similarly, Pattern 6 is clearly irrational, showing 
linearly declining bids. 

We do not have sufficient degrees of freedom to test the 
significance of each of the estimated parameters in Equa- 
tion 1. However, we can test the validity of our classifica- 
tion in two ways: First, we test the extent to which people 
within a group vary from the mean of that group. We do this 
by testing the extent to which the group means for each of 
the estimated parameters are statistically different from 
zero, given the alternate hypothesis that they take on the 
values we specify in Table 4. In the cases that the mean 
itself should be zero, as we specify in Table 4, we expect 
not to reject the null hypothesis. Second, we test whether 
the means between groups are different from each other. 

Results. Table 5, Panel A, displays the estimation results 
of fitting the quadratic function to individual bidding 
sequences at the name-your-own-price retailer. On average, 
our model explains 89% of the variance. The results of our 
two tests for these data are as follows: First, in Patterns 1, 3, 
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4, and 5, group means for the parameters of the quadratic Table 4 summarizes. For Pattern 2, the mean value of the 
term (04) and the linear term (054) are significantly differ- parameter of the quadratic term (Q;,) is not different from 
ent from zero and in the expected direction, which unam- zero, which is exactly the characteristic we expected. We 
biguously classifies the pattern according to the criteria that did not observe Pattern 6 in our data. Second, to test differ- 


TABLE 4 
Criteria for Classifying Bidding Patterns 


Range of Parameter of Quadratic Term 
Se RÓÁ 


k = 0A0,» 








Range of 
Parameter of 
Linear Term 






Notes: Numbers in circles show pattem numbers (see Figure 2). Shaded areas represent clearly Irrational pattems (not monotonically 








Increasing). 
TABLE 5 
Proportion of Bidding Patterns 
A. Name-Your-Own-Price Retaller 
Mean Voan Parameter Eoin alee 
Proportion Number Mean k ox 

Pattern Observations (%) of Bids R2 (%) SE (p Value) SE (p Value) 

1 511 36 4.1 94 —22.88 161.02 
2.42 (.00) 12.37 (.00) 

2a 75 5 3.8 98 .00 77.24 
00 (1.00) 5.95 (.00) 

3 325 23 4.5 94 20.35 —23.37 
1.46 (.00) 4.22 (.00) 

4 317 23 4.8 81 —36.01 174.30 
3.25 (.00) 13.71 (.00) 

5 187 13 5.4 78 44.21 —187.68 
5.93 (.00) 22.69 (.00) 
All 1415 100 4.6 89 X?c 1054.72 (.00) 839.60 (.00) 

] B. Low-Cost Airline 

1 808 48 5.6 98 —1.34 12.30 
.08 (.00) .43 (.00) 

2a 206 12 4.0 99 .00 6.59 
.00 (1.00) .38 (.00) 

3 483 29 5.2 98 1.92 —1.10 
.26 (.00) .77 (.15) 

4 140 8 7.5 82 —4.12 22.50 
.64 (.00) 2.64 (.00) 

5 58 3 8.1 77 4.35 —18.42 
1.03 (.00) 4.06 (.00) 
All 1695 100 5.5 96 X26 1333.22 (.00) 822.96 (.00) 

904, = 0. 


bFirst row’ mean parameter for each group of similar patterns; second row: within-group standard errors; p value is in parentheses. 
cChr-square of Kruskal-Wallis test for parameter differences between groups of similar patterns, p value is in parentheses. 
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ences of the means between groups, we used a nonparamet- 
ric test (a Kruskal—Wallis test) because standard deviations 
differ between the groups. The test yields significant differ- 
ences of a, and o», between groups (for bivariate tests of 
differences of means between groups, see Appendix D). 
Only 36% of all bidding sequences fit Pattern 1 (i.e., 
strongly rational), 2346 fit Pattern 3 (i.e., weakly rational), 
and 596 fit Pattern 2 (which would be rational if bidding 
costs were zero). However, Hann and Terwiesch (2003) and 
Spann, Skiera, and Schifers (2004) find mean bidding costs 
(which they call “frictional costs") of approximately $6 and 
$3. Furthermore, 3696 of all sequences are not monotoni- 
cally increasing, which is clear evidence of irrationality. 
Thus, overall, at the name-your-own-price retailer, 64% of 
the bidding sequences range from weakly rational to clearly 
irrational. 

. Table 5, Panel B, depicts the estimation results of fitting 
a quadratic function to individual bidding sequences at the 
low-cost airline. On average, our model explains 9696 of the 
variance. The results of our two tests for these data are as 
follows: First, group means for the parameters of the qua- 
dratic term (Œy) and the linear term (0,) are significantly 
different from zero, except for the parameter Qt, of Pattern 
2 and Q5 of Pattern 3. We already outlined that a), = 0 is 
the characteristic of Pattern 2. In Appendix C, we outline 
that in case of Pattern 3, a, can take a positive value, zero, 
and, to a certain limit, a negative value. Thus, the insignifi- 
cance of Oyy in this case does not reject Pattern 3. Second, 
the Kruskal-Wallis test yields significant differences of 0); 
and Ohy between groups (for bivariate tests of differences of 
means between groups, see Appendix D): 48% of all bid- 
ding sequences fit Pattern 1 (strongly rational), 12% fit Pat- 
tern 2, and 2996 fit Pattern 3. In addition, 1196 of all 
sequences are not monotonically increasing. Therefore, at 
the low-cost airline, 5296 of bidding sequences range from 
weakly to clearly irrational. 

However, bidding at the low-cost airline shows less 
deviation from rational behavior than at the name-your-own- 
price retailer. The reason is that customers of the low-cost 
airline are more attuned to cost minimization and are more 
likely to follow the predictions of the economic model of 
rational bidding than are customer of the air ticket retailer. 


Second-Stage Analysis 


We begin by hypothesizing what factors could explain the 
occurrence of nonmonotonic patterns in the data. Then, we 
show the results of testing these hypotheses. Finally, we 
attempt to explain the values of the bids. 


Explanation of nonmonotonic patterns. 'This subsection 
explains the factors that lead to monotonically increasing 
patterns. The dependent variable is the presence of a mono- 
tonically increasing bid pattern. The independent variables 
are the number of bids in a sequence, the number of desti- 
nations for which a consumer bids, the mean time between 
each consecutive bid of a bidding sequence (1.e., the mean 
interbid time), and the region of the destination (see Equa- 
tion 6). For each independent variable, we develop a 
hypothesis for its effect on the dependent variable. We use a 
binary logit model for the analysis. Thus: 


(6) PR(Mon,) 


1 


- (k eK), 
Bo +B, x NBid, +B, x NDes, , 


+B, x NorAm, +B, x RWor, 


where 


PR(Mon,) = the probability of a monotonically increas- 
ing pattern of the kth sequence, 
NBid, = the number of bids in the kth sequence, 
NDes, x = the number of destinations for which the 
jth consumer in the kth sequence bid, 
ITime, = the mean time between consecutive bids in 
the kth sequence, 
Euro, = a dummy variable (1 if flight destination 
of the kth sequence is in Europe), 
NorAm, =a dummy variable (1 if flight destination 
of the kth sequence is in North America), 
RWor, =a dummy variable (1 if flight destination 
of the kth sequence is in the rest of the 
world, and 
K - the index set of bidding sequences. 


The rationale for the independent variables is the fol- 
lowing: Consumers who bid for more destinations have 
more experience with the name-your-own-price auction 
than those who bid for less destinations. The reason is that 
each destination has its own configuration of dates, times, 
and competitors; consumers' assumptions about supply and 
threshold prices; and outcomes of consumers' bidding. A 
final acceptance indicates how low the seller's threshold 
price is, and a final rejection indicates how high it might be. 
Thus, consumers gain rich knowledge from experience with 
bidding for each destination (Remus, O'Connor, and Griggs 
1996). Learning from experience can compensate for lim- 
ited information and further rational behavior (Arthur 1991; 
Conlisk 1996). Thus, experienced consumers should be less 
prone to behavioral limitations and more likely to behave 
according to the normative model (1.e., rational): 


H;: The number of destinations for which a consumer bids has 
a positive effect on the probability of a monotonically 
increasing pattern. 


After we control for experience, the number of bids 
consumers place within a sequence may adversely affect the 
rationality of their bidding for several reasons. First, if con- 
sumers must take too much information into account, their 
mental capacity becomes limited, and their decisions can 
suffer (Jacoby, Speller, and Kohn 1974; Lurie 2004). As 
consumers make more bids, they are more likely to become 
confused with their prior bids. Thus, they are less likely to 
make rational bids. Second, bidding at a name-your-own- 
price auction requires consumers to develop some belief 
about the retailer's threshold price and update this belief 
when the bid is rejected, which can be difficult (Chernev 
2003). Third, purely by chance, the more bids a consumer 
makes, the greater is the likelihood that the consumer will 
either make errors or become confused. 
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We suspect that, together, these three factors overwhelm 
the small increase 1n experience that may come from merely 
increasing the number of bids in each sequence. Similarly, 
these factors are likely to overwhelm the small decrease in 
learning costs that may occur with increasing lengths of 
bids in a sequence. Thus: 


H,. The number of bids in a sequence has a negative effect on 
the probability of a monotonically increasing pattern. 


In general, consumers' memory and recall of prices is 
low (Dickson and Sawyer 1990; Monroe and Lee 1999; 
Rajendran and Tellis 1994; Vanhuele and Dréze 2002). As 
interbid time increases, consumers are more likely to forget 
about their prior bids. Thus, we hypothesize that the time 
between consecutive bids adversely affects the rationality of 
consumer bidding behavior: 


Ha: The mean interbid time of bids in a sequence has a nega- 
tive effect on the probability of a monotonically increas- 
ing pattern. 


The flight time and cost of the ticket increase with the 
distance between the origin and the destination of a flight. 
Thus, the larger the distance, the greater a consumer can 
save by more careful planning and more rational bidding. 
The greater the saving, the more the consumer will be moti- 
vated to make a well-thought-out, rational decision (Tellis 
and Wernerfelt 1987). Thus, the longer the distance, the 
more a consumer will bid rationally to obtain a good deal. 
Because all bids were for flights originating in Germany, 
we operationalized distance through dummy variables for 
the region of the destination (Europe, North America, and 
the rest of the world); a flight within Germany was the 
baseline category: 

H4a: An international destination has a positive effect on the 

probability of a monotonically increasing pattern. 


Hay: The magnitude of the effect increases (becomes more 
positive) with the distance of the region. 


Results. Table 6 shows the estimation results of the 
binary Logit model for the name-your-own-price retailer 
(second column) and the name-your-own-price offering at 
the low-cost airline (third column). 


In both studies, contrary to H,, the effect of the number 
of destinations on the probability of a monotonically 
increasing pattern is not significant (and is negative). 
Apparently, experience does not mitigate the behavioral 
limitations that lead to irrational bidding behavior (e.g., 
Prabhu and Tellis 2000). In both studies, consistent with H», 
the number of bids in a sequence has a highly significant, 
negative effect on the probability of a monotonically 
increasing pattern. In both studies, consistent with H4, we 
find that mean interbid time of a sequence (i.e., the time 
between consecutive bids) has a negative influence on the 
probability of a monotonically increasing pattern. We mea- 
sure interbid time at the name-your-own-price retailer 
because of the availability at the daily level, whereas we 
have data on the level of seconds for the low-cost airline. In 
Study 2, consistent with H4, an increasing flight distance 
Jeads to a higher likelihood of a monotonically increasing 
pattern. Thus, the magnitude of the effect (positive value of 
the parameter of the dummy variable) increases with the 
distance (the rest of the world consists of destinations in 
South America, Africa, Asia, and Australia). We cannot test 
H; for the second data set of the name-your-own-price 
offering at the low-cost airline, because all flights were 
within Europe and had about the same distance (between 
one and two hours of flight time). 

Both studies yield identical results for the factors that 
influence deviations from normative bidding behavior, indi- 
cating robust effects. However, the proportion of such devi- 
ations is much higher at the airline ticket retailer than at the 
low-cost airline (see Table 5). At the retailer, consumers bid 
for a flight that could be served by various airlines and had 
no knowledge of the actual airline, which was revealed only 
after a successful purchase (Priceline uses a similar policy). 
Thus, the task of generating the bid values is more difficult 
because of this greater uncertainty. At the airline, consumers 
could bid for a flight only at this specific low-cost airline; 
thus, they knew the product. Furthermore, the total duration 
of the offering at the low-cost airline was less than four days 
(1.e., there was less time to forget about previous bids). 


Explanation of bid values. This section analyzes the 
maximum bid value of a bidding sequence. The indepen- 


TABLE 6 
Results of Blnary Logit Model (Probability of Monotonically IncreasIng Patterns) 





Parametersb Retaller (Study 1)? Low-Cost Airline (Study 2)@ 
Constant 1.17 (.00) 2.85 (.00) 
Number of destinations —.06 (.34) —.16 (.11) 
Number of blds —.12 (.00) —.09 (.00) 
Mean interbid timec —.03 (.00) —.08 (.01) 
DV Europed .30 (.06) 

DV North Amencad .49 (.01) 

DV rest of worldd .66 (.00) 

Likelihood ratio test: x2 123.72 (.00) 46.77 (.000) 
Classification goodness (%) ‘ 69.4 88.4 
Number of observations 1415 1695 





ap value is in parentheses. 


bDependent vanable. Prob(Y = 1) is the probability of monotonically Increasing pattern. 
cMean time difference in days (hours) between consecutive bids of a sequence at retaller (low-cost airline). 


dBaseline category. Germany 
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dent variables are the same as in the analysis of nonmono- 
tonic patterns. We use multiple regression to analyze the 
dependent variable: 


(7 MaxBidV, = Bo +8, x NBid, +B, x NDes, , 
+B, x ITime, +B, x Euro, 


+B, x NorAm, +B, xRWon +u,  (keK) 


where MaxBidV, is the maximum bid value of the kth 
sequence and y is the residual of the kth sequence. 

Table 7 shows the estimation results of the regression 
analysis for the maximum bid value of a bidding sequence 
for the name-your-own-price retailer (second column) and 
for the name-your-own-price offering at the low-cost airline 
(third column). In both studies, the maximum bid value 
decreases with the number of destinations. Apparently, 
inexperienced consumers may overbid. Conversely, experi- 
enced consumers who bid for more than one destination are 
more likely to bid lower. Thus, experience does not lead to 
a higher likelihood of rational bidding behavior but reduces 
the consumer's maximum bid value. 

In both studies, the maximum bid value of a sequence 
increases with the number of bids. Thus, even if consumers 
become confused with an increasing number of bids, they 
raise their average bid by the same amount and overbid. 
Interbid time has no significant effect on the maximum bid 
value in both studies. Apparently, the waiting time between 
consecutive bids influences the rationality of the bidding 
pattern but does not have an effect on the maximum bid 
value. The distance of a flight increases maximum bid value 
because consumers correctly anticipate that costs (espe- 
cially fuel costs) increase with distance and that higher 
costs lead to a higher threshold price for a flight. Again, we 
cannot analyze this effect for the data for the low-cost air- 
line because of similar distances of all destinations. 

We found that the same factors explain maximum bid 
value in both studies. This indicates the stability of our 
results for name-your-own-price auctions because both 
studies differ with respect to currency, time of the study, 


duration of data gathering, and offering (multiple airlines 
versus one low-cost airline). 


Discussion 


The popularity of the Internet is supposed to have helped 
consumers make better decisions that are more consistent 
with rational economic theory than was possible before the 
Internet. We analyze bidding sequences for airline tickets at 
a name-your-own-price retailer and at a low-cost airline. We 
find that a majority of bidding sequences are not consistent 
with the predictions of an economic model of a rational, 
price-minimizing consumer. This finding indicates that the 
Internet does not eliminate or lower consumers' irrational 
decisions as many experts expected or hoped. Further 
analysis reveals that the deviations from rationality are 
mostly consistent with a priori expectations. In particular, 
consumers who place many bids with rather long interbid 
times are more likely to bid irrationally, which can be attrib- 
uted to forgetting. Consumers are more likely to bid ratio- 
nally for larger distances, which can be explained by higher 
jnvolvement for or savings from such flights. Consumers 
who have more experience have lower bids on average, 
probably because of the knowledge they have gained. Our 
results have some important implications for managers, 
consumers, and researchers. 


Implications for Managers 


Price-discovery mechanisms, such as name-your-own-price 
auctions, are facilitated by the low transaction costs of trade 
on the Internet. Such mechanisms promise sellers two 
important benefits: First, prices based on the bidding 
process can be consumer specific and allow for better 
extraction of consumer surplus through this segmentation. 
Second, price discovery mechanisms can reduce efforts by 
third parties that post lowest market prices and thus thwart 
firms' price discrimination of consumers. 

Our results indicate that despite the convenience of the 
Internet to consumers, most consumers still do not make 
strictly rational decisions. This finding suggests that firms 


TABLE 7 

Regression Results for Maximum Bid Value 
Parameters Retailer (Study 1)5.d Low-Cost Airline (Study 2)8 e 
Constant 265.93 (.000) 45.20 (.000) 
Number of destinations —8.80 (.098) —3.73 (.000) 
Number of bids 3.45 (.083) 1.57 (.000) 
Mean interbid time? —.08 (.808) .55 (.172) 
DV Europec 121.73 (.000) 
DV North Americae 632.38 (.000) 
DV rest of worldc 901.61 (.000) 
R2 .694 .059 
F-value 531.22 (.000) 35.32 (.000) 
Number of observations 1415 1695 


ap value is in parentheses 


bMean time difference in days (hours) between consecutive bids of a sequence at retailer (low-cost aurline). 


cBasellne category: Germany. 
dBid value In Deutschmark (DM) (DM1 = approximately $.5). 
eBid value In euro (€1 = approximately $1 2) 
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can further segment consumers by their bidding patterns 
and exploit their irrationality for profit. For this purpose, 
firms would need to modify the design of such auctions or 
develop decision rules that alter acceptance of bids on the 
basis of the bidding pattern. For example, if enough seats 
are projected to be available, firms might accept bids early 
from those consumers who show declining bids, as Jong as 
the price is over the firms’ threshold price. Alternatively, 
they could hold off accepting bids from consumers who 
seem to be in a panic mode and show increasing bids at an 
increasing rate. However, such a strategy might not be suc- 
cessful if anticipated by consumers who then try to conceal 
their true characteristics. In general, firms should not rely 
solely on models of behavior that assume strict rationality, 
because such models may not reflect reality and may rec- 
ommend suboptimal pricing strategies. 


Implications for Consumers 


The Internet may allow for more informed shopping and 
more efficient price discovery mechanisms than before. In 
particular, name-your-own-price auctions promise con- 
sumers the option of obtaining a price that is below their 
reservation price but close to a seller’s marginal cost. How- 
ever, Many consumers may not fully exploit these price dis- 
covery mechanisms to obtain as low a price as they hoped. 
This state of affairs may exist because these mechanisms 
are new, because consumers do not fully understand how 
they run, or because consumers are not experienced enough 
to exploit them fully. For example, we find that inexperi- 
enced consumers tend to overbid in name-your-own-price 
auctions, thus offering higher prices to sellers that use such 
mechanisms. Consumer advocate groups could issue warn- 
ings of the irrational bidding patterns that consumers are 
prone to make in such auctions. They could also develop 
advisories that educate consumers on how best to use such 
systems to retain as much consumer surplus as possible 
given supply constraints and their own urgency to travel. 


Implications for Researchers 


The assumption of rationality in economic models has been 
debated for decades. Behavioral economists, such as Simon 
(1955), Kahneman and Tversky (1979), and Thaler (1985), 
have not only questioned the validity of this assumption but 
also proposed alternate theories that do not depend on the 
assumption of strict rationality adopted in traditional eco- 
nomic models. Our results provide doubts about the 
assumption of strict rationality even in the context of the 
Internet, which is supposed to be a boon to consumers. As 
such, models that incorporate more realistic assumptions of 
consumer decision making are likely to have enhanced rele- 
vance and predictive accuracy. 

Furthermore, such enhanced models may enable online 
auctioneers to improve revenue predictions or to extract 
consumer surplus more effectively through higher threshold 
prices, given predictions on the maximum bid values con- 
sumers submit. In addition, these models can allow the 
development of price discovery mechanisms that support 
consumer decision making and attain higher efficiency. 

Although our results are robust across our two empirical 
studies of consumer bidding behavior, they are limited by 
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geography and context. Replications across other markets 
and contexts would be useful to explore nonnormative 
behavior on the Internet further. 

In addition to econometric analyses of behavioral data, 
this phenomenon lends itself to fruitful experimental 
research, in the laboratory and on the Internet. Laboratory 
experiments allow for the testing of specific theories as to 
why consumers behave in irrational ways. The Internet 
enables field experiments easily. Such field experiments can 
explore what ecological contexts promote or mitigate con- 
sumers’ irrational behavior and how organizations can avoid 
or exploit such behavior. 

Finally, our theoretical model does not take into account 
gaming strategies by firms that discover consumers' bidding 
patterns or consumers who expect firms to do so and thus 
hide their patterns. Such models make for interesting game 
theoretic solutions and might be worth pursuing. 


Appendix A 
Consumers’ Optimal Bid Prices 


The consumer optimizes the expected consumer surplus of 
the bid over the bid amount, solving Equation Al. A jth 
consumer’s belief (i.e., expectation) about the seller's 
threshold price p is assumed to follow a uniform distribu- 
tion on the interval [P P] with p, 2 - where n is the con- 
sumer's reservation price (Ding et al. 2005; Hann and Ter- 
wiesch 2003; Spann, Skiera, and Schifers 2004; Stigler 
1961). 


(AD — max ECS, = (r, ~bj,) x Prob(b,, 2 p) 


b Ja, 


+ ECS, . x Prob(b,; < p) - c, 





p,-b 
+ ECS x =— - c 
Py By 


J-l 
subject to ECS, 20,b,, Sr visn, 


The jth consumer’s expected consumer surplus ECS; ; 
of the first bid b, ; has two components in this model (Equa- 
tion A1). The first component represents the expected con- 
sumer surplus in case of a successful bid, which is weighted 
with the probability that the first bid is successful. Thus, the 
expected consumer surplus accounts for the bidding costs 
c,» Which accrue by submitting the ith bid. The second 
component illustrates the expected consumer surplus ECS, 2 
of a second bid, which is weighted with the probability that 
the first bid is unsuccessful. As such, ECS, 2 consists of the 
expected consumer surplus of a second bid and further bids 
beyond the second, if the previous bids are not successful 
and if it is still beneficial for the bidder to make these addi- 
tional bids. Thus, the consumer surplus of further bids 
beyond the ith bid is recursively included in the formula for 
the consumer surplus of the ith bid. | 

The consumer simultaneously determines the optimal 
number of bids and the optimal bid prices by solving this 


TABLE A1 
Bld Values of Optimal Bidding Strategles for One to Six Blds 








One Bid Two Bids 
1 1 "ua RU 
First bid bf, = 5 toh bř = aB a + 3012 
; — m 
Second bid j273P +31 +3%2 
Three Bids Four Bids 
«€. 2 iis 4 
First bid bř = abi tat Cio t aua bf, = gPi* gh * gCi2 * gCja tee 
ote eae 4 ua ee ee 6 4 
second bid Dip = ab ta ties tica Dto = Bet gi tee * aT ES 
je: 14, ,3, 1 2 uu" E 4 6 
Third bid ja 7 4B) * 41 * 4042 * 4°13 Dis = BP * gi tee + gis + ge^ 
: 1,,4, 1 2 3 
Fourth bid D = sP tEh tE tEC s to, 
Five Bids Six Bids 
5 1 6 1 6 
First bid bf, = ePi tah Du 7P * 7'j * 26,2 * Z0 
5 4 3 2 4 3 2 
+ e 2 + it + 4 + 65 + 7C,4 + 70,5 + 7°46 
«A. Le ye SBS 10 
Second bid bi = EP) + eh bie =SPpt zh * 262 * 7 61a 


4 8 6 4 


3 6 9 6 12 9 6 
+ acie + "x + ea tatie + 794 + 70,5 + 7°16 
2 4 ' PE, 4 3 6 
Fourth bid bja =P tah Dia = SP * 7f * 7652 * 76,3 
8 +26 +126 Sc 
T 6012 + 613 + 614 + 6 25 7 1,4 7 45 7 L6 
s 1 5 ..2 5 2 4 
Fifth bid bts = abita Dis = 7B) FSF z€,2 * 70,3 
2 4 10 
TERN SUN 
Sixth bid L8 7 oP T 7l + 7C,2 i 5°13 


3 4 5 





optimization problem. The optimal bidding strategy is to optimal bidding strategy can be determined by the algo- 
submit a further bid, if the previous one is declined, as long rithm that Spann, Skiera, and Schüfers (2004) propose. 

as the expected consumer surplus of this bid is not negative. We can state the optimal bid values for different optimal 
Thus, the optimal number of bids n, is determined. The numbers of bids n, according to the following general for- 
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mula (for bid values of optimal bidding strategies for one to 
six bids, see Table A1): 


pila, n +1 ae os ie 
1 . 
(n, +1-i)xd-D 
QW TU 
2, n, +1 i 
for 1 2 2 


Jm" 


n 
I (n, +2-m)xı 
+ $ e 
n +1 
m=1+1 J 


for 1 «n 


Because rational consumers determine ex ante their optimal 
bidding behavior, bidding costs influence their ex ante 
determination of bidding behavior and thus are not sunk. 


Appendix B 
Proof of Norms for Bidding 
Patterns 


P4: Any two consecutive bids b; 1- Il and b; aln, have positive 
. * *- $ 
increments Ab, iln, = Byala, -= bjii- In, > 9. 


Proof. Assuming constant bidding costs c; = c,; for each 


ith bid, we can state the general rule for the ith increment, 


which we derive from Equation A2: 
(Bl Ab*, =b*, -b* E tke ) 
jaln, = Jila, Ji-lln, e n, +] ) Bj 





zii] 
E [S Pisas enxi-n 
J 


a 41 6 7 Bj 


(n, -Dxn,—4 
*e X —L 3 —--2). 
2x(n, +1) 


Because Equation B1 decreases in the bid number i (for 
positive costs, which we develop a proof for subsequently), 
the increment value is lowest for the optimal bid number i = 
nj. Thus, increments are positive for all i € n, if (inserting i = 
n; into Equation B1): 


1 c 
DR MENT 
Pe P ala, n +1 5 - p) n+l 


(n, -])xn, 
x TT 2, + Dx, - 2) >0 





1 n 
=— x| (r, -p )+c,x 
n+l a a ae 


2 —n 


Ee 





>0 rj -p >c xX 
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Thus, for bidding costs of zero, increments are positive for 
rj > p. The value of Equation B2 could become negative 
for high values of c; and nj. However, surplus-maximizing 
consumers would never place such a bid with a negative 
increment relative to the previous bid, because it would lead 
to a negative expected consumer surplus of this specific bid: 


b, -b 
(B3) ECS , — (rj - b.) x E——E— - c, «0 
, p,— b. _ i J 
J ^M 

forb,,—b,, _, <0 and p, > bi sii 

The second norm for the rational bidding pattern is 

given in P}. If any two consecutive bids have positive incre- 

ments, the first bid a consumer submits is the lowest, 
reflecting the first norm. 

P3: The increments between consecutive bids decrease by the 

amount of the bidding costs. Thus, the normative bidding 


pattern has decreasing increments for positive bidding 
costs. 


* — Ah* 
b, APS — if 


(B4) Ab 


— —c, <Q forc, > 0. 
j j J 


Proof. If we assume constant bidding costs c, = c; for 
each ith bid, 


(B5) Ab. i Ab, Ila, E Ca 7 b m " 


a (or — In, = b. -— je 


-—  x(- py 


n,+1 


-Dxn,-4 
ta Itt acm 
2X (n, +1) 


1 
[zen 
(n, -1)xn,-4 
ees PETERS i-n-n 
J 


-cj-6-2)*-3)] 


= c, Ci +2+i-3)= c C1) =c. 


Or 


ð Ab* 
(B6) ' nece 
di J 
Thus, for positive and constant bidding costs, incre- 
ments are decreasing on the basis of P,, which is the third 
norm for the rational bidding pattern. For bidding costs of 
Zero, increments are constant (and positive). 


Appendix C 
Proof for Criteria for Classification 
of Four Complex Bidding Patterns 


Pattern 1 


Pattern 1 reflects increasing bids at a decreasing rate; the 
last bid is still the highest in the sequence (i.e., only the 
upward portion of an inverted U shape). A sequence follows 
Pattern 1 if the parameter of the linear term is positive 
(Oy, > 0), the parameter of the quadratic term is negative 
(Gy < 0), and the value of the last bid of the sequence is the 
highest (By Imax > By ta — 1). Inserting Equation 1 for bid 
values, this condition can be transformed as follows: 


(CD à X (T as)? + Oak X Iy max + Ogg i X (I max 7 D? 


+ Oo X (Ikma — 1) + Oy 


e Ou X (D rae t Moy x Li rax 


S 0> Qk X C2 X L ma tI) 02, 


e 0»a,- i 


Pattern 4 


Pattern 4 reflects the full inverted U shape. A sequence fol- 
lows Pattern 4 if the parameter of the quadratic term is neg- 
ative (Ot; < 0) and the value of the last bid is less than or 
equal to the value of the previous one (By Imax S By pax - 1); 
that is, the right, decreasing part of the inverted U shape is 
included. The parameter of the linear term (O54) can be 
positive, zero, or negative. Inserting Equation 1 for bid val- 
ues, this condition can be transformed analogously to 
Pattern 1: 


k 
C2 0» G4 € — 2E —. 
x D E 


Pattern 3 


Pattern 3 reflects an increasing curve at an increasing rate 
(i.e., exponential increase or only the increasing part of a U- 
shaped curve). A sequence follows Pattern 3 if the quadratic 
term has a parameter of positive value (a, > 0) and the 
value of the second bid of the sequence is greater than the 
value of the first one (By ? > B, ;). The parameter of the lin- 
ear term (05) can be positive, zero, or negative. Inserting 
Equation 1 for bid values, this condition can be transformed 
as follows: 


(C3) a, X2? t 05, X2 Ga, >a, X12? + 0, X1 04, 
€» Oy X44 04 x2»ay xlc-05, xl 


OO, X3ro,, x1»0 


S 0< Oy >, 


Pattern 5 


Pattern 5 reflects a U-shaped curve. A sequence follows 
Pattern 5 if the parameter of the linear term is negative 
(œx < 0), the parameter of the quadratic term is positive 
(04, > 0), and the value of the second bid of the sequence is 
less than or equal to the value of the first one (By? € By 1), 
that is, the left, decreasing part of a U-shaped curve. Insert- 
ing Equation 1 for bid values, analogously to Pattern 3, the 
conditions for Pattern 5 reduce to the following: 


(C4) 0«o s s- 


Appendix D 
Bivariate Tests of Differences of 
Means Between Groups 


Table D1 displays the results for bivariate t-tests (account- 
ing for different distributions within groups) of differences 
of means between groups for &y and o. Thus, each cell in 


TABLE D1 
Bivariate Tests of Differences of Means Between Groups 





A. Name-Your-Own-Price Retailer 





Parameter 04, Parameter a, 
Pattern 1 2 3 1 2 3 4 
2 —9.44 (.00) 6.10 (.00) 
3 —15.27 (.00) —13.91 (.00) ; 14.11 (.00) 13.79 (.00) 
4 3.24 (.00) 11.10(.00) 15.83 (.00) —.71 (.47) -8.49(.00)  —13.78 (.00) 
5 —10.48 (.00) -7.46 (.00) -3.91 (.00) -11.87(.00) 13.49 (.00) 11.29 (.00) 7.12 (.00) 13.65 (.00) 
B. Low-Cost Airline 
2 —17.81 (.00) 9.96 (.00) 
3 —12.12(.00) -7.44 (.00) 15.24 (.00) . 8.95 (.00) 
4 4.29 (.00) 6.40 (.00) 8.71 (.00) —3.81 (.00) -5.97 (.00) . -8.59 (.00) 
5 —5.49 (.00)  —4.20(.00) -2.28 (.03) -6.95(.00) -7.53 (.00) | 6.14 (.00) 4.20(.00) | 8.46 (.00) 


alndependent sample bivanate t-test (accounting for different distributions within groups) of differences of means between groups; p value Is In 


parentheses. 
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Table D1 gives the t-value (and p value) for a bivariate test 
of differences of means between groups for parameter 01, 
(left matrix) and parameter 0 (right matrix) for both stud- 
ies (Table D1, Panel À; Table D1, Panel B). Table D1 shows 


that all bivariate group means are significantly different 
from one another except for the means of parameter 05 
between Pattern 1 and Pattern 4 in Study 1 (name-your- 
own-price retailer). 
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Anthony D. Cox, Dena Cox, & Gregory Zimet 


Understanding Consumer 
Responses to Product Risk 
Information 


Two experiments examine how message framing moderates consumer responses to product risk information. The 
findings suggest that contrary to an influential theory, consumers exposed to loss-framed messages exhibit a gen- 
eral aversion to product risk involving both short-term adverse effects and more permanent harm. In contrast, con- 
sumers exposed to gain-framed messages differentiate among different types of product risk. They essentially 
ignore temporary product risks but give considerable decisional weight to risks of permanent harm. This article dis- 
cusses the implications of these findings for those who design and regulate promotional messages that contain 


product risk disclosures. 


ith increasing frequency, consumers are asked to 
Wes product risk information (i.e., informa- 
tion that product use may lead to negative conse- 
quences). For example, advertisements for tobacco products 
have long been required to disclose the health hazards asso- 
ciated with their use, and promotional materials for finan- 
cial investments often contain caveats about potential finan- 
cial losses (Fitch 1997; Fox et al. 1998). However, the most 
dramatic growth in advertised product risk disclosures has 
come from a relatively new category of advertising: direct- 
to-consumer (DTC) advertisements for complex medical 
products. Over the past 15 years, there has been an explo- 
sion of such advertising; total expenditures on DTC pre- 
scription drug advertising has grown more than 200-fold, 
from $12 million in 1989 to more than $3 billion in 2003 
(Calfee 2002; Thomaselli 2004). Most such advertising is 
required to contain information on product risks. However, 
as Cohen (2002, p. 172) notes, there is "continuing concern 
that consumers may not adequately comprehend the bene- 
fits and risks of using such highly technical products.” 
There are two kinds of product risk information that 
consumers may consider when evaluating such products. 
First, there are product risk disclosures (i.e., explicit state- 
ments of the potential adverse outcomes resulting from 
product use). For example, Food and Drug Administration 
rules require that any DTC advertisement that mentions 
both the name of a prescription drug (e.g., Celebrex) and its 
function (e.g., treatment of osteoarthritis) must also contain 
information on product side-effect risk—that is, the possi- 
bility of unintended physical harm (ranging from temporary 
discomfort to permanent disability or even death) resulting 
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from product use (for discussions of the rules governing 
such risk disclosures, see Calfee 2002; Menon et al. 2003b). 

Second, consumers may also make inferences about the 
risks of specific products, even when those risks are not 
explicitly stated. For example, a consumer may perceive 
that certain product forms (e.g., injections, radiological 
scanning) entail inherent risks. In addition, a growing num- 
ber of researchers have suggested that consumers perceive 
certain product functions (e.g., problem detection) as inher- 
ently riskier than others (see Meyerowitz and Chaiken 
1987; Rothman and Salovey 1997). 

It is important for marketers and policy makers to 
understand how consumers evaluate product risk informa- 
tion. Unfortunately, research on this topic has yielded 
inconsistent findings. Some studies suggest that information 
on product risks lowers consumers' product evaluations 
(e.g., Morris, Mazis, and Brinberg 1989; see also Goetzl 
2001); however, others suggest that consumers frequently 
ignore product risk information (e.g., Menon et al. 2003a), 
and still others suggest that product risk information can 
actually enhance consumers' product evaluations (e.g., 
Goetz] 2000; Rodale Publications 1999). 

In an effort to explain these contradictory findings, the 
current study proposes a potential contingency variable that 
may moderate the effect of product risk information on con- 
sumers' product evaluations: message framing. Any promo- 
tional message can stress either the potential gains from 
adopting a behavior or the potential losses from not adopt- 
ing it. For example, a loss-framed advertisement promoting 
flu vaccinations would stress the potential health costs of 
not getting vaccinated (e.g., getting influenza), whereas a 
gain-framed advertisement would stress the protective ben- 
efits of getting vaccinated (e.g., avoiding influenza). 
Although these two messages might contain the same 
essential information, research suggests that they elicit dif- 
ferent consumer responses (for a review of framing 
research, see Levin, Schneider, and Gaeth 1998). Relevant 
to the current study, there are theoretical reasons to believe 
that message framing may moderate how consumers 
respond to product risk. 
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Function/risk theory (e.g., Meyerowitz and Chaiken 
1987; Rothman and Salovey 1997) draws on Kahneman and 
Tversky’s (1979) prospect theory to suggest that negatively 
framed messages increase consumers’ willingness to use 
high-risk products (particularly those with a problem detec- 
tion function), whereas positively framed messages increase 
the attractiveness of low-risk products. In contrast, other 
theories (e.g., Adaval 2001; Isen 2000; Yuen and Lee 2003) 
seem to suggest the opposite, positing that consumers 
exposed to positive, upbeat stimuli should be more willing 
to tolerate certain types of product risks, whereas those 
exposed to negative stimuli should exhibit greater aversion 
to such risks. 

This article examines these theories and discusses their 
(sometimes conflicting) implications for understanding how 
consumers evaluate product risk. Specifically, we address 
three central research questions: 


1. How do consumers assess product risk? Specifically, how 
are product risk perceptions influenced by both explicit 
product risk disclosures and information on product func- 
tion (e g., problem prevention versus problem detection). 

2. How do consumers weigh perceived product risks 1n form- 
ing overall product evaluations? 


3. How are the weights given to product risks influenced by 
message framing? 


In the following sections, we discuss previous research and 
theory relevant to these questions. Then, we report the 
results of two experiments that examine these issues empir- 
ically. Finally, we discuss the implications of our findings 
and provide directions for further research. 


Effects of Product Function on 
Perceived Product Risk 


Several researchers have suggested that consumers perceive 
detection products as inherently risky. Meyerowitz and 
Chaiken (1987) first introduced the function/risk hypothesis 
in the context of breast self-examination (BSE). They noted 
(p. 89), "In the short run, BSE is a risky behavior... [It] 
requires that a woman risk aversive consequences in the 
present (e.g., finding a lump, experiencing anxiety) in hopes 
of enhancing future outcomes.... Of course, in the long run, 
not performing BSE clearly entails risks (e.g., failing to 
detect a malignancy). However, because of their temporal 
remoteness, such longer-term considerations may be less 
Salient than short-term considerations.” More recently, 
Rothman and Salovey (1997) reiterated the link between 
product function and perceived risk, stating that detection 
products are considered inherently risky because “one runs 
the risk of receiving significant, unpleasant information” (p. 
5), whereas prevention products (e.g., sunscreen, vaccines) 
are considered low risk because their “salient function ... is 
to provide a relatively certain, desirable outcome” (p. 11). 
The hypothesized link between product function and 
perceived risk is argued persuasively in these articles and 
has gained widespread currency among persuasion 
researchers and practitioners (as we discuss subsequently). 
However, closer examination reveals problems with this 
hypothesis. First, proponents of function/risk theory suggest 
that consumers view adopting detection behaviors as more 
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risky than not adopting them. However, as Levin, Schnei- 
der, and Gaeth (1998) note, this assumes that consumers 
ignore the long-term risks of not being screened (e.g., 
allowing a tumor to grow) and focus solely on short-term 
risks. The finding that consumer disease anxiety tends to 
increase screening behavior, not decrease it (McCaul et al. 
1996), suggests that many consumers adopt a longer-term 
perspective on detection behaviors. 

Second, although some detection products have short- 
term risks, so do many prevention products. Many people 
avoid flu shots for fear of pain and side effects (Centers for 
Disease Control and Prevention 1995); many women do not 
push condom use for fear of partners' reactions (Darroch 
and Frost 1999). If consumers focused only on short-term 
outcomes, they would view these preventive products as 
risky. 

Finally, there is little empirical evidence that risk per- 
ceptions vary with product function. Several studies have 
examined consumer response to products differing in func- 
tion (e.g., Rothman et al. 1999; Rothman et al. 1993), but 
none has shown that detection and prevention products dif- 
fer in perceived risk. 


Effects of Message Framing on 
Consumer Response to Product 
Risk 
Another tenet of function/risk theory is that loss-framed 
messages make consumers more likely to adopt high-risk 
(e.g., detection) products, whereas gain-framed messages 
increase the attractiveness of low-risk (e.g., prevention) 
products. In reviewing previous framing studies, Rothman 
and Salovey (19977) note that loss-framed messages seem to 
be more persuasive in studies involving detection products, 
whereas gain-framed messages seem to be more effective in 
studies involving prevention products. To explain this 
apparent difference, they note that prospect theory (Tversky 
and Kahneman 1981) suggests that consumers prefer riskier 
options when outcomes are framed as losses but prefer low- 
risk options when outcomes are framed as gains (see also 
Meyerowitz and Chaiken 1987, p. 501). Thus, they con- 
clude that loss-framed messages will persuade people to 
adopt detection (and other risky) products and that gain- 
framed messages will persuade people to adopt prevention 

(and other low-risk) products. 

This hypothesis is compelling and, if valid, has impor- 
tant marketing implications. It suggests that marketers pro- 
moting prevention products (e.g., vaccinations) should use 
gain-framed messages, whereas those promoting detection 
behaviors (e.g., HIV testing) should use loss-framed mes- 
sages. More broadly, it suggests that any product that con- 
sumers perceive as risky is most effectively promoted with 
loss-framed messages. 

This theory has become increasingly influential among 
both persuasion researchers and practitioners. Rothman and 
Salovey's (1997) article has been cited more than 100 times 
(Thomson 2004), and the function/risk hypothesis is 
becoming established doctrine among social marketers (see, 
e.g., Arkes 2004; Ferguson et al. 2003). However, despite 
increasing acceptance of this hypothesis, there are unre- 


solved questions about both its empirical support and its 
theoretical foundation. First, although some studies find 
loss framing more persuasive in promoting detection and 
find gain framing more persuasive in promoting prevention, 
other studies find just the opposite. For example, Robberson 
and Rogers (1988), Homer and Yoon (1992), and Blanton 
and colleagues (2001) all find loss framing more effective 
in promoting prevention behaviors (exercise, mouthwash, 
and condoms, respectively). 

Second, in most studies cited to support function/risk 
theory, function is confounded with other product traits. For 
example, mammograms and sunscreen differ not only in 
function but also in the problems they address, cost, dis- 
comfort, and so forth. If framing effects differ between 
these two products, which of these product traits accounts 
for the difference? The only study to manipulate product 
function while holding other product attributes constant 
(Rothman et al. 1999, Study 1) does not find the predicted 
two-way interaction between function and framing. 

Third, the function/risk hypothesis has theoretical prob- 
lems. Assume that consumers view detection as high risk 
and prevention as low risk. Does the function/risk hypothe- 
sis then follow logically from prospect theory? Indeed, 
Tversky and Kahneman (1981) find that when outcomes 
were framed as losses, consumers preferred the riskier of 
two options. However, Tversky and Kahneman use "risk" to 
mean increased variance of potential outcomes; therefore, 
the riskier options in their experiments had both greater 
upside potential and greater downside potential than the no- 
risk (or certain) options. Given this definition of risk, 
choosing a risky option seems more reasonable. Prospect 
theory posits that people abhor a certain loss (e.g., with 
Option À, there is a definite loss of $2,000) and thus are so 
attracted to the upside potential of complete loss avoidance 
(e.g., Option B provides a 50% chance to lose nothing) that 
they are willing to tolerate the downside potential of even 
greater losses (e.g., a 5096 chance of losing $4,000). 

In contrast, when function/risk theorists argue that 
detection behaviors are risky, they use risk to mean an 
increased possibility of negative outcomes (or what Tversky 
and Kahneman call downside risk). Consumers who adopt 
detection behaviors are said to expose themselves to the 
possibility of short-term psychological loss (e.g., learning 
they have cancer), whereas nonadopters completely avoid 
such short-term loss potential. Thus, if the function/risk 
theorists’ assumption that consumers focus solely on short- 
term outcomes is accepted, adoption of a detection behavior 
must be viewed as a kind of psychological Russian roulette, 
in which consumers actually seek potential short-term 
losses, with little thought to detection's long-term benefits. 
Such loss-seeking behavior is antithetical to the tenets of 
prospect theory. 


Alternative Theories of Framing and 
Risk: Valence Congruency and Risk 
Discrimination 


Whereas function/risk theory posits that consumers exposed 
to loss-framed messages actually seek short-term product 
risks, we propose two alternative theoretical formulations 


that suggest essentially the opposite: Loss-framed messages 
induce risk aversion, whereas gain-framed messages 
increase consumers' tolerance of at least some types of 
product risk. These two theories (valence congruency and 
risk discrimination) both rest on two key findings. 

First, there is evidence that message framing influences 
consumers' moods, particularly positive moods. In depth 
interviews, Cox and Cox's (2001) respondents reported 
positive, optimistic feelings after exposure to gain-framed 
messages that promoted mammography and less positive 
moods after exposure to equivalent loss-framed messages. 
Similarly, Meyerowitz and Chaiken (1987, p. 506) find that 
gain-framed respondents were significantly more likely to 
report feeling “reassured.” Thorsteinson and Highhouse 
(2003) find that respondents exposed to gain-framed want 
ads reported more positive scores on the Associative 
Valence index (e.g., “hopeful,” "confident," "optimistic") 
than those exposed to loss-framed messages. Perhaps this 
finding is not too surprising, because gain-framed messages 
depict happy endings (in which people do the right thing 
and, consequently, win money, live long lives, and so forth) 
and loss-framed messages depict unhappy endings (in 
which people miss opportunities, lose money, get sick, and 
so forth.) However, the potential emotional impact of mes- 
sage framing has received remarkably little attention in the 
framing literature, which has tended to adopt the cognitive 
emphasis of behavioral decision theory (cf. Loewenstein et 
al. 2001). 

Second, considerable research suggests that consumers 
exposed to affectively positive stimuli are more willing to 
tolerate at least some product risk to take advantage of 
product benefits. However, this literature contains two 
schools of thought that disagree on whether positive moods 
increase consumer tolerance of al] kinds of risk or simply 
make consumers more likely to disregard relatively small 
“low-stakes” product risks. 

The valence congruency theory suggests that consumers 
in positive moods tend to give less decisional weight to all 
negative product information (e.g., product risks) than con- 
sumers in negative moods. This hypothesis is often 
explained in terms of information priming, in which prior 
exposure to affectively negative material is theorized to 
increase the accessibility of potential negative conse- 
quences (i.e., risk) in memory, whereas exposure to affec- 
tively positive material decreases accessibility of potential 
negative consequences. For example, Pietromonaco and 
Rook (1987, p. 400) theorize that negative affect "may 
influence the perceived likelihood of unpleasant events 
because it increases the accessibility of previous similar 
experiences and thus facilitates imagining other potential 
negative outcomes." 

Several studies have produced results that appear to be 
consistent with the valence congruency hypothesis, such that 
respondents in positive moods exhibit greater risk tolerance 
(e.g., Deldin and Levin 1986; Pietromonaco and Rook 1987; 
Yuen and Lee 2003). However, as Isen (2000) observes, 
many of these studies presented behavioral risks that were 
relatively minor (e.g., temporary or minor embarrassment, a 
small or hypothetical bet). In contrast, in tasks involving 
high stakes, respondents in positive moods often exhibit 
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considerable caution regarding potential risks (see, e.g., Isen 
and Geva 1987). Therefore, Isen (2000) and Isen and Geva 
(1987) propose a more complex theory of affect and risk 
taking, which can be labeled risk discrimination theory. 
According to this theory, consumers in positive moods are 
better able to discriminate between important, high-stakes 
risks and more minor, low-stakes risks. Therefore, con- 
sumers in positive moods are more likely to disregard rela- 
tively trivial risks than are consumers in negative moods, but 
they may be reluctant to take large risks. In a sense, con- 
sumers in good moods are hypothesized to weigh risks more 
appropriately in proportion to their relative importance. 

Several studies appear to support this theory (for a 
review, see Isen 2000). For example, in an experiment 
involving gambling behavior, Isen and Geva (1987, p. 145) 
find that “when stakes were high,... positive affect subjects 
were more cautious, whereas when stakes were low, posi- 
tive affect subjects were more prone to take risks.” Several 
writers have suggested that this phenomenon is part of a 
broader pattern, in which respondents in positive moods are 
simply better decision makers (i.e., they are better able to 
weigh and integrate information rationally and to distin- 
guish the relevant from the irrelevant). For example, Isen 
(2000, p. 423) states that “positive affect promotes effective 
thinking about even negative material, if doing so is useful 
and necessary, even though it also leads people to sidestep 
unnecessary consideration of undesirable material. This is 
yet another way in which positive affect promotes flexible 
problem solving.” 

In contrast, negative affect is hypothesized to inhibit 
appropriate weighing of costs and benefits. In the words of 
Leith and Baumeister (1996, p. 1251), “negative affect 
apparently inhibits people from considering the affective 
consequences of possible actions ... [and may induce] poor 
choices in situations in which there is a long-term [versus] 
short-term tradeoff.” 


Hypotheses 


Our first two hypotheses challenge two central tenets of 
function/risk theory: (1) that consumers perceive detection 
products as inherently riskier than prevention products and 
(2) that product function (detection versus prevention) mod- 
erates the persuasive effects of message framing. As we dis- 
cussed previously, the theoretical and empirical bases for 
these tenets appear questionable. Therefore, our first two 
hypotheses are null or "theory refutation" hypotheses: 


H;: Product function per se does not influence perceived prod- 
uct risk; that is, consumers do not perceive disease detec- 
tion products as inherently riskier than prevention prod- 
ucts that have the same product form, usage experience, 
and side effects. 

H»: Product function per se does not moderate the persuasive 
effects of message framing; that is, message-framing 
effects do not differ between detection products and pre- 
vention products that have the same product form, usage 
experience, and side effects. 


The next two hypotheses are based on our discussion of 
the valence congruency and risk discrimination theories. As 
we noted previously, both theories predict that message 
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framing moderates how consumers respond to potential 
product risks but not in the manner predicted by function/ 
risk theory. Consistent with both the valence congruency 
and the risk discrimination theories, but contrary to 
function/risk theory, we hypothesize the following: 


H3: Consumers exposed to gain-framed messages exhibit less 
aversion to temporary product risks (i.e., those involving 
the potential for temporary negative effects) than con- 
sumers exposed to loss-framed messages. Consumers 
exposed to gain-framed messages respond less negatively 
to a disclosure of such temporary product risk and give 
less weight to perceived product risk 1n forming overall 
product evaluations than consumers exposed to loss- 
framed messages. 


However, as we noted previously, the valence congru- 
ency and risk discrimination theories diverge on how fram- 
ing influences the relative weight that consumers give to 
minor product risks (e.g., the risk that product use may 
cause temporary unpleasant effects) versus more serious 
product risks (e.g., the risk that product use may cause per- 
manent harm). Valence congruency theory suggests that 
exposure to positive, gain-framed information should 
decrease the weight given to all product risks, large and 
small. In contrast, risk discrimination theory suggests that 
exposure to gain-framed messages should enhance con- 
sumers' ability to assess the emotional impact of different 
types of risks accurately and therefore cause them to give 
greater weight to serious risks than to smaller risks. Because 
risk discrimination has been better able to explain the range 
of previous empirical results, we base H4 on this theory: 


H4: Message framing influences the relative weight that con- 
sumers give to product risks involving temporary versus 
permanent adverse consequences. Consumers exposed to 
gain-framed messages give significantly more decisional 
weight to permanent product risks than to temporary prod- 
uct risks. In contrast, consumers exposed to loss-framed 
messages respond equally negatively to both types of 
product risk. 


Study 1 


Product Context and Participants 


We examined our first three hypotheses in the context of 
skin cancer detection and prevention products. Skin cancer 
is the most common and one of the most rapidly growing 
forms of cancer. In 2002, there were 54,200 cases of skin 
cancer in the United States alone, and 7600 Americans died 
from the disease (American Cancer Society 2004). 

A primary target audience for promoting skin cancer 
prevention and detection behaviors is young adults, a group 
that is especially likely to engage in behaviors (e.g., sun 
bathing, other outdoor recreational activities) that increase 
lifetime skin cancer risk. Therefore, the participants for this 
study were young adults whom we recruited from under- 
graduate business classes at a large urban university in the 
midwestern United States. 


Experimental Design and Stimuli 


In designing this experiment, we attempted to minimize 
some of the methodological problems that have plagued 


prior research on product function and risk. Specifically, in 
most prior studies, product function has been confounded 
with other product differences, such as cost, discomfort, 
frequency of use, and so forth. To address this problem, we 
developed gain- and loss-framed print advertisements for 
two product concepts that differ in function (detection ver- 
sus prevention) but address the same underlying problem 
(skin cancer) and involve equivalent usage experience. 


Pretest 


Purpose. In the pretest, we attempted to test partici- 
pants’ comprehension of the message-framing and product 
function message components and to confirm that message 
framing serves as a mood manipulation. Perdue and Sum- 
mers (1986) suggest that such manipulation checks are best 
conducted in a separate pretest rather than included in the 
“main” experiment that examines the dependent variables 
(e.g., brand attitudes, intentions). They note (p. 319) that 
"including these checks in the main experiment can present 
problems independent of whether they come before or after 
the dependent variable measures.” Manipulation checks 
placed before the dependent measures may bias the latter 
(e.g., by creating demand effects), whereas manipulation 
checks placed after the dependent measures may fail to 
detect stimulus effects that are inherently transient, such as 
mood inductions (see, e.g., Vastfjall 2002). Therefore, we 
conducted a separate pretest to test the effects of our 
manipulations. 


Method. We recruited 70 participants (37 male and 33 
female) from undergraduate business classes. Each partici- 
pant was assigned randomly to one of four experimental 
advertisements, which varied in behavioral function (pre- 
vention versus detection) and message framing (gain framed 
versus loss framed). Each advertisement had the same pic- 
ture (a stylized picture of the sun) and headline (“Important 
News for Young Adults About Skin Cancer’). The body of 
each advertisement contained three components: (1) basic 
information about skin cancer (drawn from the American 
Cancer Society Web site), which was identical for all 
experimental groups; (2) one of two skin cancer protection 
product concepts that differed in function (prevention versus 
detection) but had the same product form and usage experi- 
ence (both were lotions applied in a doctor's office that 
were described as "highly effective and very safe"); and (3) 
an advocacy message that urged consumers to adopt the 
product, which was either gain framed or loss framed. 

After message exposure, pretest participants responded 
to manipulation checks for perceived product function, 
message framing, and mood. As a framing manipulation 
check, respondents used seven-point Likert scales to report 
whether they believed the message "emphasized the benefits 
of using the product" or “emphasized the costs of not using 
the product" As a product function manipulation check, 
participants reported agreement that “the lotion helps pre- 
vent skin cancer" and "the lotion helps detect skin cancer." 

We measured postexposure mood using scales we 
adapted from the work of Howard, Gengler, and Jain (1995) 
and Meyerowitz and Chaiken (1987). On seven-point Likert 
scales, each respondent reported the degree to which 14 


adjectives (e.g., assured, cheerful, afraid, happy, troubled) 
described his or her mood state. These responses were 
entered into a principal components analysis with a Vari- 
max rotation. Áfter the elimination of some items because 
of high cross-loadings, the final analysis extracted two 
interpretable factors with eigenvalues greater than one, 
which together explained 7596 of the variance in the indi- 
vidual items. The first factor captured participants' hopeful- 
ness or optimism; items loading on this factor were “hope- 
ful," "cheerful," "assured," and "happy." The second factor 
tapped 1nto participants' anxiety mood state; items loading 
on this factor were "afraid, “disturbed,” "troubled," and 
"worried." All loadings were greater than .75, and all cross- 
loadings were less than .15. Next, we constructed two 
summed scales based on the items loading on these two fac- 
tors. The hopefulness mood scale had a coefficient alpha 
value of .84, and the anxiety mood scale had an alpha value 
of .92. These two summed scales were not significantly cor- 
related with each other, suggesting that hopefulness and 
anxiety represent two distinct emotional dimensions, not 
simply different ends of the same continuum. 


Pretest results. When asked whether the message com- 
municated the "benefits of product use," participants 
exposed to the gain-framed version were slightly more 
likely to agree (X = 5.58) than those exposed to the loss- 
framed version (X = 5.07); however, this difference did not 
reach statistical significance (F = 2.69, p = .11). However, 
when asked more specifically if the advertisement “empha- 
sized the costs of not using the product," participants in the 
loss-framed condition were much more likely to agree (X — 
5.27) than those in the gain-framed condition (X = 4.06; F = 
11.9, p « .001). This latter finding indicates that the partici- 
pants comprehended the framing of the messages. 

Participants exposed to the prevention product concept 
were much more likely to agree that the "lotion helps pre- 
vent skin cancer’ (X = 4.90) than those exposed to the 
detection product concept (X = 3.11; F = 15.5, p « .001). 
Likewise, participants exposed to the detection product 
were much more likely to agree that the “lotion helps detect 
skin cancer" (X = 5.97) than those exposed to the preven- 
tion product (X = 3.04; F.= 60.0, p < .001). 

The pretest also confirmed that consumers experience 
more positive moods after exposure to gain-framed mes- 
sages than after exposure to loss-framed messages. Partici- 
pants exposed to the gain-framed message reported signifi- 
cantly higher levels of hopefulness (X = 3.88) than those 
exposed to the loss-framed message (X = 2.98; F = 8.9, p = 
.004). There was no significant difference in the reported 
anxiety level of the two message-framing groups. 


Main Experiment 


Purpose of study. This experiment has three main pur- 
poses. First, it tests H,, that contrary to function/risk theory, 
disease detection products are not perceived as inherently 
riskier than prevention products involving the same disease, 
product form, and usage experience. Second, it tests H», 
that the persuasive effects of message framing do not vary 
with product function. As we noted previously, function/ 
risk theory posits that loss-framed messages are most per- 
suasive when advocating detection behaviors, whereas gain- 
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framed messages are most persuasive when advocating pre- 
vention behaviors. However, we argue that the theoretical 
rationale for this hypothesis (ostensibly based on prospect 
theory) is problematic and that the empirical evidence in 
support of it is inconclusive (in part because in most stud- 
ies, product function is confounded with other product dif- 
ferences). Thus, this experiment attempts to provide a rigor- 
ous test of the proposed interaction between function and 
framing, independently manipulating message frame and 
product function while holding other variables constant. 
Third, the experiment examines H}, that consumers exposed 
to gain-framed messages exhibit less aversion to temporary 
product risks (i.e., those involving the potential for tempo- 
rary negative effects) than consumers exposed to loss- 
framed messages. To test this hypothesis, the experiment 
manipulates product risk independent of product function. 


Sample. We recruited 282 young adults from undergrad- 
uate business classes at a large urban university in the Mid- 
west. The sample was 41% female and had a median age of 
22.5 years. 


Design and stimuli. The experiment used a 2 x 2 x 2 
between-subjects factorial experimental design. The factors 
were message framing (gain framed versus loss framed), 
product function (prevention versus detection product), and 
product risk disclosure (presence versus absence of tempo- 
rary side-effect risk). The manipulation of framing and 
function were identical to the pretest. We varied product 
risk disclosure by having the messages state either that the 
lotion is "very safe" or that the lotion was "generally safe" 
but in some patients could cause a temporary skin rash that 
could last for several weeks. Because a product's side-effect 
profile (i.e., its potential for unintended physical discomfort 
or harm to the product user) is conceptually distinct from its 
function (its core purpose or stated customer benefit), this 
manipulation enabled us to vary product risk independent of 
product function. 

We randomly assigned participants to one of eight 
experimental advertisements. Each advertisement had the 
same picture, headline, and basic information about skin 
cancer as described in the pretest. These were followed by 
two paragraphs, which contained the experimental manipu- 
lations: (1) a paragraph that described one of four skin can- 
cer protection product concepts (a low-risk prevention prod- 
uct, a risky prevention product, a low-risk detection 
product, or a risky detection product) and (2) a message, 
which was either gain framed or loss framed, that urged 
consumers to adopt the product (examples of the experi- 
mental messages appear in the Appendix). Test booklets 
containing the eight experimental messages were randomly 
assigned to participants within each classroom. 


Measures. We assessed perceived product risk by hav- 
ing participants report their agreement on seven-point Lik- 
ert scales with a series of statements regarding the potential 
negative consequences of product use: “Using this lotion is 
risky,’ “This lotion can lead to bad results," “Using this 
lotion would make me feel anxious,” and “Using this lotion 
would cause me to worry.” We combined participants’ 
scores on these four items to form a mean scale with a coef- 
ficient alpha of .77. We assessed overall product evaluation 
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by having participants report their agreement on seven- 
point Likert scales with the following statements: “Overall, 
I think this lotion is a very good product,” “People who use 
this lotion will be safer than those who don't," “I would 
rather use this lotion than any currently available alterna- 
tive,” and “Using this lotion would be important to me.” We 
combined these items to form a mean scale with a coeffi- 
cient alpha of .76. We assessed behavioral intentions by 
having participants report on seven-point bipolar scales, 
anchored by “very likely” and “very unlikely,” their inten- 
tion to engage in three behaviors involving the target prod- 
uct: asking their doctor about the product, purchasing the 
product, and using the product. We combined these to form 
a mean scale with a coefficient alpha of .90. Finally, partici- 
pants provided demographic information (e.g., gender, age, 
education) and reported on their previous skin cancer pre- 
vention behaviors (e.g., use of sunscreen) and risk factors 
(e.g., sun exposure, severe sunburns, family history). 


Results. We examined perceived product risk with an 
analysis of variance (ANOVA), in which perceived product 
risk was the dependent variable and framing, product risk 
disclosure, and function were dichotomous factors. As we 
predicted in H,, there was no evidence that participants per- 
ceived detection products as inherently riskier than preven- 
tion products. Product function had no impact on perceived 
product risk (F = .24, p = .63). The mean perceived risk of 
the detection product (X = 3.60) was virtually identical to 
that of the prevention product (X = 3.53). Thus, we found 
support for H}. 

However, there was a strong main effect of the side- 
effect manipulation on perceived product risk (F = 24.9, p < 
.001). Participants in the side-effects disclosure condition 
perceived significantly greater product risk (X = 3.9) than 
those in the no-side-effects condition (X = 3.2). 

Message framing did not have significant main or inter- 
active effects on participants’ perceptions of product risk 
level. However, as we discuss subsequently, message fram- 
ing altered the weight that participants gave to such risk 
perceptions in forming overall product evaluations. 


Overall product evaluations. Next, we conducted an 
ANOVA that evaluated the effect of message framing, prod- 
uct function, and product risk disclosure on participants’ 
overall product evaluations. Consistent with H», there was 
no interaction between message framing and product func- 
tion (F = .03, p = .87). 

However, framing moderated the impact of the product 
risk disclosure on participants’ overall product evaluations 
(see Figure 1). As we predicted in H5, there was a statisti- 
cally significant interaction between message frame and 
product risk disclosure (F = 11.19, p = .001). To understand 
this significant interaction better, we analyzed the simple 
effects of the product risk disclosure in each of the two fram- 
ing conditions (see, e.g., Keppel 1982, pp. 214-19). This 
analysis revealed that consumers exposed to loss-framed 
messages rated the risky (temporary side effects) product 
significantly more negatively (X — 4.08) than the low-risk 
(no side effects) product (X — 4.77; F — 13.17, p « .001). In 
contrast, in the gain-framed condition, consumers' product 
evaluations did not differ between the risky product (X = 
4.40) and the low-risk product (X = 4.20; F = 1.20, p = .27). 


To examine further how message framing affects con- 
sumers' sensitivity to product risk, we examined whether 
the correlation between perceived product risk and overall 
product evaluations varied significantly between the gain- 
and loss-framed conditions (for a similar analysis, see 
Pietromonaco and Rook 1987). This analysis revealed that 
among participants exposed to loss-framed messages, over- 
all product evaluations were negatively correlated with per- 
ceived product risk (r = —31, p < .001). However, among 
participants exposed to gain-framed messages, overall prod- 
uct evaluations were not significantly correlated with per- 
ceived product risk (r = —07, p = .38). A Fisher r-to-z trans- 
formation (Kleinbaum and Kupper 1978, pp. 106—107) 
confirmed that the correlation between perceived product 
risk and overall product evaluation was significantly differ- 
ent across the two framing conditions (z = 2.04, p « .05). 
This analysis provides further evidence that participants 
exposed to gain-framed messages gave less evaluative 
weight to the product's risks than did those exposed to loss- 
framed messages. 


Behavioral intention. Next, we conducted an ANOVA in 
which behavioral intention was the dependent variable and 
message framing, product function, and product risk disclo- 


FIGURE 1 
Study 1: Effect of Stated Product Risk on Overall 
Product Evaluations: Gain-Framed Versus Loss- 
Framed Conditions 
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sure were dichotomous factors. As we predicted in H», there 
was no interaction between product function and message 
framing (F = 1.92, p = .17). However, there was a signifi- 
cant interaction between product risk disclosure and mes- 
sage framing (F = 4.80, p < .05). Tests of simple effects 
revealed that participants in the loss-framed condition 
reported significantly greater intention to adopt the low-risk 
product (X = 4.50) than the risky product (X = 3.75; F = 
6.58, p = .01). Among participants in the gain-framed con- 
dition, there was no significant difference in behavioral 
intention toward the risky (X = 4.17) and low-risk (X = 
4.03) products (F = .25, p = .62) (see Figure 2). 

Next, we conducted mediation analysis to determine 
whether within the loss-framed condition, the effect of risk 
disclosure on behavioral intention was mediated by overall 
product evaluation. We used Baron and Kenny’s (1986) 
mediation test method, estimating a series of regression 
models and testing the significance of the (unstandardized) 
regression coefficients. First, a bivariate regression model 
confirmed that risk disclosure had a significant impact on 
the overall product evaluation (B = —693; t = —3.68, p < 
.001). Second, a bivariate regression model confirmed that 
risk disclosure had a significant effect on behavioral inten- 
tion (B = —745; t = —2.5, p = .01). Third, we regressed 


FIGURE 2 
Study 1: Effect of Stated Product Risk on 
Behavioral Intention: Gain-Framed Versus 
Loss-Framed Conditions 
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behavioral intention simultaneously on both product risk 
disclosure and product evaluation. This analysis revealed 
that the proposed mediator (product evaluation) had a sig- 
nificant influence on behavioral intention (B = .986; t = 
9.35, p < .001), whereas the effect of the independent varia- 
ble (risk disclosure) became nonsignificant (B = —.06; t = 
—.25, p = .80) when the mediator was controlled. Thus, our 
analysis meets all four of Baron and Kenny’s criteria to 
establish that the effect of risk disclosure on behavioral 
intention is completely mediated by product evaluation. 
Finally, Sobel’s (1982) test confirmed the statistical signifi- 
cance of the indirect effect of risk disclosure on intention 
mediated by product evaluation (z = —3.43, p < .001). 


Discussion of Study 1 


Study 1 shows that consumers exposed to loss-framed mes- 
sages exhibit a greater aversion to a temporary product risk 
(i.e., involving potential temporary negative outcomes) than 
do consumers exposed to gain-framed messages (who gave 
essentially zero decisional weight to such product risks). 
This finding is contrary to function/risk theory, which states 
that loss-framed messages induce risk-seeking behavior, but 
is consistent with both valence congruency theory and risk 
discrimination theory. However, Study 1 does not indicate 
whether gain-framed messages increase consumers’ toler- 
ance for all risks (as would be predicted by valence congru- 
ency theory) or simply minor, temporary risks (as predicted 
by risk discrimination theory). Not only is this difference 
theoretically interesting, but it may also have important 
marketing and public policy implications. If gain-framed 
messages cause consumers to disregard all product risk 
information, such messages might (intentionally or uninten- 
tionally) result in imprudent decisions about products 
entailing serious risks (which, as we noted previously, are 
increasingly being marketed directly to consumers). Con- 
versely, if gain-framed messages merely cause consumers 
to disregard minor, temporary risks in the pursuit of long- 
term health benefits, such messages might actually help 
consumers make more rational decisions to protect their 
long-term health. 

Given the importance of this issue, we address it in a 
second study, which examines the impact of message fram- 
ing on consumers’ responses to two kinds of product risk: 
the risk that product use may cause temporary unpleasant 
effects versus the risk that product use may cause perma- 
nent serious harm. 


Study 2 


Purpose 


Study 2 serves two main purposes. First, it enables us to 
replicate the test of H4 (that consumers exposed to gain- 
framed messages give less weight to temporary product 
risks than consumers exposed to loss-framed messages) in a 
completely different product context. Second, it enables us 
to test H4 (that message framing influences the relative 
weight that consumers give to product risks involving tem- 
porary versus permanent adverse consequences). 
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Product Context 


This study examines the effects of message framing and 
perceived product risk 1n the context of hepatitis B virus 
(HBV) vaccination adoption. Hepatitis B is a serious viral 
infection of the liver, which can lead to chronic hepatitis 
and, in some cases, death. It is transmitted in a manner 
similar to HIV (e.g., through unprotected sex, sharing of 
needles). However, the risk of HBV infection (unlike HIV) 
can be greatly reduced by the use of an effective vaccine, 
ideally administered in a series of three doses (Krugman 
and Stevens 1994). Unfortunately, many sexually active 
adults decline the vaccine, even when it is offered for free 
(Weinstock et al. 1995). Therefore, increased adoption of 
the HBV vaccine is a major social marketing goal in both 
the United States and other countries. 


Overview of Study Design 


The study is a randomized experiment. We recruited volun- 
teers from the patient populations at three public sexually 
transmitted disease (STD) clinics located in two large mid- 
western cities, and we randomly assigned them to treatment 
conditions that included either gain-framed or loss-framed 
messages advocating the HBV vaccination. Because HBV 
is an STD, clinic patrons are a prime target audience for 
promoting the HBV vaccination. 

The randomization was executed and the experimental 
messages and questionnaire were administered by means of 
an audio computer-assisted self-interview (A-CASD) tool, 
which is an effective tool particularly for addressing sensi- 
tive topics such as STDs (see, e.g., Turner et al. 1998.) 

Before participants were exposed to the experimental 
messages, they were asked a series of questions about their 
perceptions of two distinct components of product risk: the 
temporary product risks associated with getting shots (e.g., 
pain) and the more permanent risks associated with the 
HBV vaccine (e.g., the possibility that the vaccine might 
harm their health). Then, following exposure to either gain- 
framed or loss-framed messages, participants were asked 
about their behavioral intentions regarding the product. 


Participants and Procedure 


A total of 213 adult (age 18 years and older) participants 
were recruited by a trained project manager from the patient 
populations of the three STD clinics over a period of several 
months. Participants were screened to ensure that they had 
no prior self-reported history of HBV infection or vaccina- 
tion and that they were fluent in English. The sample was 
60% male and had a median age of 27.5 years. Approxi- 
mately 85% of the sample had at least a high school 
diploma or equivalent. 

After screening for study eligibility, each participant 
Signed a written informed consent form and then was 
directed to a private area with a touch-screen notebook 
computer with headphones. The project manager familiar- 
ized the participant in the use of the A-CASI equipment and 
allowed the participant to complete the interviewing process 
in private. The project manager had no awareness of either 
the message treatment group to which participants were 
randomly assigned or participants’ questionnaire responses. 


Before the framing intervention, participants completed 
a series of questions about demographics, prior behaviors 
that might affect susceptibility to HBV infection (e.g., sex- 
ual behaviors), and perceptions of both the temporary risks 
of HBV vaccination (e.g., injection pain) and the risks of 
more permanent negative outcomes (e.g., the possibility 
that the vaccine itself might cause HBV infection; we dis- 
cuss these measures in greater depth subsequently). Next, 
each participant was randomly assigned a message that 
advocated the adoption of the HBV vaccination that was 
either gain framed or loss framed. After exposure to the 
message, participants reported their behavioral intentions. 


Stimull 
The message opened with a gain- (loss-) framed general 


statement about the consequences of getting (or not getting) . 


the vaccination, followed by two gain- (loss-) framed anec- 
dotes that illustrated these consequences. To increase the 
perceived personal relevance of the messages, the A-CASI 
program automatically matched the gender (i.e., names) of 
the people described in the two anecdotes to the partici- 
pant's gender; otherwise, the male and female messages 
were identical. The male version of the message is as fol- 
lows (the gain-framed version appears in normal text, and 
the loss-framed language appears in bold): 


People who get (don't get) the hepatitis B shot are gain- 
ing (losing) a chance to protect themselves and the ones 
they love. 


Michael felt very healthy and didn't believe he would be 
infected by the Hepatitis B virus, [b]ut he followed (so he 
didn't follow) the advice of the American Medical Asso- 
ciation and got (didn't get) 3 hepatitis B vaccine shots 
over 6 months. Because of this, he 1s (not) protected 
against hepatitis B infection. Now he is much less (much 
more) likely to get sick or die from serious liver disease, 
and he can look forward to (may miss out on) a long and 
healthy life, watching his child grow up. 


James decided he needed (didn't need) to protect himself 
and got (didn't get) the hepatitis B vaccine. As a result, 
he did not get infected (got infected) after having sex with 
an infected partner. Because James did not get infected 
(got infected) with the hepatitis B virus, he did not pass it 
on (passed it on) to his new partner. So his new partner 
did not get sick (got sick) with hepatitis B, and now they 
have gained (lost) a chance for a healthy future together. 


We checked this message-framing manipulation in a 
pretest, in which 38 young adults (average age of 29 years) 
were randomly assigned either the gain- or the loss-framed 
message and then responded on Likert scales (7 = “strongly 
agree,” 1 = “strongly disagree”) to two statements: “This 
message focuses on the benefits of getting the hepatitis B 
shot,” and “This message focuses on the negative conse- 
quences of not getting the hepatitis B shot.” As we 
expected, participants assigned to the gain-framed message 
condition (X = 5.23) were more likely than the participants 
in the loss-framed message condition (X = 3.25) to agree 
that the message focused on the benefits of compliance (t = 
4.29, p < .001). Conversely, participants in the loss-framed 
message condition (X = 6.25) were more likely than partici- 
pants in the gain-framed message condition (X = 4.23) to 


agree that the message stressed the negative consequences 
of noncompliance (t = 3.57, p = .001). 


Constructs and Measures 


Perceived product risk. Consumers’ decisions to get 
vaccinated are sometimes influenced by their perceptions of 
two fairly distinct types of product risk. First, there is the 
perceived risk of temporary pain or discomfort from the 
needle or injection itself, which sometimes deters con- 
sumers from getting vaccinations and other medical proce- 
dures involving the use of needles (see, e.g., French and 
Painter 1994; Zimet et al. 2004). Second, there is the per- 
ceived risk of more permanent harm caused by the specific 
vaccine (e.g., the perceived risk that the vaccine will trans- 
mit the disease in question or otherwise cause serious ill- 
ness or long-term disability; see, e.g., Ball, Evans, and 
Bostrom 1998). 

Before exposure to the framed messages, participants 
completed a series of measures that assessed these two 
dimensions of product risk. To assess the percerved risk of 
shots/needles, participants reported their agreement (on 
five-point Likert scales) with four items (e.g., “Shots are 
very painful," “Needles don’t bother me at all [reversed]’’), 
which we combined to form a mean scale with a coefficient 
alpha of .80. To assess perceived risk of the HBV vaccine, 
participants reported their agreement with six items (e.g., 
“The vaccine shots for hepatitis can make people very sick,” 
“You can get infected with hepatitis from the hepatitis vac- 
cine shots”), which we combined to form a mean scale with 
an alpha of .78. As we previously discussed, these two 
dimensions of perceived product risk appear to be distinct. 
The two mean scales were not significantly correlated (r = 
.098). Furthermore, when all ten items are combined in a 
factor analysis with an orthogonal rotation, the respective 
sets of scale items load together on two distinct factors 
(eigenvalues > 1) with factor loadings between .6 and .9 and 
minimal cross-loadings (all < .08). 


Behavioral intentions. We assessed participants’ inten- 
tions regarding the HBV vaccination with three items: a 
dichotomous yes/no question (“If you were offered the 
hepatitis B vaccine for free at this clinic, would you get the 
shot?") and two five-point Likert scale items (“I would rec- 
ommend that my partner get hepatitis vaccine shots,” and “T 
would recommend that my friends or family members get 
hepatitis vaccine shots”). These three items were highly 
intercorrelated and loaded together on a single factor that 
explained 78% of their combined variance. Therefore, the 
three items were all converted to standardized scores (to 
adjust for scaling differences) and then combined to form a 
mean scale with a coefficient alpha of .87. 


Results 


We tested H4 to examine whether (as in Study 1) message 
framing influences the decisional weight that consumers 
give to temporary product risks. To address this issue, we 
calculated the correlation between behavioral intention and 
perceptions of temporary product risk (i.e., risk of short-term 
injection discomfort), and then we examined whether this 
correlation differed between the gain- and the loss-framed 
conditions (see Pietromonaco and Rook 1987). As in Study 
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1, this analysis revealed that among participants exposed to 
loss-framed messages, overall product evaluations were 
negatively correlated with perceived temporary product risk 
(r 2 —30, p < .01). However, among participants exposed to 
gain-framed messages, overall product evaluations were not 
significantly correlated with perceived temporary product 
risk (r = —.01, p = .92). A Fisher r-to-z transformation 
(Kleinbaum and Kupper 1978, pp. 106-107) confirmed that 
the correlation between perceived temporary product risk 
and overall product evaluation was significantly different 
across the two framing conditions (z = 2.18, p < .05). 

As an additional test of H3, we conducted an ANOVA to 
examine how framing moderates consumer response to high 
versus low levels of temporary and permanent product risk. 
This analysis involved three steps. First, as we noted previ- 
ously, a factor analysis of the perceived risk items produced 
two orthogonal factors, one representing perceived tempo- 
rary product risk (i.e., shot/needle pain) and the other repre- 
senting perceived permanent product risk (i.e., serious ill- 
ness caused by the vaccine). Second, we performed a 
median split on each of these factor scores, which resulted 
in two uncorrelated dichotomous factors: perceived tempo- 
rary product risk (high versus low) and perceived permanent 
product risk (high versus low). Third, we performed a 2 x 
2 x 2 ANOVA of behavioral intentions, in which message 
framing (gain versus loss), temporary product risk (high 
versus low), and permanent product risk (high versus low) 
were dichotomous factors. This analysis showed significant, 
negative main effects of both permanent product risk (F = 
25.4, p « .001) and temporary product risk (F = 4.1, p « .05) 
on behavioral intentions. However, consistent with Ha, the 
ANOVA also revealed a significant interaction between 
framing and temporary product risk (F = 4.08, p « .05). As 
we hypothesized, perceived temporary product risk had a 
significant, negative effect on behavioral intentions among 
loss-framed participants (F — 6.34, p — .01) but had no 
impact among gain-framed participants (F = .09, p = .77). 

Thus, both the HBV study and the skin cancer experi- 
ment, though using different participant populations, set- 
tings, and products, produced remarkably similar results: In 
both studies, participants exposed to loss-framed messages 
gave considerably more decisional weight to temporary dis- 
comfort risk (either skin rash or shot pain) than participants 
exposed to gain-framed messages. Indeed, gain-framed par- 
ticipants in both studies gave essentially zero decisional 
weight to such temporary product risks. 

Next, we examined Hy, that message framing influences 
the relative weight that consumers give to temporary prod- 
uct risks versus permanent product risks. As we hypothe- 
sized, among participants in the loss-framed condition, 
behavioral intentions were negatively correlated with both 
perceived temporary product risks (r = —.30, p < .01) and 
perceived permanent product risks (r = —35, p « .01). To 
assess the difference between these values, we calculated 
Meng, Rosenthal, and Rubin's (1992) z statistic. Note that 
in testing H}, we used the Fisher r-to-z transformation, 
which is most appropriate for comparing the same correla- 
tion across two different samples. However, Meng, Rosen- 
thal, and Rubin's test is most appropriate for comparing the 
magnitude of two different correlations (€.g., Tiemprisk,ntent 
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Versus Ipermnsk,ntent) Within the same sample (e.g., loss- 
framed participants), in which each correlation is between a 
common dependent variable (intention) and alternative pre- 
dictor variables (temporary risk versus permanent risk). 
Meng, Rosenthal, and Rubin's test found no significant dif- 
ference between Ttemprisk,intent 8Dd Tpermnsk,ntent 2Mong loss- 
framed participants (z = .38, p = .70), suggesting that loss- 
framed participants give roughly equal weight to permanent 
versus temporary risks in forming behavioral intentions. 

Among gain-framed participants, behavioral intentions 
were (as we reported previously) uncorrelated with per- 
ceived temporary product risks (r = —01). However, gain- 
framed participants' behavioral intentions exhibited a strong 
negative correlation (r = —.47, p < .0001) with perceived per- 
manent product risks. Meng, Rosenthal, and Rubin's (1992) 
test indicates that the difference between these two correla- 
tions was statistically significant (z = 4.03, p < .0001). 

Thus, as we predicted in H4, consumers exposed to 
gain-framed messages appear to give significantly more 
decisional weight to permanent product risks than to tempo- 
rary product risks. In contrast, consumers exposed to loss- 
framed messages respond equally negatively to both types 
of product risk. 


Discussion 


Consumers are increasingly called on to evaluate product 
risk (i.e., the possibility that product use may cause negative 
consequences). However, there has been relatively little 
research on how consumers evaluate product risk informa- 
tion, and the research that exists has produced contradictory 
findings. To help explain these contradictory findings, we 
hypothesized that the persuasive effects of product risk 
information may be moderated by message framing. Specif- 
ically, we evaluated three competing theories of how fram- 
ing and product risk may interact to influence consumer 
product evaluations and intentions. 

We evaluated function/risk theory (Meyerowitz and 
Chaiken 1987; Rothman and Salovey 1997), which posits 
that (1) loss-framed messages induce risk-seeking behavior, 
whereas gain-framed messages induce risk-avoidance 
behavior; (2) consumers perceive detection behaviors as 
risky and prevention behaviors as low risk (or "safe"); and 
(3) loss-framed messages increase consumers' motivation to 
use detection products, whereas gain-framed messages 
increase consumers' motivation to use prevention products. 

None of these three premises was supported by our data. 
First, although manipulation checks confirmed that con- 
sumers understood the respective functions (detection ver- 
sus prevention) of the two product concepts presented in 
Study 1, participants did not perceive the detection product 
as inherently riskier than a prevention product with the 
same form and usage experience. Second, Study 1 did not 
find any difference in message-framing effects between 
detection and prevention products. Third, function/risk 
theory's most basic premise, that loss-framed messages 
induce risk-seeking behavior, was contradicted by both 
Studies 1 and 2. In both experiments, participants exposed 
to loss-framed messages exhibited a general aversion to 
product risk involving both relatively short-term adverse 


effects (Studies 1 and 2) and more permanent harm (Study 
2). There was no evidence of risk-seeking behavior among 
loss-framed participants in either experiment. 

As we noted previously, proponents of function/risk 
theory claim that their hypothesis of risk seeking among 
people exposed to loss-framed messages is grounded in 
Tversky and Kahneman’s (1981) prospect theory. In that 
case, does our empirical disconfirmation of this hypothesis 
mean that prospect theory is wrong? Absolutely not. It sim- 
ply means that in the attempt to derive function/risk theory 
from prospect theory, key aspects of prospect theory (partic- 
ularly the meaning of the word “risk’’) were lost in transla- 
tion. As we noted previously, Tversky and Kahneman used 
the word “risk” in a technical sense to mean the variance of 
potential outcomes, so that a riskier option entails both 
extremely positive and extremely negative possibilities 
rather than a predictable (but mediocre) certain option. 
When risk is operationalized in this way, participants are 
sometimes motivated to choose the risky option because of 
its positive possibilities. This is particularly true if outcomes 
are framed as losses because the certain option then entails 
a certain small loss, whereas the risky option offers the 
upside possibility of no loss. Given this somewhat esoteric 
definition of “risk,” risk-seeking behavior seems plausible. 

However, although this “risk as variance” definition is 
used in a few academic fields, including expected utility 
theory (the point of departure for prospect theory) and 
financial investment analysis, it does not match the way the 
word “risk” is used by function/risk theorists or, for that 
matter, in most academic disciplines. To most people, acad- 
emic or layman, “risk” specifically connotes the possibility 
of something bad. For example, Merriam-Webster (2005) 
defines risk as “possibility of loss or injury: peril.” Simi- 
larly, in a typical consumer behavior definition, Solomon 
(2004, p. 304) defines perceived risk as “the belief that the 
product has potentially negative consequences,” and risk 
perception researcher Paul Slovic (2000) uses the words 
"risk" and “hazard” interchangeably. In none of these cases 
does "risk" imply the possibility of extremely positive con- 
sequences that help explain Tversky and Kahneman's 
(1981) results. 

Therefore, to understand how message framing might 
influence consumers' responses to product risk (defined as 
potential negative consequences), we proposed two alterna- 
tive theories: valence congruity theory and risk discrimina- 
tion theory. Both theories are based on prior findings that 
message framing influences mood and that mood influences 
consumers' responses to risk. Both theories suggest that 
exposure to gain-framed messages increases consumers' tol- 
erance for temporary product risks, a finding that was con- 
firmed in both Study 1 and Study 2. However, the theories 
diverge in predicting how message framing will affect con- 
sumers' responses to more serious, long-lasting product 
risks. Thus, Study 2 was designed to address this issue. The 
findings of this experiment support risk discrimination 
theory: Participants exposed to gain-framed messages 
appear to be better able to discriminate between important 
and unimportant risks; they essentially ignore minor or tem- 
porary product risks but exhibit considerable caution 
regarding the possibility of more permanent, serious risks. 


In addition to their theoretical implications, our findings 
may have implications for those who design and regulate 
promotional messages for risky products, including DTC 
advertisements for prescription drugs. Food and Drug 
Administration requirements dictate that any DTC adver- 
tisement that mentions a product's brand name (e.g., Levi- 
tra) and its benefits (improving male sexual function) must 
disclose the product's side-effect risks (e.g., severe 
headaches). The only way to avoid such disclosure is to 
either not name the product (e.g., "Ask your doctor about 
new treatments for erectile dysfunction") or not say what it 
does (e.g., "Ask your doctor about Levitra"). Thus, DTC 
advertisers must either mention product risks or produce 
rather cryptic advertisements (e.g., the Levitra advertise- 
ment that does not indicate what it does but shows a foot- 
ball going through a swinging tire). 

Although there has been relatively little research on how 
consumers process such product risk information, there are 
several theories. One theory (e.g., Goetz] 2000; Rodale 
Publications 1999) is that product risk disclosures actually 
enhance advertisement credibility by making the advertiser 
seem more forthcoming. Another theory is that product risk 
disclosures undermine the positive content of an advertise- 
ment and reduce advertising persuasiveness (Goetzl 2001; 
Morris, Mazis, and Brinberg 1989). However, our findings 
do not support either of these simple main effects of prod- 
uct risk. Instead, our data suggest that consumers respond 
differently to product risk depending on both the nature of 
this risk (temporary versus permanent) and the framing of 
the persuasive message. Participants exposed to loss-framed 
messages exhibited an aversion to product risk, whereas 
those exposed to upbeat, gain-framed messages essentially 
disregarded temporary product risks in forming product 
evaluations and intentions. 

These results suggest that advertisers are able to reduce 
the decisional weight that consumers give to certain types 
of product risk information by placing such information in 
the context of upbeat, gain-framed message executions. If 
confirmed by further research, this finding has potentially 
important implications both for marketers required to 
include product risk disclosures in their promotional materi- 
als and for the consumer protection agencies tbat institute 
these disclosure requirements. 

To understand the policy implications of these findings 
more fully, further research should explore several issues. 
Specifically, further research should examine whether the 
framing-induced optimism we observed in our study is ben- 
eficial or harmful in general to consumer welfare. As we 
noted previously, our research found that consumers 
exposed to gain-framed messages gave essentially zero 
weight to temporary product risks (i.e., skin rash or sbot 
pain) in evaluating a disease protection product (i.e., skin 
cancer Jotion or hepatitis vaccination). However, is this dis- 
regard of temporary product risk a good thing or a bad 
thing? On the one hand, in general, consumers should con- 
sider risks before pursuing any course of action. On the 
other hand, perhaps consumers should give the possibility 
of a skin rash (or a sore shoulder) considerably less weight 
than the chance to protect themselves from skin cancer or 
liver disease. Perhaps optimism induced by gain-framed 
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messages simply helps consumers overcome their natural 
hesitancy toward taking calculated risks (i.e., the omission 
bias). This is an important issue for further research. 

Finally, further research should explore the emotional 
effects of message framing, examining more broadly how 
these effects may influence consumer judgment and deci- 
sion making. As we noted previously, most message- 
framing research to date has been grounded (at least osten- 
sibly) in Kahneman and Tversky’s (1979; see also Tversky 
and Kahneman 1981) prospect theory, which, as is the case 
with much of behavioral decision theory, tends to empha- 
size cognitive over affective phenomena (for a discussion of 
the neglect of emotion in behavioral decision research, see, 
e.g., Loewenstein et al. 2001). However, the finding that 
message framing influences consumers’ moods, along with 
the wealth of research showing that mood affects decision 
making (e.g., Isen 2000), could open up entirely new 
avenues in the study of framing effects. We strongly encour- 
age researchers to explore those avenues. 


Appendix 
Examples of Experimental 
Messages 


Detection/Low Risk/Loss 


Skin cancer is the most common form of cancer among 
people in their twenties. Every 53 minutes, someone dies of 
skin cancer. Skin cancer can be hard to see with the naked 
eye or may look like a normal mole or freckle. 

Fortunately, there is now a medicated lotion which can 
detect skin cancer. This product is not a sunblock or sun- 


— screen. It is applied by a doctor to the patient's skin and cre- 


ates a gentle reaction, which allows the doctor to easily 
detect the early development of skin cancer. This lotion 1s 
highly effective and very safe. 

If you fail to use this lotion, you will be losing an 
opportunity to detect the development of skin cancer cells 
before they can do serious harm. By failing to use this 
lotion, you can significantly harm your chances of living a 
long, cancer-free life. 

For more information, call 1-800-4CANCER, or talk to 
your doctor. 


Prevention/High Risk/Galn 


Skin cancer is the most common form of cancer among 
people in their twenties. Every 53 minutes, someone dies of 
skin cancer. Skin cancer can be hard to see with the naked 
eye or may look like a normal mole or freckle. 

Fortunately, there is now a medicated lotion which can 


. prevent skin cancer. This product is not a sunblock or sun- 


screen. It is applied by a doctor to the patient's skin and cre- 
ates a gentle reaction, which allows the doctor to easily pre- 
vent the early development of skin cancer. This lotion is 
highly effective and generally safe. However, in some 
patients, ıt can cause a severe skin rash lasting several 
weeks. 

If you use this lotion, you will be gaining an opportu- 
nity to prevent the development of skin cancer cells before 
they can do serious harm. By using this lotion, you can sig- 
nificantly improve your chances of living a long, cancer- 
free life. 

For more information, call 1-800-4CANCER, or talk to 
your doctor. 
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How Does Enforcement Deter Gray 
Market Incidence? 


Gray market activity has become increasingly prevalent. The prevailing wisdom in marketing is to use more severe 
enforcement to deter gray marketing. However, the certainty and speed of enforcement may also have a bearing 
on the incidence of violations. This article examines whether and how enforcement deters gray marketing. The 
results from a field survey of manufacturers and an experimental design suggest that, by itself, enforcement sever- 
ity has no impact. Deterrence results only when the multiple facets of enforcement are used in combination. 


ingly rely on their authorized distributors to perform 

value-adding functions (Coughlan et al. 2001). To safe- 
guard distributor incentives to perform these services, 
manufacturers typically deploy resale restrictions through 
explicit contracts or implicit agreements. By circumscribing 
to whom distributors may sell, resale restrictions limit intra- 
brand competition and maintain distributor margins. Gray 
market activity—that is, the sale of genuine trademarked 
products through distribution channels unauthorized by the 
manufacturer or brand owner—poses a direct and signifi- 
cant threat to manufacturers’ deployed resale restrictions. 
Given free reign, gray markets create free-riding problems 
across distributors that provide customer service, make a 
selective distribution system more intensive, and harm dis- 
tributors that have made specific investments in the channel 
of distribution. Gray marketing is problematic for manufac- 
turers because it can have a negative impact on distributor 
relations and the manufacturer’s brand equity, ultimately 
undermining the integrity of the distribution channel 
(Corey, Cespedes, and Rangan 1989; Myers and Griffith 
1999). 

Gray markets are endemic across, though not limited to, 
a wide variety of industries, ranging from heavy construc- 
tion equipment, personal computers, and cellular phones to 
perfumes, watches, personal care, and other consumer prod- 


[: today’s competitive markets, manufacturers increas- 
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ucts. Estimates of gray market activity range from $10 bil- 
lion in economy-wide annual gray market sales (Cespedes, 
Corey, and Rangan 1988) to $20 billion in the information 
technology sector alone (estimate by the Alliance for Gray 
Market and Counterfeit Abatement; see www.agmaglobal. 
org). Surveys conducted by Michael (1998) and Myers 
(1999) confirm the increasing incidence and scope of gray 
market activity. 

Most firms are keenly aware of the costs of allowing 
gray market activity to go unchecked (Bucklin 1993; 
Hwang 1999; Nelson 1999) and allocate significant 
resources to limiting the incidence of violations. In general, 
the prevailing advice from the academic literature is that 
increasing enforcement severity (i.e., raising the magnitude 
of the punitive response undertaken) lowers the incidence of 
gray market activity. For example, the enforcement litera- 
ture in marketing (Banerji 1990; Dutta, Bergen, and John 
1994) exclusively focuses on severity as the dimension of 
enforcement that firms should manage. Such a focus 
reflects the implicit assumption that the critical manage- 
ment variable for manufacturers seeking effective gray mar- 
ket deterrence is "the severity of the principal's disciplinary 
response to an agent's violation of a contractual obligation" 
(Antia and Frazier 2001, p. 67). 

Although such advice may seem intuitively appealing 
and finds favor with those urging manufacturers to "get 
down to business" against gray market activity (Nelson 
1999), to the best of our knowledge, there is no evidence of 
the efficacy of severe enforcement in the context of gray 
markets. A well-established literature spanning economics, 
law, sociology, and social psychology, which has come to 
be known as the deterrence doctrine, proposes that severe 
enforcement alone may not be enough to curb the incidence 
of violations. Instead, the doctrine advocates a broader 
notion of enforcement, comprising (in addition to severity) 
the certainty and speed of the response (Howe and Brandau 
1988; Manson 2001; Posner 1985). 

Understanding how these dimensions of enforcement 
work together is essential to managing gray market inci- 
dence more effectively. Yet to the best of our knowledge, no 
attempt has been made to incorporate the possible role of 
severity, certainty, and speed of enforcement together in 
curbing gray market incidence; severity remains the sole 


Journal of Marketing 
Vol. 70 (January 2006), 92—106 


focus of any discussion of enforcement, both in the trade 
press (Hwang 1999; Nelson 1999) and in the relevant mar- 
keting literature (Antia and Frazier 2001; Bergen, Heide, 
and Dutta 1998). Accordingly, current enforcement-related 
advice to managers is limited. 

In this article, we attempt to broaden our conceptualiza- 
tion of enforcement and explore the incidence of gray mar- 
ket violations as a function of the severity, certainty, and 
speed of the manufacturer’s general enforcement behavior. 
In the interests of appropriate model specification, we also 
incorporate several context-specific determinants of inci- 
dence gained from multiple in-depth interviews and a 
review of the rich literature on gray markets (Ahmadi and 
Yang 2000; Cavusgil and Sikora 1988; Corey, Cespedes, 
and Rangan 1989; Coughlan and Soberman 1998). 


We first test our hypotheses with data collected from a ' 


field survey of 104 manufacturers in the personal care prod- 
ucts industry. The difficulty of obtaining such data is well 
acknowledged, given the sensitive nature of the phenome- 
non and its implications (Cavusgil and Sikora 1988; Myers 
1999). Ours is the first study to bring evidence to bear on 
the relationship between enforcement and gray market inci- 
dence. We then complement the insights gained from the 
field study with an experiment that we designed to (1) pro- 
vide the perspective of the target rather than the source of 
enforcement and (2) manipulate the facets of enforcement 
in a controlled setting. Together, both studies yield rich 
insights into the impact of enforcement on deterrence. 

In the sections that follow, we introduce the deterrence 
doctrine as the lens through which we view the relationship 
between gray market incidence and enforcement. We then 
discuss hypotheses that link gray market incidence to the 
three facets of enforcement. We follow this with descrip- 
tions of the research method and a discussion of the results 
and implications of our studies. We conclude with the limi- 
tations of our study and provide avenues for further 
research. 


Conceptual Framework 


Gray Market Incidence, Enforcement, and the 
Deterrence Doctrine 


The notion of deterrence has its roots in the seminal works 
of Bentham (1962) and Beccaria (1963) and refers to the 
preventive effect of actual or threatened punishment on 
potential offenders (Ball 1955). In the current context, the 
aim of deterrence is to minimize or preclude violations of 
deployed resale restrictions not only by the perpetrator but 
also by other potential offenders. Thus, we relate the inci- 
dence of gray market violations to the manufacturer’s 
enforcement behavior in general rather than the punitive 
response to a particular violation (cf. Antia and Frazier 
2001; Bergen, Heide, and Dutta 1998). 
Within a gray market context, enforcement refers to the 
means by which the manufacturer ensures distributor com- 
pliance with resale restrictions, including fines, litigation, 
social ostracism, and termination. The manufacturer’s abil- 
ity to enforce stems from the legitimate authority bestowed 
by explicit or implicit agreements with its intermediaries 


(see French and Raven 1959), but this is not the only way to 
deter gray market activities or achieve dealer compliance. 
In addition to enforcement, manufacturers may also with- 
hold ongoing benefits, such as promotional discounts, 
advertising support, and training, from noncompliant down- 
stream intermediaries. Such influence attempts are related 
to, though distinct from, enforcement as we define it. 

Severity is fundamental to the notion of enforcement 
and can be defined as the strength or intensity of corrective 
actions across detected violations (Gibbs 1975). The costs 
imposed on the violating party are a direct function of the 
severity of the punishment that is received. The magnitude 
of a fine affects the offending distributor’s net payoff from a 
gray market violation (in this case, the profit captured by 
the violation less the fine). Other nonfinancial costs may 
also be important. The costs associated with litigation and 
termination include the time spent in legal consultation and/ 
or in seeking alternative sources of supply. Gray market 
violations are also likely to have a negative effect on the 
relationship between the manufacturer and the supplier, 
thus increasing monitoring and surveillance costs. Overall, 
the potential gains from violating resale restrictions are 
eroded by nontrivial costs, which reduces the incentive to 
commit the violation (Becker 1968; Stigler 1970). 

The marketing literature treats enforcement and its 
severity as essentially synonymous. Dutta, Bergen, and 
John (1994) and Bergen, Heide, and Dutta (1998) conceptu- 
alize enforcement in terms of termination. Antia and Frazier 
(2001) explicitly define enforcement in terms of severity. 
Moreover, as it currently stands, the preceding literature 
could be characterized as advocating (even taking for 
granted) a single-minded focus on severity. Such a focus 
rests on the assumption that the rigor of enforcement has a 
Significant inverse relationship to the incidence of viola- 
tions, notwithstanding variations in certainty or speed. 
Thus: 


Hı: The greater the severity of enforcement, the lesser is the 
likelihood of gray market incidence. 


Interaction with Certainty 


In contrast to the single-minded focus on severity as a deter- 
minant of deterrence in the marketing literature, work in 
other disciplines emphasizes the need to consider enforce- 
ment certainty (the likelihood that a violator will be pun- 
ished) and speed (the time taken by the manufacturer to 
undertake corrective action). As originally conceptualized 
and specified, the deterrence doctrine emphasized the facets 
of severity and certainty (Andenaes 1971; Bailey and Smith 
1972; Bentham 1962; Gibbs 1968). Subsequent augmenta- 
tions acknowledge the potential role of speed and incorpo- 
rate its impact on deterrence (Gray et al. 1982). Mirroring 
the development path of the literature, we first focus on the 
potential deterrent impact of severity and certainty and then 
incorporate the role of speed in our hypotheses. 

Research in sociology conceptualizes deterrence as a 
joint function of severity and certainty (Antunes and Hunt 
1973; Gray and Martin 1969). The severity of enforcement 
is posited to have significant deterrent value but only when 
it is matched with a high likelihood of action being taken. 
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The notion of matching facets of enforcement implies the 
need for high levels of each facet of enforcement in order to 
achieve deterrence. 

By certainty, we mean the manufacturer’s general 
propensity to undertake enforcement in response to viola- 
tions (Gibbs 1975). To take corrective action, the manufac- 
turer must be able to obtain information about violations 
once they occur (Dutta, Heide, and Bergen 1999) and have 
the motivation to enforce (Antia and Frazier 2001). This is 
our point of departure from the deterrence doctrine, as rep- 
resented in the sociology literature. In the latter case, the 
motivation to eradicate or at least minimize crime is a 
given. In contrast, manufacturers that detect gray market 
violations may undertake selective enforcement for a vari- 
ety of reasons (Banerji 1990; Bergen, Heide, and Dutta 
1998; Coughlan and Soberman 1998). In the current con- 
text, therefore, certainty of enforcement is a function of the 
manufacturer’s ability and motivation. The former is 
reflected in the manufacturer’s detection ability because 
enforcement hinges on first knowing that a violation has 
occurred (Antia and Frazier 2001). 

Unfortunately, gray markets raise three well-known 
problems with regard to inferring motivation. First, in con- 
trast to sociology, the certainty of enforcement cannot be 
estimated conveniently from archival information on 
reported crime statistics and conviction rates (Gibbs 1975), 
because no such information exists for gray market viola- 
tions. In the absence of archival information on the inci- 
dence of violations and manufacturers’ enforcement 
responses, it is necessary to rely on microlevel, survey- 
based data to gain insights into this highly sensitive issue. 
Second, manufacturers do not know of all instances of vio- 
lations (Wathne and Heide 2000), because detection is 
costly (Stigler 1970) and imperfect (Dutta, Heide, and 
Bergen 1999). Third, and perhaps equally important, no 
manufacturer would admit to selective enforcement, thus 
introducing social desirability bias (Fisher 1993). 

Recognizing this potential for bias, survey-based studies 
on enforcement have not solicited informant evaluations of 
their own likelihood of taking action. Instead, motivation is 
inferred from the reported enforcement response to particu- 
lar instances of violations (for recent examples of this 
approach, see Antia and Frazier 2001; Bergen, Heide, and 
Dutta 1998). Moreover, previous studies have hypothesized 
and found evidence of a direct and strong relationship 
between detection ability and likelihood of enforcement 
(Antia and Frazier 2001; Dutta, Bergen, and John 1994; 
Dutta, Heide, and Bergen 1999; Ghosh and John 1999). 
Given the preceding difficulty and in light of the strong 
relationship between the two constructs, we represent cer- 
tainty in terms of detection ability. Our approach is consis- 
tent with and builds on the existing marketing literature on 
enforcement. 

The rationale for matched and high levels of severity 
and certainty suggests that when perpetrators decide 
whether to engage in gray market activities, they consider 
not only the previously discussed costs imposed by severe 
enforcement but also the likelihood of severe enforcement. 
If detection ability and, consequently, the certainty of an 
enforcement response are low, potential miscreants are 
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likely to discount the costs imposed by severe enforcement 
(Luckenbill 1982). Severity poses no threat to the offender 
if the offence remains undetected by the manufacturer. 
Similar discounting of negative outcomes occurs if, for 
example, high detection capabilities are accompanied by 
less severe enforcement behavior. In this case, potential vio- 
lators may reason that though the likelihood of apprehen- 
sion is high (as a result of well-honed detection capabili- 
ties), their offenses are likely to incur nothing more serious 
than a “slap on the wrist" (Stafford et al. 1982). Proponents 
of this perspective argue that the credibility of sanctions 
hinges on both detection ability and severity (Gray et al. 
1982; Hollinger and Clark 1983). The interaction perspec- 
tive is also known as the "credibility of severe sanctions 
hypothesis" (Grasmick and McLaughlin 1978) and suggests 
the following: 


H,: Severe enforcement deters gray market incidence when 
the certainty of enforcement 1s high 


The Augmented Deterrence Doctrine 


Early empirical work on deterrence (Silberman 1976) 
focused exclusively on the certainty and severity of enforce- 
ment. Only in later studies (Gray et al. 1982) did scholars 
acknowledge and incorporate a third, additional dimension 
of enforcement: speed. The speed of enforcement refers to 
the time elapsed between the detection of violations and the 
corrective actions taken in response. Similar to severity and 
detection ability, our notion of speed is the time taken to 
respond to detected violations. 

Although the relationship between enforcement speed 
and deterrence is not as well specified as it is for detection 
ability and severity, our interdisciplinary literature review 
yields some insights into the effects of variation 1n enforce- 
ment speed. First, swift (slow) corrective action reduces 
(increases) the length of time over which the party commit- 
ting the violation may enjoy its payoff (Tedeschi 1976). 
Second, delay in applying sanctions allows the violating 
party “room to maneuver,” that is, to undertake actions to 
avoid (or at least minimize) bearing the brunt of the costs 
imposed by corrective action (Hufbauer, Schott, and Elliott 
1990). Third, research in psychology (Diver-Stamnes and 
Thomas 1995) and law (Manson 2001) suggests that tempo- 
ral proximity of the violation and the consequent corrective 
enforcement action reinforce the punitive consequences to 
the perpetrator, as well as the group at large. 

In contrast to tbe well-documented interaction between 
severity and detection ability, there has been little, if any, 
attention focused on explicating the interactions involving 
speed and severity (for a notable exception, see Gray et al. 
1982). The logic underlying this interaction mirrors that 
which we proposed in H., that the deterrent impact of 
enforcement is deemed to reside in high levels of speed 
combined with severity. It is plausible that severe enforce- 
ment may do little to deter violations if the enforcement 
response is inordinately delayed. Because a slow response 
allows the perpetrator time to enjoy a longer payoff from 
the violation, gives the perpetrator room to maneuver to 
avoid bearing the full brunt of enforcement, and/or weakens 
the causal link between violation and enforcement, delayed 


enforcement could dampen the deterrent impact of severity. 
By extension, severity should have a significant effect on 
deterrence when both the likelihood of detection and the 
speed of enforcement are high. An enforcement strategy 
that involves high levels of all three components serves to 
emphasize the manufacturer’s commitment to maintaining 
system integrity by presenting a unified, unambiguous 
deterrent. Thus: 


Ha: Severe. enforcement deters gray market incidence when 
the speed of enforcement is high. 


Hy: Severe enforcement deters gray market incidence when 
both detection ability and the speed of enforcement are 
high. 


Context-Specific Determinants of Gray Market 
Incidence 


Scholars. of deterrence agree that any examination of deter- 
rence must account for contextual variables that have a 
bearing on the incidence of violations, independent of 
enforcement (Gibbs 1975; Meier and Johnson 1977). Our 
review of the academic literature on gray markets yielded 
five factors that emerged most frequently: price differen- 
tials, premium brand positioning, product scarcity, free- 
riding potential, and customer heterogeneity on services 
demanded. 

According to Onkvisit and Shaw (1989, p. 205), “price 
differential is the only true reason for the gray market to 
exist." In attempting to price optimally for local conditions 
(Bucklin 1993; Duhan and Sheffet 1988), manufacturers 
facilitate arbitrage, whereby the diverter may source prod- 
uct from the low-priced market and sell it in the high-priced 
market without the manufacturer's authorization (Assmus 
and Wiese 1995). A related driver of gray market violations 
is the extent to which the product has a unique and differen- 
tiated brand name (Bucklin 1993). The ability to obtain a 
product with brand name cachet at significantly lower 
prices is a powerful motivator of consumer purchase (Ass- 
mus and Wiese 1995), which in turn drives the occurrence 
of gray marketing. Gray market violations are also likely to 
be high when the authorized distributors in a market are 
unable to satisfy demand for the product. Pent-up demand 
for a product creates the incentive necessary for arbitrage to 
occur, and goods are diverted from markets in which the 
product is readily available (and usually lower priced) to the 
markets in which the product is in short supply (Banerji 
1990). 

The potential for some distributors or unauthorized third 
parties to free ride off full-service distributors' market 
development efforts (Mathewson and Winter 1984; Rubin 
1990; Telser 1960) also facilitates the development of a 
gray market (Cross, Stephan, and Benjamin 1990). 
Although services such as advertising, pre- and postsales 
service, and warranty servicing (Coughlan et al. 2001) no 
doubt add value to the product, the revenue streams accru- 
ing to authorized distributors from their marketing efforts 
are vulnerable to appropriation (Anderson and Weitz 1986). 
Customer heterogeneity on services also is likely to foster 
gray markets. Some customers may value and even require 
services, whereas others may not be willing to pay for per- 


ceived “frills.” Enterprising diverters would be quick to pro- 
vide the actual product with little or no augmentation to 
those unwilling to pay the high price for the “bundled” 
product or service offering, thus realizing incremental 
(albeit unauthorized) sales to an additional, hitherto ignored 
customer segment (Ahmadi and Yang 2000; Yang, Ahmadi, 
and Monroe 1998). The preceding discussion leads to the 
following hypothesis: 


Hs: The greater the (a) price differential, (b) premium posi- 
tioning, (c) product scarcity, (d) free-riding potential, and 
(e) customer heterogeneity on services in a market, the 
greater is the likelihood of gray market incidence. 


Research Method 


Study 1: Field Survey of Gray Marketing of 
Personal Care Products 

Our sampling frame comprised U.S.-based manufacturers 
of branded personal care products (e.g., skin toners, hair 
and nail care products, medicated soap) that distribute 
through wholesaler distributors in the professional salon 
industry. The reliance on external (non-company-owned) 
distribution intermediaries and the high value-to-weight 
ratio of the products in this study make this a particularly 
appropriate context in which to test our hypotheses. 
According to the Beauty and Barber Supply Institute, the 
association representing personal care product manufac- 
turers and distributors, more than half of its members had 
experienced gray marketing of their products in some form 
or other. Thus, the phenomenon of gray markets is well 
known to most industry participants. 

We defined the term *gray marketing" for informants, 
and then we asked them to focus on a geographic market in 
which they had experienced, or were likely to experience, 
gray marketing. After they identified such a market, we 
asked them to report their enforcement behavior (i.e., sever- 
ity, certainty, and speed) in general rather than in response 
to a particular violation. We did this to assess deterrence 
attributable to a general propensity for enforcement rather 
than a particular response. Consistent with previous 
channel-related research in marketing (Anderson, Hakans- 
son, and Johanson 1994; Anderson and Weitz 1992; Antia 
and Frazier 2001; Cavusgil and Sikora 1988; Dutta, Heide, 
and Bergen 1999; Dwyer, Schurr, and Oh 1987), we 
solicited manufacturers’ evaluations of their own severity, 
detection ability, and speed of enforcement behavior. We 
developed and refined the survey instrument on the basis of 
a literature review, interviews with channel management 
personnel from the personal care products industry, and a 
mailed pretest. 

Measures. Our dependent variable of interest, incidence 
(INCID), is a dichotomy that describes whether gray mar- 
keting occurred during the previous two years. We coded 
INCID as 1 if the informants indicated that they were aware 
of diversion of any product from their product line within 
the specified time period and 0 if otherwise. 

We measured the independent variables with multi-item 
summated scales. The final item sets, response formats, 
individual item loadings, composite reliability, and average 
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variance explained (AVE) for each scale appear in Appen- 
dix A. The preceding metrics provide evidence that the psy- 
chometric properties of our measures are satisfactory. Table 
1 displays the correlation matrix and descriptive statistics 
for the variable set. In the following sections, we describe 
the measurement approach for each variable. 

The manufacturer’s enforcement severity (SEV) con- 
sists of a four-item Likert scale. We derived the items that 
constitute the scale from the work of Antia and Frazier 
(2001), but we modified them to ground the measure in our 
research context. Detection ability (DETECT) reflects the 
extent to which the manufacturer is able to evaluate the 
extent and nature of instances of product diversion. We 
derived the scale for this construct from the work of Dutta, 
Heide, and Bergen (1999) and, again, adapted it to our 
research context. The final scale is composed of four 
reverse-coded Likert items. Enforcement speed (SPEED) 
refers to the time elapsed between detection of the violation 
and the manufacturer’s undertaking of corrective action. We 
developed the four specific items that constitute the scale 
using prior research in sociology and social psychology 
(Erickson, Gibbs, and Jensen 1977; Howe and Brandau 
1988; Howe, Brandau, and Loftus 1996) and modified them 
on the basis of field interviews. 

Price differential (PRDIFF) refers to the extent to which 
the manufacturer charges different prices for the product 
across markets. Our scale for this variable consists of a 
three-item Likert measure. Premium positioning (PREM) 
describes the extent to which the manufacturer positions the 
specific product as a high-end “premium” brand. On the 
basis of our literature search and field interviews, we devel- 
oped a four-item Likert scale for this variable. Product 
scarcity (SCARCE) reflects the manufacturer’s inability to 
supply the relevant market with enough product to satisfy 
demand. We measured the extent of product scarcity with a 
four-item Likert scale. 

Potential for free riding (FR) reflects the extent to 
which the manufacturer believes that other authorized dis- 
tributors or unauthorized third parties can gain from the 
sales efforts of its authorized distributors in that market. 
During the interviews we conducted, it became clear that 
we needed to capture the risk of both types of free riding. 
This corresponds to a second-order confirmatory factor 
model, in which the observed six items (three items reflect- 
ing each first-order factor) are hypothesized to originate 
from the two first-order factors (AUTHFR and UNAU- 
THFR), and in turn, the first-order factors originate from a 
second-order factor. 

Customer heterogeneity (HETER) reflects the extent to 
which the specific market is characterized by distinct cus- 
tomer segments, in which each segment values services dif- 
ferently. We measured this variable with a three-item Likert 
scale, which we developed and refined on the basis of 
pretests. 


Data collection. The Beauty and Barber Supply Institute 
agreed to send out a letter on our behalf that explained the 
purpose of the study and informed each of its members to 
expect further communication from us. A week later, we 
contacted each firm by telephone to introduce ourselves, 
ascertain the willingness to participate, and identify suitably 
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qualified key informants. The modal designation of the 
informants was general manager; sales and channels man- 
agers were also well represented. Of the 581 firms called, 
we were unable to contact 21, 26 expressed their unwilling- 
ness or inability to participate, and another 41 had open 
lines or used company-owned distribution channels, leaving 
us with a final sample of 493. 

We then sent out a survey packet by mail to the identi- 
fied key informants at the remaining 493 manufacturer 
firms. We dropped an additional 20 informants from the 
study because they indicated that the study did not apply to 
their current business context. Two further rounds of call- 
backs and two subsequent mailings yielded a total of 104 
usable questionnaires, representing a 22% response rate. Of 
the 104 responding firms, 40 reported experiencing at least 
one incidence of gray marketing, and 64 reported otherwise. 
Nonsignificant differences between early and late respon- 
dents (Armstrong and Overton 1977) on the focal variables 
of the study suggest that nonresponse bias is unlikely. Post 
hoc checks indicate acceptable levels of involvement (M = 
3.92 on a five-point scale; SD = 1.37) and knowledge (M = 
4.34; SD = .97) with respect to their firms’ dealings in the 
specific product-market. 


Measure validation. Given our sample size and the 
number of constructs, we conducted confirmatory factor 
analyses (CFAs) on groups of maximally similar constructs 
(see Moorman and Miner 1997), namely, the enforcement 
characteristics (severity, speed, and detection ability) and 
the contextual variables (price differences, customer hetero- 
geneity, premium positioning, and supply shortage). The 
higher-order construct conceptualization of free-riding 
potential necessitated the estimation of 14 additional para- 
meters, thus requiring us to conduct a separate CFA for this 
construct to meet sample size requirements. The CFA 
model diagnostics (see Table 2) suggest unidimensionality 
of the reflective scales. All items loaded on their prespeci- 
fied constructs and had t-values significant at .05, providing 
evidence of convergent validity. Discriminant validity of the 
scales is further supported by the Lagrange-multiplier tests; 
none of the possible cross-loadings exceeds the critical 
value of the chi-square with one degree of freedom (Speier 
and Venkatesh 2002). 

Given the significant correlation between enforcement 
severity and its speed, we undertook a rigorous test of the 
discriminant validity of these constructs in particular. We 
asked three academics with expertise in the area of enforce- 
ment to sort the eight items that purported to measure sever- 
ity and speed. All three experts were able to perform the 
item-sorting task with full accuracy. Interitem correlations 
among the items measuring severity (average Pseventy = -68, 
p < .0001) and speed (average Pspeed = .73, p < .0001) are at 
a significantly higher level than cross-construct item corre- 
lations (average Pspeed severity = -53, p < .0001); t-tests of dif- 
ferences further suggest that the differences in correlations 
are significant. We also found that the AVE for each con- 
struct was greater than the squared correlation between the 
constructs (Fornell and Larcker 1981), thus providing fur- 
ther evidence of discriminant validity between severity and 
speed of enforcement. In all, the preceding tests suggest 
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TABLE 2 
Study 1: CFA Results for Reflective Measures 


Average 
Off-Diagonal Compara- Incremental Tucker- Lowest 

Squared tive Fit Fit Lewis t-Value of 
Construct Groups y2 p Value Residual index Index Index Loading 
SEV, SPEED, DETECT . x&,- 90.03 .001 07 .96 .96 .94 5.81 
FH x2 = 16.61 .03 .10 .98 .98 .96 7.36 
PRDIFF, SCARCE, 

PREMIUM, HETER X5, = 89.9 .07 .05 .97 .97 .96 5 26 


that severity and speed of enforcement have adequate dis- 
criminant validity. 

Model specification. We test our hypotheses using a 
maximum likelihood estimation-based logistic regression 
model, first specifying the main effects only (Equation 1) 
and then adding the two- and three-way interactions among 
the enforcement characteristics (Equation 2) as follows: 


|. 8 
D 
and 


(1) PINCID; = 1) = 


D 
1+ vo + Yun 
l 
12 
sn 2x4 
14 oa + Yun 
| 


where INCIDi = 1 if firm i reports at least one gray market 
violation and 0 if otherwise, and 


(2) PINCID: = 1) = 


XG] = SEV, 

X; = DETECT, 
X4 = SPEED, 
X4 = PRDIFF, 
X5 = FR, 

Xis = SCARCE, 
X17 = PREM, 
Xig = HETER, 


X, = SEV x DETECT, 
X9 = SEV x SPEED, 

Xu = SPEED x DETECT, and 
X, = SEV x SPEED x DETECT. 


To reduce the potential multicollinearity arising from 
multiplicative interaction terms (Aiken and West 1991) and 
to yield coefficient estimates within the observable range of 
the independent variables (Friedrich 1982; Jaccard, Turrisi, 
and Wan1990), we mean centered all the explanatory varia- 
bles and interactions created as the product of the mean- 
centered components (see Brown, Dev, and Lee 2000; Jac- 
card, Turrisi, and Wan 1990; Rokkan, Heide, and Wathne 
2003).1 


IMain effects in this case represent the conditional effects of 
each enforcement characteristic on incidence at moderate (mean) 
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Table 3 displays the estimation results for both equa- 
tions. The likelihood ratio for the main effects vector of 
coefficients is not significantly different from zero (%23) = 
1.58, p = .66). In contrast, the likelihood ratio is signifi- 
cantly different from zero for the two-way (%2) = 6.69, p = 
.08) and three-way (x?(,) = 2.82, p = .09) interactions. Fur- 
thermore, the interaction model (y? = 35.71, p = .0004) 
gives us confidence that the null hypothesis of all the coef- 
ficients being zero can be rejected. The model correctly 
classifies 80% of the observations, suggesting that the 
inclusion of the interaction terms is warranted. Therefore, 
we discuss the findings of this study as inferred from the 
estimation of the interaction model.2 


Study 1 results. None of the three enforcement charac- 
teristics (severity, detection ability, and speed) has a signifi- 
cant impact on deterrence (incidence) in isolation (b; = .11, 
b, = ~.04, b; = .01). However, our examination of the full 
model coefficients requires us to separate the main effects 
from the interaction terms (for a full explanation, see Aiken 
and West 1991, p. 38; for a recent application in a channels 
context, see Brown, Dev, and Lee 2000). We do this by dif- 
ferentiating Equation 2 with respect to each of the three 
enforcement characteristics. Equation 3 shows how INCID 
varies as a function of SEV (for derivation and details on 
obtaining simple slopes, standard errors, and t-values, see 
Appendix B; for the results, see Table 4). 


dINCID 


© ISEV 


= b, + bs DETECT + b;SPEED + b; SPEED 


x DETECT 


levels of the other two facets. This interpretation 1s distinct from 
the “constant effect of one variable over all values of another 
variable” (Aiken and West 1991, p. 38) interpretation, as is the 
case with uncentered (raw) data. However, both approaches are 
consistent in that estimates of the latter may be recovered as a spe- 
cial case of the mean-centered approach, if required (Freidrich 
1982). 

2Table 1 shows evidence of considerable multicolinearity 
between the interaction variables and their components, such mul- 
ticolinearity 1s known to inflate the standard errors of the param- 
eter estimates, though the estimates themselves remain unbiased 
(Jaccard, Turrisi, and Wan 1990). The statistical significance of the 
higher-order interactions in the presence of the lower-order terms 
indicates that the imprecision due to multicolinearity does not 
pose a threat to validity (Buvik and John 2000). Parameter esti- 
mates and significance levels remain robust across both model 
specifications (the main effects—only and the fully specified inter- 
action model), thus lending further confidence in our findings. 


TABLE 3 
Study 1: Parameter Estimates 








Full Model 
Maln Effects Only Coefficient 
Re (Interactlons Hypothesis 
Independent Variable Coefficient t-Value Included) t-Value Supported?a 
Constant — 661 —2.73*** —.519 —1.8** 
DETECT (b4) —.012 —.19 .109 1.20 H4 (No) 
SEV (bs) —.042 —.53 —.041 — 49 
SEV x DETECT (bs) —.055 —2.04** Hə (Yes) 
SPEED (b4) .065 81 .010 11 
SEV x SPEED (bs) —.009 —.59 Ha (No) 
SPEED x DETECT (bg) —.008 —.30 
SEV x SPEED x DETECT (by) —.007 —1.68** H; (Yes) 
PRDIFF (bg) .147 1.60* .145 1:517 Hs, (Yes) 
FR (bg) 190 3.01*** 206 2.89*** Hsp (Yes) 
PREM (b40) .035 .58 .056 .84 Hse (No) 
SCARCE (b41) .094 1.31* .121 1 45* Hsg (Yes) 
HETER (b49) .201 2.08** .194 1.82” Hse (Yes) 
X& = 25.44; p = .0013 Xo = 35.71; p = .0004 
*p « .10 (one-tailed test). 
™p < .05 (one-tailed test) 
""*p < .01 (one-tailed test) 
aBased on Interaction model estimates. 
TABLE 4 


Study 1: Impact of Enforcement Characteristics on Incldence 


Estimated Impact 
on Incidence 





Enforcement Characteristic (Simple Slope) SE t-Value 
Main Effects 
SEV .098 .145 .67 
DETECT 246 152 1.61 
SPEED —.046 .127 —.36 


Interaction Effects 
Impact of SEV on INCID at various levels of DETECT (Ho) 


DETECT iow .187 .137 1.35 

DETECT aerae —.041 .084 —.49 

DETECThigh —.269 .137 —1.96* 
Impact of SEV on INCID at various levels of SPEED (H3) 

SPEEDiow —.028 127 —224 

SPEED moderate .010 .091 .110 

SPEED 048 .087 .556 


Impact of SEV on Incidence at various levels of DETECT 
and SPEED (H4) 


DETECThgh and SPEEDpgh —43 157 —2.85* 
DETECThigh and SPEED ow ~11 157 —70 
DETECT,, and SPEED, 27 157 1.80 
DETECT gy and SPEED ow 10 157 65 


*Significant at the .05 level (two-talled test). 


The t-value of .098 is not significant at the .05 level. different from zero. We now proceed to examine the inter- 


Therefore, we are unable to reject the null hypothesis for action coefficients. 
SEV, which in turn enables us to reject the main effect we We find that just one two-way interaction is statistically 


hypothesized in H,. We evaluate the main effects of significant: SEV x DETECT (b, = —.055, p < .05). On fur- 
DETECT (dINCID/ODETECT = .246) and SPEED ther probing (see Table 4), we find that strict enforcement 
(QINCID/dSPEED = —.046) similarly (for estimates, see behavior has a significant inverse relationship to gray mar- 
Table 4); in each case, the main effect is not significantly . ket incidence when it 1s combined with high detection abil- 
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ity (OINCID/OSEV = -.269, p < .05). Note that no such 
deterrent impact occurs under low (dINCID/dSEV = .187, 
p > .10) or even moderate (QINCID/dSEV = —.041, p > .10) 
levels of detection ability. The preceding result underscores 
the importance of matched and high levels of severity and 
detection ability, in support of the interaction we hypothe- 
sized in Hz. We fail to find support for the exploratory 
hypotheses with respect to SPEED’s pairwise interaction 
with SEV (bs = —009). That is, SPEED does not seem to 
confer any deterrent impact in combination with SEV or 
with DETECT. 

In Table 3, we find that the three-way interaction among 
SEV, SPEED, and DETECT is statistically significant (b; = 
—.007, p < .05), in support of Hy, suggesting that the combi- 
nation of all three facets of enforcement has a deterrent 
impact. Post hoc probing of this significant interaction 
reveals that increasing SEV results in deterrence only when 
SPEED and DETECT both have high levels (the simple 
slope of SEV on INCID is negative and significant). As can 
be seen in Table 4, if either SPEED or DETECT is low, the 
simple slope of SEV on INCID is not significant. 

The significant and inverse relationship between SEV x 
DETECT and INCID suggests that the combination of high 
levels of SEV and DETECT is sufficient to exert a deterrent 
impact on gray market activity. Furthermore, the significant 
three-way interaction suggests that when SPEED is added 
to the combination of SEV and DETECT, deterrence is 
enhanced. Our inability to find similar effects for the pair- 
wise combinations involving SPEED or for a main effect of 
SPEED suggests that speed must be accompanied by severe 
action and high detection ability to achieve incremental 
deterrence. Swift enforcement provides an additional deter- 
rent "boost," but not by itself. 

Our hypothesis with respect to the gray market-specific 
determinants of incidence finds significant support for four 
of the five contextual variables. The presence of price dif- 
ferentials among markets has a significant, though mar- 
ginal, effect on gray market incidence (bg = .145, p « .10). 
Free-riding potential has a strong, significant, direct rela- 
tionship to gray market activity (bg = .206, p < .01). We do 
not find any support for the hypothesized relationship 
between premium positioning and gray market incidence 
(bio = .056). However, we find that product scarcity is posi- 
tively related to the likelihood of gray market violations, 
though marginally so (bj; = .121, p < .10). We also find a 
strong positive relationship between customer heterogeneity 
on services and gray market incidence (b, = .194, p < .05). 

To summarize, our examination of the main and interac- 
tions effects suggests that none of the characteristics of 
enforcement has any significant deterrent impact in isola- 
tion. Matched and high levels of detection ability and sever- 
ity are prerequisites for achieving deterrence, and swift 
enforcement boosts the preceding interaction. 


Study 2: Experiment 


We conducted Study 2 to replicate and extend the results of 
Study 1 with an alternative methodology, context, and mea- 
sures. First, we conducted Study 2 as an experiment 
because the dimensions of enforcement severity, detection 
ability, and speed are likely to be naturally associated in 


100 / Journal of Marketing, January 2006 


field contexts. Manufacturers that are concerned with gray 
market activities tend to design detection systems that 
enable them to identify problems accurately and then react 
both quickly and severely against the offending firm. In 
contrast, manufacturers that are not affected by gray market 
activities are likely to be relatively lax on all three enforce- 
ment facets. Given this logic, some combinations of 
enforcement strategies are unlikely to occur naturally in 
field settings, such as high severity but low speed and cer- 
tainty of detection or low severity and low speed but high 
certainty. The coincident use of these enforcement charac- 
teristics is evident in our first study with significant correla- 
tions between severity and speed (r = .68, p « .05), severity 
and detection (r = .28, p « .05), and detection and speed (r = 
.29, p < .05). Therefore, Study 2 uses an experimental 
design to examine the full range of possible enforcement 
strategies and to use orthogonal manipulations that enable 
us to assess the independent effects of severity, detection, 
and speed on deterrence. 

Second, Study 2 assesses the robustness of the results of 
study 1 by examining deterrence from the perspective of 
the dealer rather than the manufacturer. Given that it is deal- 
ers’ evaluations of the effectiveness of the enforcement 
facets that affect their willingness to engage in gray market 
activities, it is important to understand their perceptions. 
Study 2 also differs from Study 1 in its use of a continuous 
rather than dichotomous measure of deterrence, thus 
enabling us to estimate the psychometric properties of the 
dependent variable. 


Procedure. Participants were 112 MBA students who 
were recruited by student representatives of an on-campus 
exchange program at a major university. A donation of $10 
was made toward the program on behalf of each participant 
who completed the study. Sixty-four percent of the partici- 
pants were male, their average age was 29 years, and they 
had an average of three years of management experience. 
Seventy-seven percent of participants had at least one year 
of experience as “a supplier or dealing with suppliers in a 
business situation." 

After a brief introduction to a study on manufacturer— 
distributor relationships, participants were randomly 
assigned to one condition in a 2 (severity: high versus 
low) x 2 (certainty: high versus low) x 2 (speed: fast versus 
slow) full-factorial between-subjects design. Participants 
read a scenario that explained that an electronics compo- 
nents dealer had violated its agreement with a manufacturer 
by selling $100,000 of integrated circuits outside its terri- 
tory. The scenario included manipulations of the severity, 
certainty, and speed of the manufacturer's response. The 
method assumes that participants project themselves into 
the hypothetical situation and provide answers that reflect 
how dealers would actually respond to the situation outlined 
in the scenario. Evidence suggests that projective methods 
can accurately represent participants' underlying attitudes 
and behaviors (e.g., Fisher 1993) and that the judgments of 
individual managers can provide important insights into 
organization-level strategies (e.g., Chandy, Prabhu, and 
Antia 2003). The scenario and manipulations appear in 
Appendix C. Participants then responded to a series of 
questions. 


The dependent variable was a three-item measure that 
reflected the degree to which the offending dealer would be 
deterred from engaging in future gray market activities by 
the manufacturer’s enforcement activities. The items 
included the following: “In future, the offending dealer 
would definitely comply with the manufacturer’s regula- 
tions governing unauthorized sales,” “In future, the offend- 
ing dealer would not dare [to] disregard the resale restric- 
tions of the manufacturer,’ and “In future, the offending 
dealer would be deterred from selling the manufacturer’s 
products in an unauthorized manner.” The items were mea- 
sured on a five-point Likert scale, anchored by “very 
unlikely” and “very likely.” The mean of the summated 
scale items was 2.74, with a standard deviation of .97 and a 
coefficient alpha of .86. 


Analysis. We used a full-factorial analysis of variance to 
examine the manipulation checks and to test the hypothe- 
ses. We found a significant effect of the detection manipula- 
tion on the item “The manufacturer is very likely to catch 
dealers who undertake gray market activities” (F(1, 111) = 
131.51, p < .01), a significant effect of the severity manipu- 
Jation on the item “The manufacturer's response to gray 
market transactions is severe" (F(1, 111) = 76.10, p < .01), 
and a significant effect of the speed manipulation on the 
item "The manufacturer punishes gray market activity 
immediately" (F(1, 111) = 122.77, p < .01). The manipula- 
tions had no unintended main or interaction effects. 

In terms of the hypotheses, we found significant main 
effects of certainty (F(1, 111) = 5.20, p « .05) and severity 
(F(1, 111) 2 22.81, p « .01) on deterrence. In terms of inter- 
actions, the two-way interaction between certainty and 
severity was significant (F(1, 111) = 5.23, p « .05), as was 
the three-way interaction effect (F(1, 109) = 4.17, p « .05). 
Examination of the simple effects reveals that severity has a 


positive effect on deterrence in three of four possible 
instances: high certainty/high speed (F(1, 27) = 10.32; 
Migw seventy = 2-43 > Mhigh seventy = 3.29, p < .01), low cer- 
tainty/high speed (F(1, 27) = 3.48; Miow seventy = 238 > 
Mhaigh severity = 3-15, p < .10), and high certainty/low speed 
(EC, 27) = 19.97; Miow seventy = 2-31 > Mhigh severity = 3-62, 
p < .001). As we predicted, severity has no effect on deter- 
rence when both certainty and speed are low (p > .90). Note 
that when certainty and severity are high, speed has no 
effect on deterrence (p > .30). We included participants’ 
gender and years of supervisory experience as covariates, 
but they were not significant (p > .20). The analysis of vari- 
ance interaction results appear in Figure 1. 


Study 2 results. The results are consistent with those of 
Study 1 in three of four possible effects of severity on deter- 
rence. Consistent with Study 1, the results of Study 2 sup- 
port the view that severity of response alone is insufficient 
to reduce gray market activities. Increasing the severity of 
the punishment for gray market violations had no effect 
when dealer violations were unlikely to be detected and 
when violators could expect a long period of time before 
they would be punished. It seems logical that the threat of 
even the most severe punishment will be ineffective if it has 
a low likelihood of imposition and if it will only take place 
in the distant future. 

Increasing the severity of the manufacturer’s response to 
gray market activity was effective when it was used in con- 
cert with systems that ensured a high likelihood of detec- 
tion, immediate responses to violations, or both. Consistent 
with Study 1, participants indicated that future gray market 
activities would be reduced when the likelihood of detec- 
tion was high, regardless of whether enforcement speed was 
fast or slow. The results support the view that deterrence is 
a function of both the severity and the likelihood of punish- 


FIGURE 1 
Study 2: Certainty x Speed x Severity Interaction Effect 
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ment. In contrast to the Study 1, Study 2 found that severity 
had a deterrence effect when speed was high but the likeli- 
hood of detection was low. By implication, speed appears to 
compensate for detection certainty in some conditions. 
Despite a low probability of detection and, thus, punish- 
ment, participants believed that dealers would be less likely 
to reoffend if severe punishments were meted out quickly. 


Discussion 


The results of the two studies are consistent in their support 
of the deterrence doctrine, despite varying dramatically on 
various dimensions, including methodology (field survey 
versus experiment), perspective (manufacturer versus dis- 
tributor), and measurement of the dependent variable 
(dichotomous measure of actual incidence versus multi- 
item measure of future recidivism). Both studies indicate 
that the severity, certainty, and speed of enforcement inter- 
act in their effects on deterrence. The results have important 
implications for both research and practice in the area. 


Research Implications 


Prior research in marketing has implicitly taken the rela- 
tionship between enforcement severity and deterrence to be 
an article of faith. So well entrenched is this assumption 
that there has been no attempt to test the efficacy of 
enforcement in a marketing context. Our central finding is 
that, by itself, enforcement severity does not deter gray 
market incidence. As a result, an expanded conceptualiza- 
tion of enforcement is necessary. Our study also sheds light 
on how the facets of enforcement work together to mini- 
mize gray market violations. Across both studies, none of 
the three characteristics of enforcement (i.e., severity, cer- 
tainty, and speed) has deterrent value alone. Rather, deter- 
rence is most likely to occur when the penalties for gray 
market violations are severe, when manufacturers are able 
to detect violations or mete out punishments in a timely 
fashion, or both. 

The introduction of speed of response further extends 
knowledge of the subtleties of enforcement. Specifically, 
we find that swift enforcement bolsters the deterrent impact 
of high levels of detection ability and severity. The ability to 
increase the effectiveness of the latter combination is partic- 
ularly interesting considering that, by itself, enforcement 
speed has little to contribute to deterrence objectives. This 
finding is of great importance to the marketing literature on 
enforcement; it emphasizes the dual imperative for an 
expanded conceptualization of enforcement, and it forces us 
to examine the deterrent impact of enforcement rather than 
take it for granted as research has done to date. 

In examining the deterrent impact of enforcement, this 
study also links the evolving marketing literature on oppor- 
tunism (Brown, Dev, and Lee 2000; Dahlstrom and 
Nygaard 1999; John 1984; Wathne and Heide 2000) and 
enforcement (Antia and Frazier 2001; Bergen, Heide, and 
Dutta 1998; Dutta, Bergen, and John 1994). The preceding 
literature bases have tended to develop along distinct paths, 
despite the obvious linkages. To the best of our knowledge, 
ours is the first study to test the assumed deterrent efficacy 
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of enforcement. Our findings set the stage for a synthesis of 
these still disparate research streams. 

Our choice of empirical context facilitates an additional 
important contribution. To date, the literature on gray mar- 
kets has tended to develop along two lines: analytical mod- 
els (Ahmadi and Yang 2000; Banerji 1990; Bucklin 1993; 
Coughlan and Soberman 1998) and descriptive case studies 
(Antia and Everatt 2000; Cespedes, Corey, and Rangan 
1988). By their very nature and design, the preceding 
efforts emphasize deeper understanding of a select few 
issues relevant to gray markets. What is needed is an inte- 
gration and subsequent large-sample validation of the com- 
monly suggested drivers of gray market activity. We address 
this crucial gap in the understanding of gray markets by 
developing an integrative model of gray market incidence 
and by validating this model with both field data and an 
experiment. Our study represents an attempt to “take stock" 
of received thought on gray market incidence. 


Managerial Implications 


Until now, the vocabulary and emphasis of the business and 
academic press have been limited to enforcement severity. 
Yet our results suggest that current prescriptions calling for 
stricter measures against gray market participants are likely 
to be ineffective. Managers who want to make progress 
against gray market incursions must invest in systems that 
increase the ability to both detect and quickly punish viola- 
tors. It is only when severe enforcement behavior is com- 
bined with certainty and/or speed that gray market inci- 
dence is significantly curbed. 

Our work points out the need to use an integrated 
approach to the management of distribution channel rela- 
tions. In addition to devoting attention to enforcement, 
manufacturers should provide adequate incentives to autho- 
rized distributors for the continued provision of valued ser- 
vices in the right mix to the right targeted segment. Due 
consideration should also be given to pricing and supply 
Issues, in acknowledgment of their potential impact on gray 
market activity. The combination of all the preceding mea- 
sures is more likely to lead to desired deterrence objectives. 


Limitations and Future Research 
Directions 


Although the two studies we reported herein provide several 
contributions to academics and managers alike, much work 
remains to be done. The term "gray marketing" encapsu- 
lates a variety of unauthorized sales, including cross-border 
parallel importation, diversion, and domestic bootlegging, 
to name but a few. Although cognizant of these subtle dis- 
tinctions, we were unable to control for the location and 
specific type of gray market activity because of nonre- 
sponse on the particular items in our survey. We have taken 
initial steps toward understanding the degree to which the 
deterrence doctrine is generalizable by testing it from both 
the manufacturer's (Study 1) and the dealer's (Study 2) per- 
spective, but a variety of other industry and violation con- 
texts remain unexamined. 


Our empirical context imposed certain constraints on 
our construct operationalizations. First, the absence of 
archival data on gray market incidence and the steps taken 
by manufacturers in response forced us to restate our con- 
ceptual framework in terms of detection ability rather than 
the notion of certainty, as originally conceptualized by the 
deterrence doctrine. Second, in Study 1, we measured gray 
market incidence as a binary response in an attempt to 
strike a balance between examining gray markets in a 
meaningful manner and minimizing the potential for nonre- 
sponse attributable to what could be perceived as excessive 
probing of a “hot-button” channel issue. In Study 2, we 


were able to replicate the findings of Study 1 using a con- 
tinuous measure of deterrence in an experimental context. 
We consider our studies a stimulus for more sophisticated 
attempts to capture the intricacies of channel violations and 
manufacturers’ subsequent responses. 

Consistent with a well-established body of channels 
research, we asked manufacturer informants to report their 
own behavior. In Study 2, we considered the dealer’s per- 
spective and found similar results. It would be useful for 
research to incorporate the simultaneous views of both 
manufacturers and dealers to understand this issue further. 


APPENDIX A 
Measures 





Enforcement Severity (Composite Rellabllity = .90; 
AVE = .69) 
We have a policy of full enforcement of our sales 


agreements. T9 
We are well known for our strict policy on product 

diversion. 75 
We have a tough stance on product diversion. .95 
We (would) take strict action against unauthorized 

sales. .85 


Enforcement Speed (Composite Rellability = .92; 
AVE z .73) 
Our response to violations is (would be) 
instantaneous. .88 
We (would) take immediate action against violations. .83 
Very little time (would) elapse between detection of 


violations and our response to them .87 
Our enforcement response process is (would bo) 
very timely. .85 


Detection Ability (Composite Rellabillty = .87; 
AVE z .63) 
At a given time, rt would be difficult to evaluate the 
extent of product diversion. (R) .82 
Determining compliance with resale restraints 
requires a great amount of effort on our part. (R) .56 
It would be difficult for us to evaluate the extent to 


which our product is diverted. (R) .95 
Our evaluation of the extent of unauthorlzed sales is 
based on very "fuzzy" information. (R) 78 
Price Differential (Composite Reliability = .80; 
AVE = .58) 
We maintaln a uniform pricing policy between 
markets. (R) .88 
In general, we try to keep price differences between 
markets to a minimum. (R) .80 
The price we charge for the product in each market 


varles considerably. 57 


Free-Riding Potential 
Unauthorized Free Hiding (Composite Rellability = .85; 
AVE - .65) 
Customers may learn about our product from our 
distributors and purchase it from unauthorized 
sources. .91 


Our distributors' presales services might stlmulate 
unauthorized sales of the product. .80 
Unauthorized distributors could benefrt from the 
market development efforts of our authorized 
distributors. .70 


Authorized Distributor Free Riding (Composite 
Reliability = .92; AVE = .80) 
Our authorized distributors’ sales efforts Increase 
sales of other authonzed distnbutors of this 
product. .84 
The services provided by one authorized 
distributor may help other authorized 
distributors’ sales. .92 
Authorized distributors could benefit from the 
market development efforts of other authorized 
distributors. .92 


Product Scarcity (Composite Rellabllity = .88; 
AVE = .64) 
We have trouble keeping up with demand for this 


product. .61 
Order fulfillment for this product is frequently 
delayed due to production constralnts. .82 


We frequently experience stock-outs of this product. .93 
We do not produce enough product to satisfy 
demand. .82 


Premium Positioning (Composite Reliability = .89; 
AVE - .67) 
Our brand name commands a significant price 


premium in this market. 74 
Our company is considered a market leader in this 
product category. .86 
We are able to leverage our strong brand name 
to a great extent In this market .87 
Our products have a strong reputation in this 
market .80 
Customer Heterogeneity (Composite RellabllIty = 
.78; AVE = .54) 
Customers in this market require very different 
service levels. .61 
Our customers have very dissimilar product 
preferences. .71 
Customers In this market differ considerably in 
their preference for service. .60 


Notes: All Items are measured on a flve-polnt Likert scale, anchored by "strongly agree" and “strongly disagree" (R) = reverse-scored items. 
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Appendix B 
Post Hoc Probing of Interactions 


Two-Way Interactlon Between SEV and DETECT 


The objective of this post hoc probing is to ascertain the 
impact of severity on incidence for different levels of detec- 
tion ability. Accordingly, we rearrange the terms of Equa- 
tion 2 to obtain the simple regression of INCID on SEV at 
plus and minus one standard deviation of DETECT, respec- 
tively (Aiken and West 1991, p. 13). This yields two simple 
slope coefficients, indicating the impact of SEV on INCID 
at each level of DETECT. 
From Table 3, Equation 2, 


INCID = —519 — .041SEV + .109DETECT - .055SEV 
x DETECT. 
If we rearrange the terms, 
INCID = (—.041 ~ .055DETECT)SEV + (—519 + .109DETECT). 
At DETECT, (DETECT = -4.1447), 
INCID = (-.041 — .055 x -4.1447)SEV 

+ (—519 + .109 x 4.1447), 
Or 

INCID = .187SEV - .971. 


We obtain the simple slope coefficients similarly for 
each two-way interaction (see Table 4). Using the variance- 
covariance matrix of the beta estimators, we then obtain the 
standard error and t-value for each simple slope. 


Three-Way Interaction Among SEV, SPEED, and 
DETECT 


We rearrange the terms of Equation 2 yet again, this time 
replacing SPEED and DETECT with values plus and minus 
one standard deviation to represent high and low values of 
SPEED and DETECT, respectively. 

From Table 2, Equation 2, 


INCID = —519 + —041SEV + .010SPEED + .109DETECT 

— .055SEV x DETECT - .009SEV x SPEED 

— .008SPEED x DETECT 

— .0O7SEV x SPEED x DETECT. 

If we rearrange the terms, 
INCID = (—041 — .OSSDETECT - .009SPEED 

— .007SPEED x DETECT)SEV 
+ (—519 + .010SPEED + 1095DETECT 
— .008SPEED x DETECT). 


We then obtain the simple slopes of SEV for the four 
combinations of SPEED and DETECT: 


At SPEEDy and DETECT, (SPEED = 4.3051, DETECT = 
4.1447), INCID = —355SEV — .167; 


At SPEED; and DETECT, (SPEED = —4.3051, DETECT = 
4.1447), INCID = —183SEV + .032; 
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At SPEED; and DETECT, (SPEED = 4.3051, DETECT = 
—4.1447), INCID = .351SEV — .785; and 


At SPEED; and DETECT, (SPEED = -4.3051, DETECT = 
—4.1447), INCID = .023SEV - 1.157. 


Using the variance-covariance matrix of the beta esti- 
mators, we obtain the standard error and t-value for each 
simple slope (see Table 4). 


Appendix C 
Study 2 Scenario and Experimental 
Manipulations 


Avon is a company that manufactures integrated chips (ICs) 
for use in laptops and personal computers. Avon dealers are 
expressly prohibited from selling their ICs outside their ter- 
ritory (called gray market activity). Nevertheless, one of 
Avon’s dealers recently sold a large order of Avon ICs 
(approximately $100,000 in sales) outside its authorized 
territory to get rid of excess inventory. Avon has the follow- 
ing procedures and policies related to gray market activities 
by its dealers: 


Detection Manipulation 


High: The dealer has a high probability of being caught. 
Avon is able to identify unauthorized sales over 90% 
of the time because of its sophisticated inventory 
management system. 

Low: The dealer has a low probability of being caught. 
Avon is able to identify unauthorized sales only 10% 
of the time because of an unsophisticated inventory 
management system. 


Severity Manipulation 


High: The penalties are significant. If Avon identifies an 
unauthorized sale, it levies a fine equal to twice the 
value of the sale (in this case $200,000). If caught, 
dealers lose a significant amount of money on the 
gray market transaction. 

Low: The penalties are not very significant. If it identifies 
an unauthorized sale, it levies a fine equal to 1596 of 
the contract's value. If caught, dealers still make a 
small profit on the gray market transaction. 


Speed Manipulation 


Fast: Avon's inventory management system is able to detect 
gray market activities immediately. After a violation is 
identified, the dealer is notified that [it is] being fined 
within 24 hours. Consequently, dealers are notified 
and punished before they make any additional gray 
market sales. 

Slow: Avon requires a dealer audit before it can confirm a 
suspected gray market activity, so contract violations 
are detected six months after the fact. After a viola- 
tion is identified, the dealer is notified that [it is] 
being fined within 24 hours. Consequently, many 
dealers are notified and punished after they have 
made additional gray market sales. 


eee 
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Velitchka D. Kaltcheva & Barton A. Weitz 


When Should a Retailer Create an 
Exciting Store Environment? 


Addressing inconsistent results in the literature, the conceptual framework that the authors develop proposes that 
the consumers motivational orientation moderates the effect of the arousal produced by a store environment on 
the pleasantness of the environment. Two experiments support the framework. When consumers have a recrea- 
tional motivational orientation, high arousal has a positive effect on pleasantness, but when consumers have a 
task-oriented motivational orientation, high arousal decreases pleasantness. In addition, high arousal increases 
consumer intentions to visit and make purchases in the store for recreationally oriented consumers, but it has a 
negative impact on shopping behavior for task-oriented consumers. Pleasantness mediates the effect of arousal 
on shopping behavior. 





ecognizing the significant impact of store environ- tant situational variable—namely, the consumer’s motiva- 
ment on consumer behavior, retailers devote consid- tional orientation in the store—moderates the effect of 
erable resources to store design and merchandise arousal on pleasantness. 
presentation activities. For example, Toys “R” Us spent $35 The next section briefly reviews research that investi- 
million to make its Times Square New York store “the ulti- gates the impact of store environment on shopping behav- 
mate toy store that is the personification of every kid’s ior. This review highlights some inconsistent and, in some 


dream” (Prior 2001, p. 46). Some retailers (e.g., The cases, contradictory findings, suggesting the need to con- 
Sharper Image, Bass Pro Shops, Barnes & Noble) attempt sider moderator variables. Then, we propose a theoretical 


to create engaging environments in their stores that encour- framework that focuses on the moderating effect of motiva- 
age shopping. In contrast, other retail chains (e.g., Costco, tional orientation and resolves some of the inconsistent 
Home Depot) are successful with their minimalist findings. We describe two laboratory experiments that test 
warehouse-style shopping environments (Levy and Weitz the theoretical framework. The article concludes with a dis- 
2004). With respect to electronic retailing contexts, it has cussion of the limitations of our research and its managerial 


been suggested that the most effective and desirable Web implications. 

sites are those that are challenging to consumers and 

heighten their arousal level (Novak, Hoffman, and Yung 

2000; Tedesco 2000). However, research has actually found Research on Environment and 
no relationship, and even a negative relationship, between Shopping Behavior 

the use of arousal-inducing Web design elements and the 


pleasantness of the Internet experience (Da Cunha, Mehrabian and Russell's Conceptualization of the 


Chakravarti, and Weitz 2003). Environmental Impact on Behavior 

Although considerable research has examined the Mehrabian and Russell’s (1974) seminal conceptualization 
impact of various store environment elements on shopping is the basis of most research on the impact of environmental 
behavior, the existing research does not provide an explana- factors on shopping behavior. Mehrabian and Russell pro- 
tion for the differential effectiveness of arousal levels in pose that environment sensory variables, the environment’s 
store environments or managerially relevant guidelines for information rate (the amount of information in the environ- 
selecting the appropriate arousal level for a specific store ment), and individual differences in affective response 
environment. The objective of this article is to present and influence people’s affective responses to the environment, 
provide empirical support for a theoretical framework that which in turn induce people to approach or avoid the envi- 
outlines the impact of arousal-inducing features in a store ronment. Mehrabian and Russell identify three dimensions 
environment on consumer affective responses to the envi- of affective response: pleasantness, arousal, and dominance. 
ronment and subsequent shopping behaviors in the environ- However, subsequent research has found that pleasantness 
ment. This theoretical framework proposes that an impor- and arousal explain most of the variance in affect and 


behavior, and thus research attention has been focused on 
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TABLE 1 
Empirical Research on the Impact of Pleasantness and Arousal on Shopping Behavior 


Shopping 
Behaviors Pleasantness Arousal 
Approach P: Baker, Levy, and Grewal (1992) P: Baker, Levy, and Grewal (1992)c 
shopping P: Donovan and Rossiter (1982) P: Donovan and Rossiter (1982)8 
behavior P: Hui and Bateson (1991) O: Donovan and Rossiter (1982)b 
O: Spies, Hesse, and Loesch (1997) O: Sweeney and Wyber (2002)¢ 
P: Sweeney and Wyber (2002) 
Purchasing P: Sherman, Mathur, and Smith (1997) N: Milliman (1982)c 
Intentions/ O: Sples, Hesse, and Loesch (1997) P: Sherman, Mathur, and Smith (1997) 
spending O: Smith and Curnow (1966)¢ 
Store visit P: Donovan and Rossiter (1982) P: Donovan and Rossiter (1982)a 
duration O: Sherman, Mathur, and Smith (1997) O: Donovan and Rossiter (1982)> 
P: Spies, Hesse, and Loesch (1997) P: Sherman, Mathur, and Smith (1997)c 
N: Smith and Curnow (1966)¢ 
Affillation P: Donovan and Rossiter (1982) P: Donovan and Rossiter (1982)8 
(social P: Dubé, Chebat, and Morin (1995) O: Donovan and Rossiter (1982)b 
Interaction) P: Sweeney and Wyber (2002) P: Dubé, Chebat, and Morin (1995)c 
P: Sweeney and Wyber (2002)c 
Unplanned P: Donovan and Rossiter (1982) O: Donovan and Rossiter (1982)a, b 
spending P: Donovan et al. (1994) O: Donovan et al. (1994)a 
P: Sples, Hesse, and Loesch (1997) N: Donovan et al. (1994)b 
Unplanned P: Donovan et al. (1994) O: Donovan et al. (1994)a. b 
extension of 
the store visit 
Satisfaction/ P: Sherman, Mathur, and Smith (1997) O: Sherman, Mathur, and Smith (1997)c 
attitude P: Spies, Hesse, and Loesch (1997) O: Wirtz and Bateson (1999)c 
P: Wirtz and Bateson (1999) P: Yalch and Spangenberg (2000)¢ 
P: Yalch and Spangenberg (2000) 
P: Yoo, Park, and Macinnis (1998) 





aPleasant environments only. 
bUnpleasant environments only. 
cThe level of pleasantness in the environment is not controlled for. 


Notes: Approach shopping behavior includes the other six variables, which are forms of approach. P = positive relationship, O = no relationship, 


and N = negative relationship. 


unplanned spending, duration of the store visit, and social 
interaction. However, arousal effects are less consistent. For 
example, Sherman, Mathur, and Smith (1997) report that 
arousal increases purchasing intentions and spending, Milli- 
man (1982) finds the opposite effect, and Smith and 
Curnow (1966) find no effect. Sherman, Mathur, and Smith 
find that arousal increases store visit duration, but Smith 
and Curnow find the opposite. Donovan and Rossiter 
(1982) replicate Sherman, Mathur, and Smith’s results in 
pleasant retail environments only. Finally, in a study that 
involved different types of retail stores, Donovan and 
Rossiter find that arousal has a positive effect on most 
dependent variables in pleasant retail environments. How- 
ever, Donovan and colleagues (1994) fail to confirm this 
relationship in a study conducted only in discount stores. 
The overall pattern of results suggests that pleasantness 
has a consistently positive effect on shopping behavior, but 
the impact of arousal varies across studies. Thus, arousal 
effects may be moderated by a previously unidentified con- 
struct. In the next section, we propose a theoretical frame- 
work that resolves a number of inconsistencies in the 
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research on arousal effects on shopping behavior, providing 
some managerially relevant insights into the appropriate 
design of store environments and retail Web sites. 


Theoretical Framework 


Our theoretical framework appears in Figure 1. The frame- 
work proposes that environmental characteristics affect 
consumer arousal, which in turn affects pleasantness and 
(through pleasantness) consumer shopping behaviors. Con- 
sumer motivational orientation moderates the relationship 
between arousal and pleasantness. When consumers have a 
recreational motivational orientation, arousal has a positive 
effect on pleasantness. Conversely, when consumers have a 
task-oriented motivational orientation, arousal has a nega- 
tive effect on pleasantness. 

The relationship between various environmental charac- 
teristics and arousal is well documented in the literature. 
For example, there is evidence that warm colors (Kueller 
and Mikellides 1993; Valdez and Mehrabian 1994), fast 
music tempo (Holbrook and Gardner 1993; Kellaris and 


FIGURE 1 
Theoretical Framework 
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Kent 1993), and complex environments (Berlyne 1960; 
Mehrabian and Russell 1974) increase arousal. The previ- 
ously discussed studies in the marketing literature provide 
significant support for the proposed relationship between 
pleasantness and shopping behavior. Although we examine 
the relationship between pleasantness and shopping behav- 
ior in this research, the novel element in our framework 
(shown in the shaded area of Figure 1) is the proposed rela- 
tionship among motivational orientation, arousal, and pleas- 
antness. Thus, the theoretical discussion and tbe two exper- 
iments we describe subsequently focus on this element of 
the framework. 


Constructs 


In this section, we define the three constructs of primary 
interest: motivational orientation, arousal, and pleasantness. 
Then, in the next section, we develop a proposition about 
the relationship among these three constructs. 


Motivational orientation. Various shopping motives 
were identified in the early literature (Stone 1954; Tauber 
1972; for a review, see Westbrook and Black 1985). How- 
ever, subsequent research found that two fundamental moti- 
vational orientations underlie the different shopping 
motives. À summary of the empirical studies over the past 
20 years that investigate the two underlying shopping moti- 
vational orientations appears in Table 2. 

The first motivational orientation (e.g., economic, utili- 
tarian) involves consumers engaging in shopping out of 
necessity to obtain needed products, services, or informa- 
tion with little or no inherent satisfaction derived from the 
shopping activity itself. We refer to this motivational orien- 
tation as the “task-oriented motivational orientation." The 
other motivational orientation (e.g., recreational, hedonic) 
describes consumers engaging in shopping to derive inher- 
ent satisfaction from the shopping activity itself. In this 
case, the shopping activity is freely chosen, and there is no 
need to engage in it. We refer to this motivational orienta- 
tion as the “recreational motivational orientation.” The two 
fundamental motivational orientations identified in the 
shopping-behavior literature are consistent with the funda- 


mental motivational orientations identified in psychology 
(Apter 1982; Deci and Ryan 1985). 


Arousal. The arousal construct is used to designate two 
somewhat distinct phenomena in psychology. “Objective 
arousal" is defined as the release of energy collected in the 
tissues (Duffy 1962), a variable representing the level of 
nonspecific physiological response (Cacioppo, Berntson, 
and Crites 1996). "Subjective arousal" is defined as the sub- 
jective experience of energy mobilization (Russell and 
Feldman Barrett 1999). Objective and subjective arousal are 
moderately correlated (Thayer 1970). Mehrabian and Rus- 
sell's (1974) framework involves subjective arousal. Mehra- 
bian and Russell define arousal as an affective dimension 
ranging from sleep to frantic excitement and use a self- 
report scale to measure it. Our framework also involves sub- 
jective arousal because it is the subjective state that con- 
sumers can experience as pleasant or unpleasant. 


Pleasantness. "Pleasantness" is defined as the hedonic 
valence (pleasant or unpleasant) of the affective response to 
a stimulus; it is based on the extent to which the stimulus 
(the object of the affective response) enables people to 
achieve their salient goals. Stimuli that facilitate goal 
achievement are experienced as pleasant, whereas stimuli 
that impede goal achievement are experienced as unpleasant 
(for a review, see Clore, Schwarz, and Conway 1994). 


Motivational Orlentation as Moderator 


Because task-oriented consumers derive satisfaction from 
the outcome of the shopping activity (the acquisition of the 
needed product, service, or information) rather than from 
the activity itself, their focus is on efficiently completing 
the shopping activity and obtaining its outcome with mini- 
mum expense of energy. Such consumers would find that 
the high energy demands in high-arousal environments 
require more effort to complete the shopping activity and 
therefore would find such environments to be unpleasant. In 
contrast, the recreational motivational orientation involves 
pursuing shopping to derive intrinsic rewards from the 
shopping activity itself. Such consumers desire rich experi- 
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TABLE 2 
Task-Oriented and Recreational Shopping Motivational Orientations: Conceptualizations in the Shopping- 
Behavior Literature 











Motivational Definitions 
Orlentation 
Article Labels Task Orlented Recreational 
Bellenger and Economic shoppers Economic shoppers: Consumers Recreational shoppers: 
Korgaonkar (1980) Recreational who have a high opportunity Consumers who have a low 
shoppers cost for shopping activities. opportunity cost for shopping 


Westbrook and Black 
(1985) 


Economic shoppers 
Social shoppers 


Product-oriented 
motives 


Experiential motives 


Dawson, Bloch, and 
Ridgway (1990) 


Babin, Darden, and Utilitarian shopping 
Griffin (1994) value 


Hedonic shopping 
value 


Reynolds and Beatty Shopping 
(1999) reluctance/apathy 
Shopping enjoyment 


Economic shoppers feel that 
shopping is not an enjoyable use 
of their time (p. 79). 


Economic shoppers: Consumers 
who engage in shopping 
primarily to acquire (a) needed 
product(s) (p. 85). 


Product-oriented shopping 
motives: Consumers visit stores 
to make needed purchases or 
acquire needed product 
information (p. 409). 


Utilitarian shopping value: 
Derived from task completion, 
the achievement of an intended 
outcome that the consumer is : 
pursuing out of necessity 

(pp. 645—46). 


Shopping reluctance/apathy: 
Satisfaction is not derived from 
the shopping activity Itself 


(p. 511). 


activities Recreational shoppers 
feel that shopping is an 
enjoyable use of their time 


(p. 79). 


Social shoppers: Consumers 
who engage in shopping 
primarily to satisfy needs 
unrelated to the acquisition of 
needed products (p. 85). 


Experientia! shopping motives: 
Consumers visit stores to derive 
pleasure from the visit itself 

(p. 410). 


Hedonic shopping value: 
Derived from the spontaneous 
hedonic responses ellcited in the 
course of shopping activities 

. 645—46). 


(pp 


Shopping enjoyment: 
Satisfaction Is derived from the 
shopping actlvity itself (p. 511). 





ences from shopping and therefore would find the high 
energy demands in high-arousal environments to be pleas- 
ant; conversely, they would view environments that call for 
Jess energy mobilization as unpleasant. 

The following scenario illustrates this phenomenon: À 
businesswoman is likely to have a task-oriented motiva- 
tional orientation when she feels that, while on a business 
trip to New York, she must find a gift to take to her assis- 
tant's son. Because the gift-buying activity is not inherently 
satisfying, she will try to complete the task as efficiently as 
possible. She will find the high-arousal environment (e.g., 
loud music, bright lights, action displays) in the Times 
Square Toys “R” Us store to be unpleasant because it will 
require her to mobilize more energy to complete her shop- 
ping task. In contrast, if the businesswoman were interested 
in some recreational shopping before departing from New 
York City, she would seek to derive some intrinsic satisfac- 
tion from the shopping activity itself. As a result, she would 
find the high-arousal environment at the Times Square Toys 
"^R" Us to be pleasant because it would enhance her shop- 
ping experience. On the basis of the preceding discussion, 
we propose the following: 
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P, Motivational orientation moderates the effect of arousal on 
pleasantness: Arousal (a) decreases pleasantness for con- 
sumers with a task-oriented motivational orientation and 
(b) increases pleasantness for consumers with a recrea- 
tional motivational orientation. 


The proposed moderating effect of motivational orienta- 
tion resolves some of the inconsistent findings in the 
shopping-behavior literature we reviewed previously. For 
example, Milliman (1982) finds that arousal decreases pur- 
chasing intentions and spending in supermarkets. In con- 
trast, Sherman, Mathur, and Smith (1997) find that arousal 
has a positive effect on purchasing intentions and spending 
in mall-based fashion stores. Few consumers find grocery 
shopping intrinsically satisfying (Levy and Weitz 2004). 
Thus, we can assume that consumers shopping for groceries 
are likely to have a task-oriented motivational orientation, 
which explains Milliman's results. Conversely, consumers 
often visit mall-based fashion stores as a form of recreation. 
The assumption that the consumers in Sherman, Mathur, 
and Smith's study were predominantly recreationally ori- 
ented would explain their results. 


Empirical Research 


Procedure 


We used the same procedure in both Study 1 and Study 2. 
The participants were undergraduate students from a state 
university in the southeastern United States who volun- 
teered to participate in the studies in return for extra credit 
in introductory marketing classes. Participants were first 
induced to adopt either a recreational motivational orienta- 
tion or a task-oriented motivational orientation. Then, par- 
ticipants visited a high-arousal or a low-arousal shopping 
environment that was similar to shopping over the Internet. 
Next, participants indicated the pleasantness of their experi- 
ence in the environment and their shopping-behavior inten- 
tions in relation to the environment. The studies concluded 
with manipulation checks. 


Measures 


All studies used the same measures to assess the dependent 
variables (pleasantness and shopping-behavior intentions) 
and the effectiveness of the arousal and motivation manipu- 
lations. (The complete scales, with descriptive statistics, 
appear in the Appendix.) We assessed pleasantness with 
four items based on Mehrabian and Russell's (1974) pleas- 
antness scale. We could not use the entire scale, because 
three of the anchor adjectives (hopeful, despairing, and 
melancholic) signify intense emotions that are not typically 
felt in retail contexts, and one of the items (relaxed versus 
bored) implies recreational motivation. Mehrabian and Rus- 
sell’s arousal scale was used for the arousal manipulation 
check. We developed a four-item scale for the motivation 
manipulation check. A six-item scale based on the work of 
Donovan and Rossiter (1982) was used to assess partici- 
pants' shopping-behavior intentions. 

We assessed the psychometric properties of the mea- 
sures with confirmatory factor analysis. To ensure an ade- 
quate sample size for the confirmatory factor analysis, all 
observations from Studies 1 and 2 were combined into a 
single sample, and the confirmatory factor analysis was 
conducted on the combined sample. The error variances of 
the three negatively worded behavioral intention items were 
allowed to correlate because of shared method. Although 
the chi-square test was statistically significant (x2(164) = 
495.28, p « .01), the comparative fit index (.95), the 
Tucker-Lewis index (.94), and the root mean square error 
of approximation (.08) indicated an acceptable fit (Bagozzi 
and Baumgartner 1994; Bagozzi and Yi 1988; Browne and 
Cudeck 1993). The scale reliabilities had acceptable psy- 
chometric properties: The coefficient alpha values were 
between .81 and .90, and the composite reliabilities were 
between .50 and .66 (Bagozzi and Baumgartner 1994). 

We used two different methods to assess discriminant 
validity (Anderson and Gerbing 1988; Fornell and Larcker 
1981). According to Fornell and Larcker (1981), discrimi- 
nant validity between two factors is established when the 
squared correlation between them is less than the mean item 
communality for each one. All six squared interfactor corre- 
lations were less than the mean item communality of either 
factor, demonstrating that the four measures have discrimi- 


nant validity. Anderson and Gerbing’s (1988) method 
involves chi-square difference tests. Two models are esti- 
mated for each pair of two factors. One of the models is a 
one-factor model; all 1tems are set to load on a single factor. 
In the other model, each item is set to load on its respective 
factor. A chi-square difference test is then performed to 
determine whether the two-factor model explains signifi- 
cantly more variance than the one-factor model. Discrimi- 
nant validity between the two factors is established if the 
two-factor model explains significantly more variance than 
the one-factor model. We performed six chi-square differ- 
ence tests. All six tests resulted in significant chi-square dif- 
ferences, in support of the discriminant validity of the 
measures. 


Study 1 


We conducted Study 1 as a 2 (motivational orientation: task 
oriented versus recreational) x 2 (arousal: high versus 
low) X 2 (replicates) between-subjects experiment. The 166 
participants in the study were asked to view a picture of a 
retail store ambience on individual computer screens and to 
imagine visiting the store with a task-oriented or recrea- 
tional motivation. 


Manipulation of Motivational Orientation 


Participants were randomly assigned to read one of the fol- 
lowing two scenarios about a hypothetical shopping 
experience: 


You are going on a camping trip this weekend, and you 
realize that you don't have enough suitable T-shirts for the 
trip. Às a result, you decide to purchase at least one more 
T-shirt. You drive to a store that sells T-shirts. All you 
want to do in the store is find one or more suitable T-shirts 
for your trip and leave. (Task-oriented motivation 
treatment) 


It's just past noon on a Saturday, and none of your friends 
are around. It's pouring rain, so you can't do anything out- 
doors. You find what's on TV too dull to watch. You feel 
very, very bored. You decide to visit some stores to relieve 
the sense of boredom. (Recreational motivation treatment) 


To increase the salience of the respective motivational ori- 
entation, participants were asked to write a story five or 
more sentences long about themselves in a similar situation. 


Manipulatlon of Arousal 


The arousal manipulation involved varying three visual ele- 
ments of the stimulus shopping environment: (1) complex- 
ity (the number of nonredundant elements in an environ- 
ment; Berlyne 1971), (2) color warmth (the warmth- 
coolness color continuum has been identified as red, yellow, 
green, and blue; Erwin et al. 1961; Jacobs and Hustmyer 
1974), and (3) color saturation (the subjective experience of 
a wavelength's spectral purity; Hogg 1969). Higher com- 
plexity (Berlyne 1960; Mehrabian and Russell 1974), 
warmer colors (Kueller and Mikellides 1993; Valdez and 
Mehrabian 1994), and higher color saturation (Valdez and 
Mehrabian 1994) all increase arousal. There were two repli- 
cates of the arousal manipulation: two high-arousal environ- 
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ments and two low-arousal environments. Participants were 
randomly assigned to view one of the four environments. 
All participants were exposed to the treatment environment 
for the same amount of time (i.e., 20 seconds). 


Manipulation Checks 


We conducted a 2 (motivational orientation) x 2 (arousal) x 
2 (replicates) analysis of variance (ANOVA) to test the 
effectiveness of the two manipulations. Motivational orien- 
tation, arousal, and the replicates, as well as all interactions 
among these three variables, were the independent varia- 
bles. The motivation and arousal manipulation check mea- 
sures were the dependent variables. The participants in the 
recreational motivation condition rated their motivational 
orientation as significantly more recreational than the par- 
ticipants in the task-oriented motivation condition 
(F(1, 158) = 70.56, p < .01; Mpc = 5.92 versus Mro = 
3.82). There were no other significant main or interaction 
effects on the motivation manipulation check (F(1, 158) < 
2.83, p > .09). The arousal manipulation had a significant 
main effect on the arousal manipulation check (F(1, 158) = 
30.86, p < .01; My = 5.58 versus Mj = 4.59). There were no 
other significant effects on the arousal manipulation check 
(F(1, 158) < 2.02, p > .15). 


Results 


Pleasantness. We conducted a 2 (motivational orienta- 
tion) x 2 (arousal) x 2 (replicates) ANOVA to test the 
hypothesized interaction effect between arousal and motiva- 
tion on pleasantness. We found that motivational orientation 
moderates the effect of arousal on pleasantness, producing a 
significant interaction between arousal and motivation 
(F(1, 158) z 28.08, p « .01). Neither motivational orienta- 
tion (F(1, 158) = 1.64, p > .20; Mpc = 6.17 versus Mro = 
5.86) nor arousal (F(1, 158) = .40, p > .20; Mj. = 6.09 ver- 
sus My = 5.94) had a significant main effect on pleasant- 
ness. There were no other significant effects (F(1, 158) < 
1.07, p > .20). 

We conducted a 2 (arousal) x 2 (replicates) ANOVA 
separately in each motivation condition to test whether the 
significant interaction between arousal and motivation was 
a significant crossover effect. The participants in the task- 
oriented motivation condition found the low-arousal envi- 
ronment to be significantly more pleasant than the high- 
arousal environment (F(1, 80) = 21.47, p < .01; Mj, = 6.59 
versus My = 5.13). The participants in the recreational 
motivation condition found the high-arousal environment to 
be significantly more pleasant than the low-arousal environ- 
ment (F(1, 78) = 9.12, p < .01; My = 6.75 versus Mj. = 
5.60) (see Figure 2). There were no other significant effects 
in these analyses (F < 1.22, p > .20). 


Shopping behavior. We estimated the same ANOVA 
models to test whether arousal and motivation have an inter- 
active effect on shopping-behavior intentions. We found a 
significant interaction between arousal and motivation 
(F(1, 158) = 14.80, p « .01). Neither motivational orienta- 
tion (F(1, 158) = .18, p > .20; Mpc = 5.94 versus Mro = 
5.84) nor arousal (F(1, 158) = .28, p > .20; Mj, = 5.95 ver- 
sus My = 5.83) had a significant main effect on behavioral 
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FIGURE 2 
Pleasantness as a Function of Motivational 
Orlentation and Arousal 
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intentions. Arousal had a significant, negative effect on 
behavioral intentions in the task-oriented motivation condi- 
tion (F(1, 80) = 16.25, p < .01; Mj, = 6.36 versus My = 
5.32) and a significant, positive effect on behavioral inten- 
tions in the recreational motivation condition (F(1, 78) = 
3.81, p = .05; My = 6.34 versus M; = 5.55). There were no 
other significant effects in these analyses (F € 1.34, p » 
.20). 

To test whether pleasantness mediates the interactive 
effect between arousal and motivation on shopping- 
behavior intentions, we conducted the mediation analysis 
recommended by Baron and Kenny (1986). We coded the 
two levels of each independent variable (motivational orien- 
tation, arousal, and the replicates) as 1 or —1. The behav- 
ioral intentions variable was the dependent variable. We 
estimated two regression models: 

(1) Behavioral intentions = By + B, x motivational orientation 
+ B x arousal + B4 x replicates 
+ B4 x motivational orientation 
x arousal + Ds 
x motivational orientation 
X replicates + Bg x arousal 
x replicates + By 
x motivational orientation x arousal 


X replicates. 


(2) Behavioral intentions = By + B, x motivational orientation 
+ B2 x arousal + B4 x replicates 
+ B4 X motivational orientation 
X arousal + Bs 


x motivational orientation 


x replicates + Bc x arousal 

x replicates + By 

x motivational orientation x arousal 
x replicates + Bg x pleasantness. 


Evidence in support of mediation arises if D4 is statisti- 
cally significant in Model 1 but not statistically significant 
in Model 2. The results support the hypothesized mediating 
effect of pleasantness. The interaction between arousal and 
motivation was significant in. Model 1 (B4 = .29, t(158) = 
3.85, p < .01) but was not significant in Model 2 (B,4 = .05, 
t(157) = .74, p > .40). Pleasantness had a significant, posi- 
tive effect on behavioral intentions in Model 2 (Bg = .62, 
t(157) = 9.51, p « .01). 


Study 2 


The objectives of Study 2 were (1) to replicate the results 
we obtained in Study 1 with different and more realistic 
manipulations of arousal and motivational orientation and 
(2) to provide further insight into the processes underlying 
the relationship between arousal and pleasantness by col- 
lecting free-response data from participants. The study was 
a 2 (motivational orientation: task oriented versus recrea- 
tional) x 2 (arousal: high versus low) between-subjects 


experiment. 


Subjects and Procedure 


The 161 participants were told that a music retailer was 
planning to open a music store in town. Participants were 
asked to look at and evaluate a store concept as prospective 
customers. On individual computer screens, participants 
were shown an interior view of a music store that sells CDs. 
Before the music store was displayed on the screen, on- 
screen instructions informed participants that more detailed 
information could be obtained about the CDs shown in the 
store by positioning the mouse over the merchandise. A 
mouse click on a CD revealed a CD information window 
that displayed the composer's portrait, a brief description of 
the CD, and the CD's price. Participants could return to the 
store view window by pressing a button in the CD informa- 
tion window. While shopping in the store, participants heard 
music through earphones. So that participants would 
become familiar with the setting, they were asked to com- 
plete a trial task—namely, to open a product's information 
window in a trial store and find out the product's price. The 
trial store was different from the treatment store and was the 
same for all participants. 


Manipulation of motivational orientation. In the task- 
oriented motivation condition, participants were asked to 
imagine themselves in the situation described in the follow- 
ing Scenario: 

You are planning to meet some friends at a movie theater. 

Your mother asks you to stop at a certain nearby music 

store (before going to the movie theater) and check the 

prices of two classical music CDs. Your mother intends to 
purchase one of those two CDs as a gift for an elderly 
couple, but she hasn't made up her mind which CD to 
choose. Your mother gives you the composers’ names and 


album titles: Beethoven's 9 Symphonien and Chopin's The 
Piano Works. Your task is to go to the store, find the CDs, 
and remember their prices. 


Then participants were specifically instructed to locate 
the two CDs indicated in the scenario and uncover their 
prices. There was no time limit on the visit to tbe store; par- 
ticipants could leave the store at any time by pressing an 
"exit" button in the store view window. Participants were 
also informed that at the end of the session, they would be 
asked to identify the two CD prices in a multiple-choice 


" question. This element in the instructions was necessary so 


that participants would view identifying the CD prices as a 
task rather than as a recreational activity that may or may 
not be completed. However, participants were informed that 
there would be no penalty or other adverse consequences 
for those who failed to identify the two prices correctly. 

The participants in the recreational motivation condition 
were asked to imagine themselves in the situation described 
in the following scenario: 


It is spring break. You have to stay in town while all your 
friends have left town. It is a weekend. You are at home, 
and you feel very, very bored. It's pouring rain, so you 
can't do anything outdoors. You find what's on TV too 
dull to watch. So you decide to visit some stores to relieve 
the sense of boredom. 


Participants were specifically instructed to visit the 
music store “to relieve boredom (as when you feel very, 
very bored and look for diversion)" They were also 
informed that more detailed information could be obtained 
about the CDs shown in the store and that they could (but 
did not have to) access that information. Participants were 
also informed that there was no time limit on their visit to 
the store; they could leave the store at any time by pressing 
an "exit" button in the store view window. Notepaper was 
available to each participant in both motivation conditions. 


Manipulation of arousal. We used music tempo and vol- 
ume to manipulate arousal. The high-arousal treatment 
involved faster and louder music (the first 60 seconds of 
Tchaikovsky's “Capriccio Italien in A”), and the low- 
arousal treatment involved slower and softer music (the first 
60 seconds of Camille Saint-Saens’ “The Carnival of the 
Animals: The Swan"). Participants who remained in the 
store for more than a minute heard the same musical selec- 
tion from the beginning. 


Manipulation checks. We conducted a 2 (motivational 
orientation) x 2 (arousal) ANOVA to test the effectiveness 
of the two manipulations. Motivational orientation, arousal, 
and their interaction were the independent variables. The 
motivation and arousal manipulation check measures were 
the dependent variables. The participants in the recreational 
motivation condition rated their motivational orientation as 
significantly more recreational than the participants in the 
task-oriented motivation condition (F(1, 157) = 123.47, p « 
.01; Merc = 5.84 versus Mro = 3.58). There were no other 
significant main or interaction effects on the motivation 
manipulation check (F(1, 157) = .77, p > .20). The arousal 
manipulation had a main effect on the arousal manipulation 
check (F(1, 157) = 129.39, p < .01; My = 6.02 versus My = 
3.80). There were no other significant main or interaction 
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effects on the arousal manipulation check (F(1, 157) = 1.55, 
p > .20). 


Results 


Pleasantness. We conducted the same 2 (motivational 
orientation) x 2 (arousal) ANOVA to test the hypothesized 
moderating effect of motivational orientation on the rela- 
tionship between arousal and pleasantness. We found that 
motivational orientation significantly moderated the effect 
of arousal on pleasantness (F(1, 157) = 20.25, p < .01). 
Motivational orientation also had a main effect on pleasant- 
ness: The participants in the task-oriented motivation condi- 
tion indicated more pleasant affect than the participants in 
the recreational motivation condition (F(1, 157) = 9.44, p < 
.01; Mao = 6.13 versus Mpc = 5.28). Arousal had no sig- 
nificant main effect on pleasantness (F(1, 157) = .20, p > 
20; My = 5.77 versus My = 5.64). 

We examined the effect of arousal on pleasantness sepa- 
rately in each motivation condition to test for a significant 
crossover interaction. The participants in the task-oriented 
motivation condition found the low-arousal environment to 
be significantly more pleasant than the high-arousal envi- 
ronment (F(1, 79) = 13.53, p < .01; M; = 6.81 versus My = 
3.44). The participants in the recreational motivation condi- 
tion found the high-arousal environment to be significantly 
more pleasant than the low-arousal environment (F(1, 78) = 
7.47, p < .01; My = 5.84 versus Mj = 4.73) (see Figure 2). 


Free-response process measures. Participants were 
asked a free-response question about the store environment 


to examine their thought processes in the different condi- 
tions and to test our theoretical rationale. After experiencing 
the treatment store environment (the arousal manipulation), 
in their own words, participants answered the question, 
“Why did (didn’t) you like the store environment?” Two 
coders independently coded participants’ responses into cat- 
egories. Each coder independently coded a subset of 
responses to establish a set of relevant categories. Then, the 
two coders met and, through discussion, determined the set 
of categories to be used for classifying participants’ 
responses. Next, the coders used the developed categoriza- 
tion scheme and independently coded all participant 
responses. Coder agreement was calculated as the percent- 
age of responses that both coders independently classified 
into the same category. The agreement rate was 92%. Dis- 
agreements were resolved through discussion. On average, 
participants listed 1.81 reasons for their store experience. 
We used one reason per participant in the analysis—the rea- 
son indicated as the most important. If a participant listed 
more than one reason and did not explicitly indicate which 
reason was the most important one, we used the reason 
listed first. Table 3 shows the reasons listed in each 
arousal x motivation condition and the frequency with 
which each reason occurs in the condition. 

The free-response data are consistent with our explana- 
tion for the interactive effect between arousal and motiva- 
tion. Task-oriented participants liked the low-arousal store 
environment because it allowed them to complete their 
shopping efficiently, but they did not like the high-arousal 
environment because it created inefficiencies. Recreational 


TABLE 3 


Reasons for Participants’ Store Environment Experience 
—————————————ÉÓ——————— ÉL O O 


Experlence Reason 


Frequency 
n 96 


————————————————————————————————— f 
Task-Orlented Low-Arousal Condition (n = 39) (found to be pleasant in general) 


1. Efficlency (effortlessness, expediency) 
2. Boredom (limited opportunity to engage In activities) 
3 Demotivation (no motivation to engage in any activity) 


31 
3 T.T 
2 


Task-Oriented High-Arousal Condition (n = 41) (found to be unpleasant In general) 


1, Inefficiency (effortfulness, extra time needed) 
2 Individual difference (dispositional) reasons 


3. Rich experience (ample opportunity to engage in activities) 
4. Pleasantness (nonspecific reasons, such as "because It's pleasant") 


5. Efficiency (effortlessness, expediency) 


NO 

Oaa ha 
~y 
Q 


Recreational Low-Arousal Condition (n = 40) (found to be unpleasant in general) 


1. Boredom (limited opportunity to engage In activities) 
2. Efficiency (effortlessness, expediency) 
3. Individual difference (dispositional) reasons 


4. Pleasantness (nonspecific reasons, such as “because It’s pleasant’) 


M) M o 
~J 
Ci 


Recreational High-Arousal Condition (n = 40) (found to be pleasant In general) 


1. Rich experlence (ample opportunity to engage In activities) 
2. Pleasantness (nonspecific reasons, such as "because It's pleasant") 


3. Individual difference (dispositional) reasons 


4. Unpleasantness (nonspecific reasons, such as “because it's unpleasant") 


5. Inefflclency (effortfulness, extra time needed) 
Notes: Reasons listed only once in a condition are not shown. 
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participants liked the high-arousal store environment 
because it created rich experiences, but they disliked the 
low-arousal environment because it involved limited oppor- 
tunities to engage in activities. 


Shopping behavior. A 2 (motivational orientation) x 2 
(arousal) ANOVA was estimated to test whether arousal and 
motivational orientation affect shopping-behavior inten- 
tions. A significant interaction between arousal and motiva- 
tion was obtained (F(1, 157) = 6.10, p < .05). Neither moti- 
vational orientation (F(1, 157) = 1.03, p > .20; Mro = 5.42 
versus Mpc = 5.13) nor arousal (F(1, 157) = 1.17, p > .20; 
My = 5.43 versus M, = 5.11) had a significant main effect 
on behavioral intentions. We also examined the effect of 
arousal on behavioral intentions separately in each motiva- 
tion condition. Arousal had a significant, positive effect on 
behavioral intentions in the recreational motivation condi- 
tion (F(1, 78) = 6.20, p < .05; Mg = 5.65 versus Mj, = 4.61) 
but did not have a significant effect in the task-oriented 
motivation condition (F(1, 79) = .98, p > .20; M; = 5.63 
versus My = 5.22). 

To test for the hypothesized mediating effect of pleas- 
antness, we conducted the same type of mediation analysis 
as that in Study 1. The results were consistent with expecta- 
tions. The interaction between arousal and motivation was 
significant in the model that did not include pleasantness 
(B4 = .33, t(157) = 2.47, p < .05), but it was not significant 
in the model that included pleasantness (B4 = —.09, t(156) = 
—.83, p > .40). Pleasantness had a significant, positive effect 
on behavioral intentions in Model 2 (Bg = .73, t(156) = 
11.94, p « .01). Because arousal had no effect on behavioral 
intentions in the task-oriented motivation condition, we also 
conducted a mediation analysis only for the recreational 
motivation condition. Arousal had a significant, positive 
effect on behavioral intentions in the model that did not 
include pleasantness (B4 = .27, t(78) = 2.49, p « .05), but it 
had no effect in the model that included pleasantness (B4 = 
.07, t(77) = .85, p > .30). Pleasantness had a significant, 
positive effect on behavioral intentions in the latter model 
(Bg = .67, t(77) = 7.87, p < .01). 


General Discussion 


The two studies provide consistent support for the proposi- 
tion that motivational orientation moderates the effect of 
arousal on pleasantness. In both studies, there is a signifi- 
cant crossover interaction, in which both simple effects are 
significant. The generalizability of the results is strength- 
ened by the use of different manipulations of arousal and 
motivational orientation in the two studies. As we proposed, 
task-oriented consumers view shopping primarily as a 
means for obtaining a needed outcome (product, service, or 
information) and wish to complete their shopping as effi- 
ciently as possible; therefore, task-oriented consumers find 
high-arousal retail environments to be unpleasant. Con- 
versely, recreational consumers derive inherent satisfaction 
from the shopping activity itself and therefore like high- 
arousal retail environments that create rich shopping experi- 
ences. The free-response data we collected in Study 2 sup- 
ports our explanation for the interactive effect between 
arousal and motivational orientation. 


In addition, the two studies provide evidence that is con- 
sistent with the results pertaining to shopping behavior 
obtained in the marketing literature (Table 1). Arousal and 
motivational orientation have an interactive effect on shop- 
ping behavior, and this interactive effect is mediated by 
pleasantness. Note, however, that the interactive effect 
between arousal and motivation was not a crossover effect 
in Study 2; arousal had no significant effect on shopping- 
behavior intentions in the task-oriented condition in that 
study. 


Limitations 


The two studies involve participants engaging in a simu- 
lated shopping experience. Although Study 2 creates an 
experience similar to the experience that consumers would 
have when shopping on the Internet, the studies do not cap- 
ture many elements of the experience that consumers have 
when shopping in stores. Thus, there is a need to provide 
more information about the generalizability of the results 
by carrying out either controlled experiments or correla- 
tional studies in stores. 


Managerial Implications 


This research indicates that the level of excitement retailers 
should create in their stores depends on the shopping moti- 
vation of their customers. Retailers can infer the motiva- 
tional orientation of their current and prospective customers 
from the retail offering (e.g., school supplies versus books, 
dental care versus cruises), the day of the week and the time 
of day (e.g., a Monday morning versus a Saturday after- 
noon), the time of the year (a nonholiday versus a holiday 
season), and the retailer's location (e.g., a business district 
versus a theme park). After a retailer determines the domi- 
nant motivational orientation of its customers, our frame- 
work suggests that the retailer should create a high-arousal 
environment for recreational consumers and a low-arousal 
environment for task-oriented consumers. 

Retailers can use different elements of the store environ- 
ment to influence customer arousal levels (e.g., music, 
color, complexity of the store layout and the merchandise 
presentation). For example, if a grocery store chain finds 
out that its customers are predominantly task oriented, it 
should use store designs with less-saturated cooler colors 
(e.g., light blues) and even associate such colors with its 
brand name. It should not use intense orange or red colors. 
The stores should have a simple layout and merchandise 
presentation. In contrast, the customers of a sporting-goods 
retailer may be predominantly recreationally oriented. This 
retailer should design its stores with a more complex layout 
and merchandise presentation and regularly reorganize the 
store and the merchandize presentation. The retailer should 
also use warm and highly saturated colors throughout its 
stores and ın 1ts brand name. 

When customers' dominant motivational orientation 
varies in time, the retailer faces a more challenging task in 
designing the store environment. The elements of the store 
environment that are more difficult to modify frequently 
(e.g., the store's layout and wall color) should be designed 
to induce moderate arousal (i.e., intermediate levels of com- 
plexity, color warmth, and color saturation). More variable 
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elements, such as the background music, can be used to 
increase arousal at the times when customers are predomi- 
nantly recreationally oriented and to decrease arousal when 
customers are predominantly task oriented. 

Sometimes customers’ dominant motivational orienta- 
tion varies across the different store departments. In such 
cases, retailers are advised to design each department in 
such a way as to complement the dominant motivational 
orientation of the customers shopping there. For example, a 
consumer electronics retailer may create a low-arousal envi- 
ronment in the accessories area to accommodate customers 
(who typically have a task-oriented motivational orientation 
when shopping for accessories) but create a high-arousal 
environment in the home-entertainment centers, which are 
typically visited by recreationally oriented consumers. 

Although our research and review of the literature focus 
on environmental effects in retail stores, the framework can 
be applied to. consumer experiences with other marketing 
stimuli (e.g., packages and labels, advertising messages, 
Web sites). For example, according to our framework, 
Amazon.com should serve more complex, high-arousal 
Web sites to customers who indicate that they have come to 
the Web site to browse, but it should offer simpler, low- 
arousal Web sites to customers trying to find a specific 
book. 


Conclusion 


Our theoretical framework identifies and examines a situa- 
tional variable, motivational orientation, that moderates the 
relationship between arousal and pleasantness. By identify- 
ing the antecedents that determine the pleasantness of 
people’s experiences while engaging in shopping and con- 
sumption, the framework offers an explanation for impor- 
tant aspects of consumer behavior and furnishes guidelines 
for effectively managing retail and consumption 
environments. 


Appendix 
Measurement Scales 


Dependent Variables 

Pleasantness (scored on a nine-point semantic differen- 
tial scale) 

(Study 1) In this store, I would feel: 

(Study 2) Throughout my visit to the concept music 
store, I felt: ; 


*Displeased versus pleased; satisfied versus dissatisfied 
(reversed); pleasant versus unpleasant (reversed); unhappy 
versus happy. 


(Study 1: M = 6.05, SD = 1.70, a = .856; Study 2: M = 
5.70, SD = 1.88, a. = .912; Studies 1 and 2: M = 5.88, SD = 
1.80, & = .887) 

Behavioral Intentions (scored on a nine-point Likert 
scale, anchored by “strongly disagree” and “strongly 
agree”) 

What would you generally do if this store opens in this 
town or your hometown? 


*[ would enjoy shopping in this store. 

ef would avoid ever having to return to this store. (reversed) 

eI would want to avoid looking around or exploring this envi- 
ronment, (reversed) 

*This is a place where I might try to avoid other people, and 
avoid having to talk to them. (reversed) 

*[ would be willing to buy things at this store. 

eI would be willing to recommend this store to my friends. 


(Study 1: M = 5.94, SD = 1.60, œ = .884; Study 2: M = 
5.27, SD = 1.89, a = .907; Studies 1 and 2: M = 5.61, SD = 
1.77, & = .900) 


Manipulation Checks 


Arousal (scored on a nine-point semantic differential 
scale) 

(Study 1) In this store, I would feel: 

(Study 2) Throughout my visit to the concept music 
store, I felt: 


*Relaxed versus stimulated; excited versus calm (reversed); 
frenzied versus sluggish (reversed); dull versus jittery; wide 
awake versus sleepy (reversed); unaroused versus aroused. 


(Study 1: M = 5.11, SD = 123, & = .784; Study 2: M = 
4.92, SD = 1.66, a = .886; Studies 1 and 2: M = 5.02, SD = 
1.46, a = .848) 

Motivational Orientation (scored on a nine-point Likert 
scale, anchored by "strongly disagree" and "strongly 
agree") 

(Study 1) On this shopping occasion, I would primarily 
want: 

(Study 2) In the store, I primarily wanted: 

*To have fun; to get things done (reversed); to be task focused 

(reversed); to relieve boredom. 


(Study 1: M = 4.87, SD = 1.91, 1 2.850; Study 2: M = 
4.70, SD = 1.72, & = .771; Studies 1 and 2: M = 4.79, SD = 
1.82, œ = .808) 


— SS LL LAB 
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Wolfgang Ulaga & Andreas Eggert 


Value-Based Differentiation in 
Business Relationships: Gaining and 
Sustaining Key Supplier Status 


Many business customers today consolidate their supply bases and Implement preferred supplier programs. Con- 
sequently, vendors increasingly face the alternative of either gaining a key supplier status with their customers or 
being pushed into the role of a backup supplier. As product and price become less important differentiators, sup- 
pliers of routinely purchased products search for new ways to differentiate themselves in a buyer—seller relation- 
ship. This research investigates avenues for differentiation through value creation in business-to-business relation- 
ships. The results suggest that relationship benefits display a stronger potential for differentiation in key supplier 
relationships than cost considerations. The authors identify service support and personal interaction as core differ- 
entiators, followed by a suppliers know-how and its ability to improve a customer's time to market. Product quality 
and delivery performance, along with acquisition costs and operation costs, display a moderate potential to help a 


firm gain and maintain key supplier status. Finally, price shows the weakest potential for differentiation. 


ver the past decades, there has been a growing 

recognition among scholars and practitioners that 

collaborative buyer-seller relationships represent a 
source of competitive advantage. The marketing literature 
began to acknowledge the need to manage interfirm rela- 
tionships as strategic assets in the 1980s (Jackson 1985; 
Webster 1984). Since then, numerous empirical studies 
have investigated the benefits of close buyer-seller relation- 
ships and their potential to achieve superior results (Cannon 
and Homburg 2001; Ganesan 1994; Hewitt, Money, and 
Sharma 2002; Jap 1999; Kalwani and Narayandas 1995; 
Lyons, Krachenberg, and Henke 1990). 

Along with the recognition of the strategic importance 
of supplier relationships, firms have fundamentally changed 
the way they manage supplier portfolios, and customers 
have increasingly moved away from an adversarial relation- 
ship management style with many suppliers toward a logic 
of building long-term relationships with selected key sup- 
pliers (Kalwani and Narayandas 1995; Narayandas and 
Rangan 2004; Spekman 1988). Consequently, many firms 
have consolidated their supply bases. For example, at the 
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end of 2003, Sony Corporation announced Transformation 
60, a global program intended to boost operating profit mar- 
gins from 4% to 10% over a period of three years. In a 
strong move of reforms, the group planned, among other 
measures, to reduce the number of suppliers for compo- 
nents and raw material from 4700 to 1000. Over the same 
period, the number of registered parts was to be reduced 
from 840,000 in 2003 to 100,000 to achieve cost reductions 
and quality improvements. 

Such a profound shift in the management of buyer- 
seller relationships raises several challenges for customers 
and suppliers alike. From a customer perspective, firms 
need to differentiate among qualified suppliers and identify 
vendors that will form their consolidated supply base. This 
is particularly relevant for key components, for which 
single sourcing or multiple single sourcing tend to become 
the norm rather than the exception. Consequently, cus- 
tomers rely on performance evaluation tools and vendor 
development programs to strengthen their ties with pre- 
ferred suppliers. 

In turn, with growing supply base consolidation, many 
suppliers are challenged to move into a main supplier posi- 
tion and to defend this top spot against competitors. In 
short, vendors face the alternative of either gaining key sup- 
plier status with their customers or being pushed into the 
role of a backup supplier, capturing only a small share of a 
customer's business. This challenge is further reinforced by 
the trend toward product commoditization in many markets 
(Rangan and Bowman 1992). As product and price become 
less important differentiators, suppliers of routinely pur- 
chased products search for new ways to differentiate them- 
selves through improved customer interactions (Vanden- 
bosch and Dawar 2002). 

From an academic point of view, differentiation in busi- 
ness relationships can be researched from a value-based 
perspective. To be effective, differentiation must contribute 
to customer value by either providing benefits to the cus- 
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tomer or lowering a customer’s costs. The key role of cus- 
tomer value has been well established in the business mar- 
keting discipline over the past decade (Anderson, Jain, and 
Chintagunta 1993; Anderson and Narus 1998, 2004). Build- 
ing on this literature, scholars have advanced the under- 
standing of relationship value in recent years, both concep- 
tually and empirically (Anderson 1995; Hogan and 
Armstrong 2002; Krapfel, Salmond, and Spekman 1991; 
Walter et al. 2003; Wilson 1995; Wilson and Jantrania 
1995). However, a review of the emerging literature on rela- 
tionship value reveals several unresolved research issues. 

From a conceptual perspective, a sound understanding 
of the dimensions that drive value creation in manufacturer— 
supplier relationships is needed. Previous studies explored 
either individual drivers or subsets of value-creating dimen- 
sions in business relationships. For example, Cannon and 
Homburg (2001) investigate value creation through cost 
reduction in a manufacturer-supplier relationship. How- 
ever, a thorough investigation of value drivers that integrate 
the dimensions into a framework grounded in managerial 
practice 1s still missing. 

From a measurement perspective, guidelines need to be 
provided for modeling relationship value as an overall mea- 
sure (Parasuraman 1997). Previous research does not pro- 
vide any guidance as to how the different dimensions 
should be consolidated. Should a reflective or formative 
measurement approach be used (Diamantopoulos and Win- 
klhofer 2001)? Indeed, researchers have conceptualized 
value as a reflective construct without justifying their 
approach (Lapierre 2000). More research is needed in this 
area. 

Finally, from a managerial perspective, there needs to 
be a greater understanding of which value-creating dimen- 
sions of a business relationship represent a promising basis 
for differentiation. In other words, from a vendor perspec- 
tive, which value drivers are key when competing for main 
supplier status? Which dimensions show only a limited 
potential for differentiation and are to be considered a 
“must-have”? In turn, from a customer perspective, which 
dimensions help discriminate among competing vendors 
and further consolidate a supply base? 

To address these challenges, our article is structured as 
follows: Building on Cannon and Homburg’s (2001) cate- 
gorization of relationship costs, we develop an integrated 
framework of relationship value. Next, we present the 
results of our qualitative study aimed at identifying and 
describing key value drivers in each category of our frame- 
work. We then present the results of a nationwide cross- 
sectional survey among senior purchasing managers. 
Finally, we discuss the implications of our research for both 
academics and managers and conclude by outlining study 
limitations and an agenda for further research. 


Toward an Integrated Framework of 
Relationship Value 


Creating superior customer value is key to a company’s 
long-term survival and success (Slater 1997; Woodruff 
1997). In business markets in particular, customer value is 
regarded as the cornerstone of the marketing management 
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process (Anderson and Narus 2004). Yet despite its impor- 
tance, research on customer value in business markets is 
still in an early stage (Flint, Woodruff, and Gardial 2002). 
Although value assessment studies enjoy a long tradition in 
business marketing, they typically focus on the value of the 
physical product, neglecting relational dimensions of 
customer-perceived value (Dwyer and Tanner 1999). 

In recent years, researchers have adopted a relational 
approach that investigates customer value from a relation- 
ship marketing perspective (Payne and Holt 1999). It is 
clear that the value of a business relationship is a multi- 
dimensional concept that reaches beyond the price versus 
quality trade-off that is prevalent in consumer research 
(Gassenheimer, Houston, and Davis 1998). To date, how- 
ever, there is a lack of concurrence in the marketing litera- 
ture on the salient value dimensions of a business relation- 
ship (Woodall 2003). 

Recently, Cannon and Homburg (2001, p. 29) directed 
attention toward the costs incurred in business relationships, 
indicating that "one method for creating value is to reduce 
costs in commercial exchange." On the basis of a multidis- 
ciplinary literature review, they identified three sources of 
relationship costs: (1) direct costs, (2) acquisition costs, and 
(3) operations costs. They further suggested that a supplier's 
success in lowering a customer firm's cost in each of the 
three categories leads the customer to expand its business 
with the supplier. In discussing directions for further 
research, Cannon and Homburg recognized that “cost 
reduction is only one source of value in business relation- 
ships. A more comprehensive theory would consider costs 
and benefits beyond economic costs.” 

Following this line of reasoning, we suggest thinking of 
relationship value in terms of a 2 x 3 matrix that distin- 
guishes between two fundamental dimensions of value 
creation (i.e., benefits and costs) and three levels at which 
these drivers operate (i.e., the core offering, the sourcing 
process, and the customer firm’s internal operations). 
Rather than relying on a hypothetico-deductive approach, 
we defined the value drivers from a managerial perspective 
(see, e.g., Bendapudi and Leone 2002; Flint, Woodruff, and 
Gardial 2002). We conducted depth interviews with pur- 
chasing managers using a grounded theory approach 
(Glaser and Strauss 1967). We then used the results of this 
qualitative study as a basis for a cross-sectional survey to 
investigate the various value drivers’ potential of serving as 
key differentiators and to develop an overall measure of 
value creation in key supplier relationships. 


Qualitative Study 


Data Collection and Analyses 


Sampling procedure. We gathered data through depth 
interviews with purchasing managers in manufacturing 
companies located in the Midwest of the United States. Par- 
ticipants were contacted through the local chapter of the 
Institute of Supply Management (ISM) and through the 
alumni network of a midwestern university. The sampling 
process ceased when saturation was reached, which was 
indicated by information redundancy. Our final sample con- 


sisted of ten participants in nine manufacturing organiza- 
tions. This is consistent with sample sizes recommended by 
scholars for exploratory research purposes (McCracken 
1988, p. 17). 


Sample characteristics. Table 1 displays key sample 
characteristics. Our sample consisted of manufacturers in a 
variety of areas, such as aircraft landing systems, amplifiers 
and microphones, audiovisual video projection equipment, 
automobiles, household appliances, orthopedic implants, 
and vacuum pumps. Components considered by our study’s 
participants also varied significantly. For example, cus- 
tomers purchased aluminium wheel forgings, car seats, 
electronic components, motors, springs, and surgical instru- 
ments. The size of participating companies ranged from 
small and medium-sized manufacturers to multinational 
corporations, and the selected buyer-supplier relationships 
had been in place between 2 and 25 years. 


In developing our sample, we aimed to maximize diver- 
sity among participants to discover various potential value 
drivers and their underlying facets. At the same time, study 
participants and firms also needed to share several charac- 
teristics to allow for comparability. For example, because 
we relied on key informants, it was critical to select man- 
agers who were influential decision makers involved in 
selecting and monitoring supplier relationships. Conse- 
quently, we invited only senior-level managers to partici- 
pate in the study. In addition, key manufacturing character- 
istics of companies .needed to allow for comparability 
across firms. Thus, only companies in comparable indus- 
tries were contacted (Standard Industrial Classification 
codes 34-38). 


Interview guide. Our interview guide was composed of 
three sections. In the first section, we asked respondents to 
select a specific component they purchased from at least 





TABLE 1 
Qualitative Study Sample 
Participant Company Activity Component Purchased 
Name Background and Size and Supplier Context 
Denice Contract Admin. Supervisor, age 42, 22 Amplifiers Electronic components 
years in purchasing Sales: $75 million One main supplier, multiple other 
Employees: 450 suppliers 
Frank Manager, e-procurement, age 35, 12 years Automobiles Car interior system 
In industry, 4 years in purchasing Sales: $177 billion One system supplier, four alternative 
Employees: 347,700 suppliers 
Jack Director of Purchasing, age 45, 25 years in Aircraft landing systems Aluminium forgings 
production, quality control, and Sales: $9.7 billion One main suppller (9096), one backup 
purchasing Employees: 37,500 supplier (1096) 
Jeff Senior Purchasing Supervisor, age 38, 13 Automotive brakes Pins 
years In purchasing Sales: $6.2 billion One main supplier (95%), multiple 
| Employees: 21,750 other suppliers 
Jerry Purchasing Manager, age 50, 19 years In Video screens Springs 
planning and Inventory control, 9 years in Sales: $110 million One main supplier, multiple other 
purchasing Employees: 500 suppliers 
John Purchasing Manager, age 38, 3 years in Electric sensors Circult Boards 
production planning, 16 years In Sales: $458 million One main supplier (80%), one backup 
purchasing Employees. 5313 supplier (2096) 
Mary Purchasing Manager, age 45, 25 years In Vacuum pumps Electric motors 
quallty control and purchasing Sales: $82 million Two main suppliers, multiple other 
Employees: 400 suppliers 
Richard Director, Strategic Sourcing, age 40, 18 Orthopedic implants Surgical instruments 
years In purchasing Sales: $1.2 billion Two main suppliers, multiple other 
Employees: 3600 suppliers 
Scott Director Global Commodities, age 38, 6 Household appliances Electric motors 
years in engineering, 7 years in Sales: $11 billion Two main suppliers, multiple other 
purchasing Employees: 59,408 suppliers 
Shawn Project Leader, Global Sourcing, age 28, 4 Aircraft landing systems Aluminium forgings 


years in purchasing 


Sales: $9.7 billion 
Employees: 37,500 


One maln supplier (8096), one backup 
supplier (10%) 


Notes: Names are pseudonyms All participants are key decision makers in the purchasing departments of their firms. 
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two suppliers. We further invited participants to describe the 
product, how it was used in the manufacturing process, its 
relative importance in the final product, and the supply mar- 
ket. Customers needed to maintain a collaborative relation- 
ship with suppliers for the selected product. Therefore, we 
used characteristics described in the extant literature (Jap 
1999) to ask participants why they considered a particular 
relationship collaborative. The purpose of this initial stage 
was to ensure that respondents considered a specific use sit- 
uation and were prepared to compare alternative buyer- 
supplier relationships. 

In the second part, we asked respondents to describe 
how suppliers created value for their organizations and to 
provide examples from the specific supplier relationship 
under consideration. To facilitate the process, participants 
were asked to describe activities between the supplier and 
the manufacturer, which in turn allowed the interviewer to 
probe the different benefit and cost dimensions perceived in 
the relationship. Particular attention was given to the com- 
parison of each company’s main supplier and its second- 
best alternative supplier of the same component. In this sec- 
tion, we carefully phrased our questions in such a way that 
respondents’ testimonies were elicited in an unobtrusive and 
nondirective manner to avoid potential pitfalls of “active lis- 
tening” (McCracken 1988, p. 21). Thus, the objective in this 
stage was to facilitate the emergence of relationship value 
categories grounded in the managers’ own language rather 
than to capture merely the dimensions previously specified 
by the investigators. Finally, in the third part of the inter- 
view guide, we invited participants to describe themselves 
and their company background. 


Analysis .and interpretation. On average, interviews 
lasted one and a half to two hours. Each interview was 
audiotaped and transcribed verbatim. We used grounded 
theory coding (i.e., open, axial, and selective coding) to 
identify relationship value drivers and to describe their 
facets (see, e.g., Flint, Woodruff, and Gardial 2002). We 
assessed the trustworthiness of our findings by applying the 
techniques of triangulation, informant feedback, and repli- 
cation (Miles and Huberman 1994; Strauss and Corbin 
1998). To improve content validity further, we presented the 
study’s methodology and findings during a workshop with 
27 purchasing professionals at ISM. We asked participants 
to assess the descriptions of the core value drivers and to 
comment on how well they reflected their practice. Man- 


agers largely agreed on the meaning of the value drivers. 
Therefore, we made only a few changes in wording and 
illustrations after this step. 


Field Study Findings 


Key supplier relationships. Across the board, study par- 
ticipants mentioned that their companies had reduced their 
supply bases over time, as Jack illustrated: 


Over the last ten years, we have been streamlining our 
supply base. We used to have over 300 suppliers. We now 
have driven that down to 115 elite suppliers that do a great 
job for us. 


Along with the trend toward downsizing supplier port- 
folios, the remaining vendors captured larger shares of pur- 
chasing volumes. Consider the following comment from 


John: 


We have between 60 and 65 suppliers of production items, 
and 10 or 11 of those make up our key suppliers, that is, 
80% of the dollars we spent. For those suppliers, we have 
a specific supplier development program. 


Participants identified situations of sourcing key com- 
ponents from at least two firms, that is, the main supplier 
and its challenger. The following quotation from Richard 
illustrates this: 


We have about 60 suppliers of surgical instruments, and 
we spend $63 million on them. The number one supplier 
has $10 million of it, and the number two supplier has $7 
million. These are the two suppliers that stick out. It then 
drops dramatically. The other suppliers will get down to a 
couple of $100,000 each. 


After participants described their supply base and 
selected a specific supplier relationship, we moved on to 
explore how vendors gained and sustained key supplier sta- 
tus with a customer. As we show in Table 2, nine value dri- 
vers emerged from our depth interviews. We classified these 
drivers by value dimension (i.e., relationship benefits and 
costs) and value sources (i.e., value creation through the 
core offering, within the sourcing process, and at the level 
of the customer's operations). In the next section, we pre- 
sent these core value-creating dimensions. 


Value Creation Through the Core Offering 


We asked managers what was needed for a vendor to gain a 
key supplier status. Participants consistently described a set 


TABLE 2 
Value Drivers In Key Supplier Relationships 





Relationship Value Dimensions 








Sources of Value Creation Benefits Costs 
Core offering Product quality Direct costs 
Delivery performance 


Sourcing process 


Service support 


Acquisition costs 


Personal Interaction 


Customer operations 


Supplier know-how 


Operatlon costs 


Time to market 
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of core dimensions. They identified product quality and 
delivery performance as the two benefit dimensions at this 
level of value creation. The corresponding relationship cost 
dimension was direct product costs. Respondents explained 
that vendors needed to meet these must-haves. Therefore, 
we named this first level of value creation “the vendor’s 
core Offering.” Jerry illustrated this point: 


I think you have to look at their quality, delivery perfor- 
mance, and price, and then you need to evaluate all of 
that. If anyone of those is weak and you cannot get them 
to improve, then you have to change. All of those factors 
have to be in harmony.... I would say that is the ground 
zero. That is where they need to be in order to be 
considered. 


Product quality. Study participants invariably empha- 
sized performance and reliability in a supplier’s product 
offering. In addition, respondents mentioned the importance 
of meeting the customer’s technical specifications, as the 
following comment from John illustrates: 


For a circuit board, we look for a specific output. We want 
to make sure there are no cracks in the hybrids. None of 
the parts must be skewed on the circuit board. The part 
has to be dimensionally and electrically correct per a set 
of specifications. We gave the supplier a drawing that out- 
lines the physical dimensions, the distances between the 
pads, where the capacitors are supposed to be, [and] 
where the traces are, and then there are the electrical spec- 
ifications the supplier has to meet—that is, the output this 
circuit board has to produce. 


Respondents also mentioned the importance of deliver- 
ing consistent quality levels over time. Notably, a vendor's 
key supplier status may shelter the firm from short-term 
sanctions. For example, respondents mentioned that their 
firms worked with key suppliers to solve a vendor's quality 
problems through joint teams. Consider the following state- 
ment from Jeff: 


We work with a supplier that doesn't hit the [quality] tar- 
get.... We go in and find out what problems they have. Our 
Supplier Quality Group actually helps them to develop 
test procedures, to find the latest technology, [and] to help 
them fix issues on the line. Quality is a major component, 
but we do help them out in identifying where they can 
improve. 


Several participants in our study underscored the pres- 
sure for continuous improvements. For example, Scott illus- 
trated the need for cutting-edge product quality: 


One of the key elements of a supplier relationship is hav- 
ing the best-in-class quality for the components that they 
supply.... We expect suppliers to work with us and try to 
continuously drive quality. As you see more European and 
Asian products come into our country, the bar gets raised 
all the time, and you have to benchmark yourself against 
different competitors than those you had in the past. 


In speaking about quality's contribution to value crea- 
tion in a supplier relationship, participants indicated that 
superior product quality levels may indeed serve to some 
extent as a differentiator among suppliers. Consider the fol- 
lowing comment from Frank: 


They [market research firm] do a customer satisfaction 
survey every year on vehicles. The Japanese have domi- 


nated that, but we have made substantial improvements in 
the last four years One of the reasons is that we have been 
sourcing with higher-quality suppliers... When we are 
sourcing with [first-tier supplier], we can go through all of 
their manufacturing locations and see nothing but great 
marks for them. That is one way a supplier can charge a 
little more. It's because they have a proven track record to 
supply excellent parts. 


However, it also became apparent that it is increasingly 
difficult for suppliers to compete on quality only. As Frank 
said, 


Most of our suppliers are all delivering a quality product. 
Our quality group will work with suppliers to make sure 
that they are meeting our quality objectives. So then we 
have a bid list which the buyer can take. 


On the basis of these findings, we identify product qual- 
ity as the extent to which the supplier’s product meets cus- 
tomer specifications. Key quality aspects are performance, 
reliability, and consistency over time. Our study confirmed 
the potential for differentiation through superior quality. Yet 
its role appeared to be limited. 


Delivery performance. Purchasing managers identified 
three value-creating aspects in this area. First, suppliers cre- 
ate value by consistently meeting delivery schedules (on- 
time delivery). Frank talked about consequences of late 
deliveries in the car industry: 

The most important is that they are on time with the due 

date that we have on the purchase order. We try to stream- 

line how much inventory we have as much as possible. If 

the suppliers are not meeting their schedules in a timely 

fashion, that causes a big hiccup and may result in pre- 

mium freight to get parts here. 


According to the respondents, a second way to create 
value is to adjust to changes in delivery schedules due to 
spikes in demand or changes in the product mix (delivery 
flexibility). Managers valued suppliers’ responsiveness for 
emergency deliveries, as this quote from Jack illustrates: 


You will notice the suppliers that are your best friends. 
When you are down because someone didn’t count the 
parts right, you call them up.... A lot of suppliers will turn 
their shop around to keep us running. Those are the things 
that really stick in your mind. Supphers that go above and 
beyond what a typical supplier will do. 


As a third source of value creation in this area, partici- 
pants mentioned suppliers’ capacity to deliver the right parts 
consistently (accuracy of delivery). Minimizing missing or 
wrong parts in shipments saves time and effort for the 
customer. 

In summary, we identified delivery as a second value 
driver. Suppliers add value by consistently meeting delivery 
schedules (on-time delivery), by adjusting to delivery 
changes (flexibility), and through their capacity to deliver 
the right parts consistently (accuracy). 


Direct product costs. There was remarkable commonal- 
ity across respondents when they discussed the role of 
direct product costs. Participants identified the actual price 
charged by a supplier as the core relationship cost driver. 
Managers focused on “average,” "fair, or “reasonable” 
market price. As Jerry explained, 
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We don’t want any supplier that loses money, but we want 
them to be tough too. We want them to give us the best 
price as they can at the margin they can live with. They 
need to make money as well as we do. 


On further probing into the topic, participants explained 
that they were hesitant to overemphasize price considera- 
tions. For example, Scott said, 


There is always somebody cheaper, but it is one thing to 
say you can make a part at a lower cost, and it is another 
to say you can do both: Support our business needs and 
supply us at a lower cost. We are very hesitant to be over- 
aggressive. There are customers who wave these prices in 
front of suppliers that come from people who can’t make 
or produce. That is a dangerous game to get into with your 
supply base. 


In addition, participants’ firms face a strong pressure to 
reduce costs in their respective markets. Suppliers are 
expected to commit to annual price reductions within long- 
term agreements and to pass cost savings on to customers. 
In turn, respondents agree to higher order volumes to com- 
pensate for lower prices. As John commented, 


We tell our suppliers that we want 4% of cost reductions 
every year. That is pretty typical in the automotive indus- 
try. So, speaking of suppliers, I want a guy who is willing 
to continuously improve his operations. One who finds 
ways to lower his costs, so that some of that can be passed 
on to me. So, I can say to my management, “Hey, instead 
of paying $1.50 for this item, now I will start paying 
$1.40.” There 1s value that can be added that way. It 
allows us to offer our own customers a cost reduction. 


Similar to our findings in the area of product quality, a 
key supplier status may protect a vendor from competition, 
at least in the short run. Customers indicated that they pro- 
vided main suppliers with an opportunity to match competi- 
tion. In addition, they assisted key suppliers in driving down 
prices through joint cost reduction programs, an initiative 
not granted to backup suppliers. 

In summary, we learned from our interviews that man- 
agers focused primarily on price as the key relationship cost 
driver. Participants valued both the supplier’s capacity to 
offer a fair market price and its commitment to reduce 
prices continuously. In turn, key suppliers benefited from 
increased order volumes and an opportunity to match 
competition. 


Value Creation in the Sourcing Process 


Beyond the core offering, vendors create value in the sourc- 
ing process. Relationship benefit drivers identified at this 
level are the supplier’s service support and the personal 
interaction between both parties. The mirroring value- 
creating cost dimension is acquisition costs. 


Service support. The supplier’s capacity to provide 
value-added services was another common theme. The fol- 
lowing comment from John illustrates in general terms the 
role of services in a buyer-seller relationship: 


The other thing would be just general service. If we have 
a quality problem, are they available to answer that ques- 
tion? I talked about it specifically on the technical side, 
but it is actually bigger than that.... I will call it a general 
service category; are they there for us when we have prob- 
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lems that may or may not be related to the technical 
design? 


When probing further into the importance of services in 
value creation, three facets emerged from our depth inter- 
views in this area. First, respondents highly valued a sup- 
plier’s responsiveness, that is, the vendor’s willingness to 
address customers’ concerns in ongoing relationships. 
Shawn mentioned responsiveness as a key service element: 


I’d say one of the key drivers of differentiation has to do 
with responsiveness. For example, our products are very 
complex. We are talking about hundreds of pages of docu- 
mentation, and you are not going to get it mght the first 
time. We don’t expect you to, but what we do expect is a 
very concerted effort to get there. One of the main things 
that hurt us is the fact that a supplier is unresponsive to the 
problems we are having. So, at the end of the day, respon- 
siveness is a huge part of the equation. 


A second facet of service support was the supplier’s 
capacity to manage exchanges of information (information 
management). This subdimension had two ramifications: 
First, respondents mentioned that customers frequently 
changed their requirements. Whenever such changes 
occurred, suppliers were expected to follow through in a 
timely manner. Thus, speed of information represented an 
important aspect of information management. Second, our 
participants voiced their need to receive detailed informa- 
tion. The following quotation from our interview with Jeff 
echoes both speed and appropriateness of information: 


Details are important. If we get a request for a quote from 
[car manufacturer] and our supplier just gives us a number 
on a paper, it doesn’t help us out.... We get requests for 
changes from our customers all the time, and sometimes 
we need to get back to them within a few days. So speed 
is certainly a factor. 


The third component of service support is the outsourc- 
ing of activities to suppliers. Respondents cited examples of 
situations in which suppliers added value by taking respon- 
sibility for particular customer activities. In our interviews, 
we learned that key suppliers solidified their competitive 
position by performing tasks on behalf of their customers. 
Shawn talked about how a supplier performed routine test- 
ing for his firm: 

There are several things that are service related. One of 

them 1s testing and qualification. Take, for example, aero- 

space grade bolts—the types of things that hold wheels 
together on aircrafts. This supplier will come out and do 
testing for you. Because of the amount of tests we have, 
there is not enough capacity for testing [at our plant]. So 
they would do the testing for us, report the results, and 
work with us to make improvements on the parts, and that 
is a value-added service. 


Customers benefited from outsourcing in numerous 
ways. For example, by delivering integrated systems as 
opposed to single parts, suppliers contributed to consolidat- 
ing the supply base. Consider the following comments from 
Frank about outsourcing: 


Consolidation in the supply base is one way in which the 
suppliers are creating value. For the [car model], we 
looked at suppliers that could provide an entire interior, so 
this meant that they would be supplying the instrument 


panel, the floor consol, the overhead system, the door pan- 
els, the side wall trim, garnish, the rear shelf and even the 
carpeting.... Suppliers take on some of the subassembly 
operations. Their facilities are close to our assembly 
plants, which leads to large reductions in plant space. So 
we can either build more vehicles or reduce the time that 
it takes to build a vehicle. 


In summary, we identified service support as an impor- 
tant relationship value dimension. In addition to the sup- 
plier’s level of responsiveness, respondents mentioned that 
key suppliers create value through information management 
and outsourcing of activities. 


Personal interaction. Managers noted numerous exam- 
ples of value creation through personal interaction. They 
held the development of relationships at an individual level 
in high regard. Consider the following statement from 
Denice: 


I think the personal issues are important, at least for our 
company, because there is such a close interaction 
between their folks and ours. If you have people that for 
one reason or another do not get along, it strains the rela- 
tionship and is counterproductive. So I believe that the 
value of the interpersonal skills is quite high. 


Developing interpersonal ties improved problem solv- 
ing and communication and led to a better understanding of 
each partner's goals. These benefits were regarded as con- 
tributing to the growth of a relationship as a whole. As Jeff 
commented, 


When I first came to [company], the person who I took the 
position over [from] was not well liked by the supplier. 
They voiced their opinion as such. I changed the way we 
work with them. When dealing with these people, I Iook at 
them as more than just a tool. I look at them as a person. 
To me, it is very important. If you don't bave that, I don't 
think you can function well in the industry. I think it [the 
relationship] grew more after I came on board because of 
the relationship that I developed with them. I think they 
opened up and understood what they need[ed] to do to 
move the relationship ahead. 


Managers considered suppliers’ key contact personnel 
important facilitators for conducting business. For example, 
when comparing two suppliers of electrical motors, Mary 
mentioned that one company was definitely much easier to 
work with than the other: 


One company is much easier to do business with than the 
other, just because they give us their names and e-mail 
addresses. The quality people know the quality managers; 
they can call them up and talk to them. All of those lines 
are very open. The other company, well, you have to do all 
of those things through the salesperson. It is much more 
difficult to get the answers because now, my quality per- 
son is calling me asking a detailed question, and I have to 
call the salesperson and give him a detailed question, and 
he goes and talks to his people, who ask a question that 
neither one of us knows an answer to. 


Finally, respondents indicated that personal interactions 
should be developed at all levels of the organizations. In 
particular, strong involvement of the vendor's top manage- 
ment was viewed as an indicator of a good working rela- 
tionship. As Richard illustrated, 


One of the things that are of value is that you have the 
presidents of these companies show their face here. We 
like the comfort of having the president here and knowing 
what is going on.... I can call the president of a company 
and get him to shake the stick over there. It's the name, 
the face: “You can count on me if you need to use the sil- 
ver bullet to get something done.” I think that is very 
important. 


Although participants viewed personal interaction as an 
important asset, they also cautioned against its potential pit- 
falls. For example, Richard referred to his company's inter- 
nal rules of conduct with respect to handling gratuities: 


As a buyer, you really have to be careful of how you are 
perceived within the organization. If you are walking out 
of the front door every day to the supplier's car to get 
lunch, that is an issue. We don't mind the occasional, but 
for the most part, we have a very low tolerance level for it. 
The individuals that work for us are well-paid individuals. 
We don't need to take advantage of the other stuff. 


In summary, personal interaction emerged as an impor- 
tant value driver. Knowing the supplier's key contact per- 
sonnel, getting along well with the vendor's representatives, 
and involving a supplier's top management all contribute to 
high levels of personal interaction. 


Acquisition costs. Participants highly valued a supplier's 
willingness to take costs out of the sourcing process. Inven- 
tory management emerged as a first opportunity for cost 
reductions in this area. For example, Scott mentioned how 
having suppliers manage customers' inventories improved 
cash flows: 


Inventory management is of tremendous value to us 
because we are able to take costs off our books and rein- 
vest in other parts of our business to become more effi- 
cient. Suppliers that really work well with us move to a 
"pay-and-consumption" model. We work with them to 
reduce inventories by trying to get visibility throughout 
the whole supply chain. 


Order handling represented a second way to reduce 
acquisition costs. Participants' firms had standardized order 
processes and had downsized staff. For example, Jack men- 
tioned that he used to have 14 buyers in his purchasing 
department. Over a period of 20 years, he was down to 6 
buyers. Thus, participants relied more heavily on suppliers 
for order handling. As John commented, 


We've set up a Kanban system. So I never, ever issue a 
purchase order for the delivery of those circuit boards. 
The operator on the line knows that when he gets down to 
one box, he sends a notification to the supplier, "Hey, 
Mister Supplier, I am down to one box, send me two more 
boxes " So he does that, and I never see the order for that. 


Inspection costs represented a third vector of value crea- 
tion through reductions of acquisition costs. Customers 
reduced incoming inspections as suppliers consistently met 
expectations. John talked about how these measures trans- 
lated into cost savings: 


We track quality each month through our no-incoming- 
verification program. If a supplier's product meets our 
quality criteria so many months, we don't inspect the 
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product anymore here. As soon as it comes 1n the door, it 
goes to the assembly line. 


In summary, suppliers find multiple ways of adding 
value by taking costs out of the sourcing process. Drivers of 
cost reductions identified in this area were inventory costs, 
order-handling costs, and incoming-product inspection 
costs. 


Value Creation in Customer Operations 


Customer operations represent another domain of value 
creation. The benefit dimensions identified in this area are 
the supplier's specific know-how and its capacity to 
improve time to market. The corresponding cost dimension 
is operations costs. 


Supplier know-how. A vendor's know-how emerged as 
another common theme in our interviews. Participants pro- 
vided many examples of situations in which access to a sup- 
pler's technical expertise was viewed as highly valuable. 
Customer organizations benefited from their suppliers' 
know-how in several ways. First, participants mentioned 
that vendors' extant knowledge of supply markets provided 
them with an opportunity to present customers with new 
sourcing alternatives. The following comment from Denice 
echoes this idea: 


[This supplier] comes to our facility at least once a week 
and works very extensively with our design engineers. So 
if they have an application for a semiconductor, they give 
him the performance specifications and let him come back 
with one or two solutions. This reduces the investment we . 
have to make in terms of knowledge and experience of 
individual components. The semiconductor market 1s 
changing every single day, and to keep abreast of all new 
products in the market would be very expensive for us. So 
we rely very heavily on this supplier to bring us those 
products and give us awareness. It also means that we can 
have younger people on the engineering staff. They don't 
have the same background. 


Second, a supplier's thorough understanding of a cus- 
tomer's operations and a long-standing experience with a 
client's products both created opportunities for vendors to 
add value in the improvement of existing products. As 
Shawn explained, 


Suppliers that have a lot of experience with your products 
know how to make parts efficiently and effectively. A sup- 
plier that has done a specific part for years can turn 
around and make a change at a third of the cost of a new 
supplier. Their lead time is usually less also because they 
can do the setup in the dark, and they make suggestions.... 
The expertise in a relationship 1s amazing in terms of the 
value it creates. Their experience and knowledge really 
protect them from global competition. 


Finally, suppliers assist customers in the development of 
new products. Key suppliers are frequently involved early 
on in suggesting innovative solutions and taking costs out of 
a new product up front. In speaking about early involve- 
ment of key suppliers in the new product development 
process, Richard commented, 


The knowledge base that suppliers build up is one of the 
key characteristics that we look for in a supplier. For 
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example, we don't manufacture our own surgical instru- 
ments. So we don't have the knowledge base of instru- 
ment makers walking around our shop floor. If we design 
a new product, we rely on the expertise of the instrument 
makers. They need to come back to us with their experi- 
ence and say, "Your design can't be manufactured. It 
won't work, or it will cost you a lot of money.” We involve 
them early on in the process. They need to tell us where 
we need to put costs into the instrument and where it is 
not necessary. That goes back to the collaboration in our 
preproduction planning meeting. The supplier is there, our 
development engineer is there, and they can talk back and 
forth 


In summary, a supplier's know-how provides many 
opportunities to add value in a manufacturer-supplier rela- 
tionship. Key sources of value creation are the vendor's 
deep knowledge of the supply market, prior experience with 
customer operations and products, and an early involvement 
in new product development. We also learned that key sup- 
pliers benefit from their preferred status because the inter- 
actions between key supplier status and know-how are 
mutually reinforcing. Indeed, holding a specific expertise 
helps a supplier solidify its position. In turn, a stronger 
position enables a vendor to accumulate more experience 
with a client's products and gain better insights into a cus- 
tomer's operations than any other supplier. 


Time to market. In our interviews, time to market 
emerged as an important value driver in supplier relation- 
ships. For example, in talking about the orthopedics indus- 
try, Richard highly valued a supplier's contribution to 
reduced cycle times: 


In our industry, you've got to be quick. If you get your 
product out to the market first, you win. The number of 
small orthopedic implants manufacturers is growing, and 
the really small ones are taking little pieces of market 
share away from us. If you want to make the big dollars in 
the orthopedic world, you have to get your product out 
there first, and right now, one of our main goals is to look 
at how you reduce the time it takes from the idea genera- 
tion of a product to the time you get it up to market. In the 
past, it took us three years. The goal is to take 2596 out of 
that. 


Participants explained that their firms faced an increas- 
ing pressure to develop products faster. For example, Frank 
mentioned that his firm had reduced cycle times from 
approximately 40 months to fewer than 18 months over a 
period of several years. Respondents turned to suppliers in 
different areas to reduce cycle times. Speed of executing 
design work emerged as a first area, as Frank illustrated: 


Our global suppliers can get much faster turnaround in 
completing or changing designs. They take in a require- 
ment and send it to an offshore facility where they have 
much more of the day to work with it. If we decide at 3:00 
P.M. to do something, they can send it to a new facility, 
and by the time you come back the next morning, you 
already get a result. 


Participants also referred to the pressure of developing 
prototypes faster. By developing a prototype exactly to the 
customer's specifications the first time, suppliers improved 
cycle time considerably. Richard illustrated this idea: 


You have to have instrument suppliers that are quick at 
what they do. For example, last week, we delivered two 
models to an instrument supplier, and they turned around 
prototypes within four days. That is an extreme, but typi- 
cally, what we are looking for are lead times of 4 to 6 
weeks. Two years ago, it would have been 12 to 16 weeks. 


Finally, managers explained that suppliers performed 
testing and validation tasks faster than customers. In addi- 
tion, by avoiding unnecessary product retesting, they were 
able to accelerate cycle times significantly. Consider 
Frank’s statement: 


Suppliers add value through testing and validation. We 
have all kinds of validation requirements for our parts, 
whether it is bumper impact tests or sled tests for the inte- 
rior airbags. Suppliers take over more and more of our 
validation, and they are able to do ıt a lot faster. That helps 
us to improve our cycle times. Some of our validation 
equipment is used 24 hours, 7 days a week. By going to 
supplier facilities, we speed up the validation process and 
get cost savings faster. 


_ In summary, a supplier's ability to reduce time to mar- 
ket represented an important source of value creation in 
buyer-supplier relationships. Suppliers add value through 
accelerating design work, developing prototypes faster, and 
speeding up the testing and validation process. 


Operations costs. Participants discussed several direc- 
tions of value creation through cost reductions at the level 
of customer operations. One way to reduce operations costs 
was to take costs out of existing products. Frank provided 
the following example: 


We started a couple of years ago a formal supplier sugges- 
tion program. We are setting objectives to save a certain 
amount of money per year through submitting suggestions 
for how we can save money on a part. For example, a sup- 
plier will say, “We have to pierce five holes in this 
stamped part. We noticed the one hole isn't used anymore. 
Now, there is one less piercing operating we have to do. 
You can save that.” Once engineering has approved it, we 
implement the idea. Then, they get a percent of the cost 
savings, and we get a percent.... It goes a long way into 
making your company stand out as to being proactive in 
helping us save costs. Our “Suppliers of the Year" are 
meeting those objectives. So it just makes sense that you 
would want to give those suppliers more business. 


In addition to taking costs out of products, participants 
explained that valued suppliers typically contributed ideas 
of how to improve costs in the manufacturing process. For 
example, Scott discussed differences between two suppliers 
regarding costs associated with the transformation process 
from steel coil to stamped parts: 


We are buying steel from a certain supplier, and another 
supplier came in and offered us a lower price for that same 
type of steel. You have the cost to get it to the factory, 
which is the price of the steel and the freight cost, but you 
also have the 1mpact on the yield of the steel as 1t is con- 
verted in your factory, and that 1s all the way from the coil 
to a painted or stamped part. So we will evaluate that, and 
we will try to get an understanding of whether there is any 
difference in the cost to do that conversion. Really, we are 
talking total cost of ownership here. 


Finally, respondents mentioned tooling costs and war- 
ranty costs as another area of value creation through cost 
reductions in this field. As Frank mentioned, 


We’ il ask suppliers what they charge us for warrantee and 
tooling. If you have a supplier that has shown very good 
quality and also has fairly low warrantee and tooling costs 
with their components, that is considered an advantage 
over another supplier that typically does not do well on 
these [costs]. That is an operations type of costs. 


In summary, suppliers find multiple ways of adding 
value by taking costs out of customers' operations. Major 
opportunities for cost reductions in this area are product 
costs, manufacturing process costs, and tooling and war- 
ranty costs. 


Relationship Value Index Construction 


Our study provides valuable insights into how the different 
dimensions should be collapsed into an overall measure of 
relationship value. Whether value should be modeled as a 
reflective or formative measure is of particular importance. 
Diamantopoulos and Winklhofer (2001) argue that reflec- 
tive specifications of latent variables often mistakenly pre- 
vail in the marketing literature. In reflective specifications, 
higher-order constructs are assumed to cause their dimen- 
sions rather than be caused by them. Consequently, dimen- 
sions are viewed as strongly correlated and interchangeable 
facets of the focal construct (Bollen and Lennox 1991). In 
contrast, formative specifications view a higher-order con- 
struct as being caused by its dimensions. From a formative 
perspective, the higher-order construct is defined by its 
dimensions, which do not need to be highly correlated with 
one another. According to Diamantopoulos and Winklhofer, 
the choice between a formative and a reflective specifi- 
cation should be based primarily on theoretical 
considerations. 

In the past, researchers conceptualized relationship 
value as a reflective construct without justifying their 
approach (Lapierre 2000). However, the findings of our 
grounded theory study suggest a formative measurement 
approach. Indeed, the decision rules that Jarvis, Mackenzie, 
and Podsakoff (2003) developed for determining whether a 
construct is formative or reflective suggest the use of a 
composite latent construct model in our context. For exam- 
ple, in our study, causality flows from the measures to the 
construct. In addition, the value drivers do not need to be 
highly correlated with one another either within each of the 
value sources specified in our framework or between them 
(see Table 2). For example, regarding a supplier’s core 
offering, a specific supplier may score high on product 
quality but low on delivery performance. Similarly, a sup- 
plier may excel in the sourcing process, yet the same firm 
may not differentiate itself in the area of customer opera- 
tions. Indeed, study participants repeatedly described situa- 
tions in which value drivers did not correlate with one 
another. For example, Shawn talked about the relationship 
between suppliers’ quality and price levels: 

We can get really great pricing from places in Mexico and 

China, but when the product comes in and it has poor 

quality and they deliver late, that isn’t going to keep my 
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line running. Management really gets upset when you run 
out [of] a $.50 piece that you are trying to control inven- 
tory on and we can’t ship a $200,000 order. 


Similarly, Denice provided examples of situations in 
which suppliers performed high on the quality and delivery 
dimensions but scored low on personal interaction: 


A couple of years ago, we focused on the reduction of our 
supply base. We had a number of suppliers that were basi- 
cally on time and had no quality issues, but because of the 
fact that there were interpersonal problems, we made the 
decision to cut them off altogether as a supplier. That was 
primarily because of their interpersonal skills. Their cus- 
tomer service person was not helpful or responsive [and] 
was very slow or abrasive. That 1s a very important part of 
the relationship. 


Finally, Jack discussed a situation in which his firm 
maintained the relationship with a supplier mainly because 
of the vendor’s high-quality products. In turn, delivery, pric- 
ing, and personal interaction were considered sources of 
conflict: 


This supplier has a good quality product, but when you 
look at the purchasing side, delivery, and their pricing,... 
[shakes his head]. When the salesman comes in here, I 
don’t want to spend time with him, because he just gives 
me a "song and dance,” and I usually end up getting into 
an argument with him. So when he comes in, he doesn't 
see me; he just goes up to engineering and gets his part on 
the blueprint, so that we have to buy it... I try to build a 
better relationship. It hasn't been good for me, because I 
have been burnt too many times with those guys, and they 
don't seem to be changing their ways. 


Therefore, depth interviews revealed that value drivers 
may correlate, but they do not need to correlate. Indeed, 
respondents explained that they frequently made trade-offs 
between value dimensions. Consequently, our qualitative 
study's findings suggest that from a methodological stand- 
point, a formative measurement approach should be used 
rather than reflective measures when relationship value is 
modeled as a multidimensional construct. More precisely, 
using Jarvis, Mackenzie, and Podsakoff's (2003) classifica- 
tion of second-order factor models, a Type IV model (i.e., a 
formative first-order, formative second-order model) should 
be specified when value creation is modeled in buyer- 
supplier relationships. 

In summary, our qualitative study addressed the concep- 
tual issues raised in this research. We identified and 
described value dimensions in buyer—-seller relationships. In 
addition, we resolved the issue of how to integrate the 
dimensions into an overall index. Building on our findings, 
we define customer-perceived value in a key supplier rela- 
tionship as a formative higher-order construct that repre- 
sents the trade-off between the benefits and the costs per- 
ceived in the supplier's core offering, in the sourcing 
process, and at the level of a customer's operations, taking 
into consideration the available alternative supplier 
relationships. 


Quantitative Study 


We designed our quantitative study to address the method- 
ological and managerial issues raised in this research. To 
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this end, we conducted a nationwide survey among pur- 
chasing managers. This section describes both the method- 
ology and the major findings of this second step. 


Measure Development 


We took several steps to ensure the content validity of our 
measures (Churchill 1979; DeVellis 2003). First, on the 
basis of our qualitative study, we developed definitions of 
each relationship value driver, and we generated a pool of 
175 items that tap the various facets of each driver. We dis- 
cussed both definitions and items 1n a workshop with pur- 
chasing managers. We asked the managers to assist us in the 
validation of each definition, and we invited them to iden- 
tify the items that best captured the respective dimensions. 
As a result of this first step, we eliminated 77 items, leaving 
a reduced item pool of 98 items for nine value drivers. 

In a second step, we submitted the remaining items for 
an item-sorting task to 15 marketing academics identified as 
experts in the areas of industrial marketing, relationship 
marketing, and research methodology. We asked these par- 
ticipants to assign the individual items to what they believed 
to be the "correct" value drivers. As Anderson and Gerbing 
(1991, p. 734) propose, we computed two indexes for each 
item to identify items that were difficult to assign to the cor- 
responding facets. 


Survey Instrument 


Questionnaire development. The survey is composed of 
three parts. First, mirroring the approach we adopted in the 
qualitative study, we asked respondents to select a specific 
key component and to describe the final product for which 
it was sourced. We asked respondents to name their main 
supplier for the specific product and their second supplier in 
terms of purchasing volumes. The purpose of this initial 
stage was to ask respondents to consider a specific supplier 
and to prepare for a comparison of alternative buyer- 
supplier relationships. We asked participants to compare the 
main supplier with the second supplier for several reasons. 
First, depth interviews showed that managers typically 
compare these two alternatives when making value judg- 
ments. Second, respondents needed to use similar compari- 
son standards to allow for meaningful comparisons. Finally, 
from a managerial perspective, we were interested in under- 
standing how vendors achieve a main supplier position and 
defend it against their toughest competitor, that is, their 
challenger in terms of purchasing volume. 

The second part contained a list of items that tap the 
relationship value dimensions. We first asked respondents 
to answer several statements with respect to the relationship 
between their firm and the main supplier. Subsequently, we 
asked respondents to compare their main supplier with the 
second supplier. All items used seven-point rating scales 
(1 = "strongly disagree" and 7 = “strongly agree"). 

In the third part of the questionnaire, we invited partici- 
pants to respond to a set of questions that described them- 
selves, their company, and the supplier relationship. 
Because we relied on the perceptions of key informants, we 
needed to ensure that respondents were competent to report 
on the supplier relationship. Therefore, respondents needed 


to fulfill minimum requirements with respect to their posi- 
tion, their tenure with the company, and the length of the 
relationship. In addition, we asked respondents for a self- 
assessment of their ability to portray the supplier relation- 
ship accurately. Specifically, we asked them to indicate how 
confident they were in answering the questionnaire, how 
involved with and knowledgeable they were about the sup- 
plier, and to what extent they could influence purchasing 
decisions. 

Questionnaire pretest. We submitted the mailing cover 
letter, our directions for completing the questionnaire, and 
scale items to a pretest with four senior purchasing man- 
agers. Few items were slightly modified. The final list of 
value measures appears in the Appendix. 


Data Collection 


Sampling procedure. We collected data in a nationwide 
survey among purchasing managers of U.S. manufacturing 
firms. We used a three-wave mailing approach (Dillman 
1978). We randomly selected 1950 members of ISM from 
the association’s database. For an exploration sample, we 
contacted 527 managers, and for a validation sample, we 
contacted 1423 managers. We selected only senior-level 
managers, indicated by job titles such as Vice President Pro- 
curement, Director of Global Sourcing, Director of Supply 
Chain Management, or Purchasing Manager. In addition, 
we selected only manufacturing companies (Standard 
Industrial Classification codes 28—30 and 32-38). 

A total of 118 (response rate of 22.4%) questionnaires 
were returned for the exploration sample, and 303 (response 
rate of 21.3%) were returned for the validation sample. We 
dropped 18 respondents who reported an average level of 
confidence in their responses of four or lower on a seven- 
point scale. We discarded 3 additional responses because of 
excessive missing data, which left a final sample of 112 for 
the exploration sample and 288 for the validation sample. 


Sample characteristics. Respondents purchased a broad 
variety of components for multiple applications. Firms 
ranged from small enterprises to multibillion dollar compa- 
nies. On average, customers had been buying from their 
main supplier for 13 years, with a standard deviation of 9 
years. In our sample, on average, main suppliers captured 
73.3% of customers’ order volumes in a given product cate- 
gory. In contrast, second suppliers secured 19.596 of cus- 
tomers' needs. Finally, all other suppliers accounted for 
only 7.296 of orders. 


Respondent characteristics. Respondents held senior 
positions in their firms. They averaged 17 years of experi- 
ence in their area and 10.4 years with their companies.! 
Responses regarding confidence about answering the sur- 
vey and knowledge of the supplier relationship displayed 
mean ratings of 6.0 (confidence in answering the survey), 
6.01 (involvement in the supplier relationship), 6.15 
(knowledge about the supplier), and 5.89 (influence of pur- 
chase decisions) on a seven-point scale. 


INone of the background factors included in this research (i.e., 
company size, relationship length, and respondents' personal 
background and tenure with the company) moderated the results. 


Nonresponse bias. We assessed nonresponse bias fol- 
lowing Mentzer, Flint, and Hult's (2001) recommended 
guidelines. We contacted a random sample of 30 nonre- 
spondents over the telephone and asked them to answer our 
four questions that captured overall value perceptions in a 
supplier relationship (RELVAL 1—-RELVAL 4 in the Appen- 
dix). In addition, we asked nonrespondents to provide back- 
ground information on themselves and their company. The 
t-tests of group means revealed no significant difference 
between nonrespondents and our sample. Thus, nonre- 
sponse bias was not considered a problem in the present 
study. 


Measurement Model 


We report the results of our partial least squares (PLS) 
analysis in Figure 1 (for a detailed discussion of the PLS 
algorithm, see Lohmóller 1989). We chose PLS because of 
the formative nature of the higher-order value construct. 
This estimation procedure accommodates both reflective 
and formative measures (Fornell and Bookstein 1982). We 
measured relationship value on the basis of (1) the forma- 
tive value dimensions identified in the exploratory study 
and (2) four reflective items that captured relationship value 
as a unidimensional concept on a high level of abstraction. 

The formative and the reflective measurement 
approaches share 7396 of their variance, indicating predic- 
tive validity of the formative value dimensions. All param- 
eter estimates are significant at the 596 level. We cannot 
report an overall goodness-of-fit, because the objective of 
PLS is prediction versus fit (Fornell and Cha 1994). Partial 
least squares matches our research objective, that is, to 
assess the potential for differentiation of the formative value 
dimensions. Together with content validity established in the 
qualitative study by expert agreement, these results provide 
empirical evidence for construct validity (Rossiter 2002). 

We measured relationship costs on three formative 
dimensions: direct product costs, acquisition costs, and 
operations costs. In line with Cannon and Homburg (2001), 
we summated the respective cost items to obtain a score for 
each of these cost drivers. 

To operationalize relationship benefits, we developed a 
measurement model first. On the basis of our conceptual 
framework, we expected three dimensions to mirror Cannon 
and Homburg's (2001) conceptualization of relationship 
costs. To probe this conceptualization, we subjected the 
exploration sample to exploratory factor analysis. Three 
factors emerged on the basis of the Kaiser criterion (see 
Table 3). Product quality and delivery items formed the core 
benefits (24.496 variance explained). Personal and service 
benefit items loaded on a common factor that represented 
the benefits related to the sourcing process (28.4% variance 
explained). Know-how and time-to-market items loaded on 
a common factor that formed the operations benefits of a 
purchasing relationship (23.296 variance explained). 

On the basis of the validation sample, we assessed con- 
vergent and discriminant validity (Table 4). Factor loadings, 
t-values, average variance extracted (AVE), and Cronbach's 
alpha are indicative of a high level of convergent validity. In 
addition, we assessed discriminant validity with Fornell and 
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FIGURE 1 
Formative Higher-Order Measurement Model 





Relationship 
value 


Relationship 
benefits 


11 .68 .27 


Core Sourcing Operations 
benefits benefits benefits 


Notes: All parameter estimates are significant at the 5% level. 


Larcker’s (1981) criterion. Table 4 shows that the smallest 
AVE exceeds the squared correlation between each pair of 
the relationship value dimensions. This indicates a satisfac- 
tory level of discriminant validity. After we successfully 
established convergent and discriminant validity, we con- 
structed summated scales for the benefit dimensions. 
Together with the relationship cost scores, they served as 
formative indicators for the first-order relationship benefit 
and cost dimensions, respectively. 


Discussion and Implications 


Against the background of supply base consolidation, our 
research contributes to a better understanding of how ven- 
dors of routinely purchased products earn the coveted main 
supplier position and defend their status against competi- 
tors. Overall, our study’s findings suggest that a key sup- 
plier status indeed confers many substantial advantages to 
vendors. For example, key suppliers typically capture a sig- 
nificantly larger share of a customer’s business than do 
other suppliers. In our study, on average, main suppliers 
secured 73.3% of customers’ order volumes. In turn, second 
suppliers captured only 19.5% of customers’ needs. Our 
study further revealed that a vendor’s key supplier status 
may protect the firm from competition, at least in the short 
run. For example, customers will work with key suppliers to 
solve quality problems through joint teams. Similarly, main 
suppliers are granted an opportunity to adjust prices over 
time to match competition. Finally, key suppliers find them- 
selves in a better position to compete for new business 
because they are typically involved early on in new product 
development. 


130 / Journal of Marketing, January 2006 





R2 = .73 
—.17 
Relationship 
costs 
22 44 61 


Acquisition Operation 
costs i costs 


While the advantages of working with fewer suppliers 
are well documented in the purchasing and supply chain 
management literature, scholars also recognize its down- 
sides (Kekre, Murthi, and Srinivasan 1995; Monczka, Trent, 
and Handfield 2005; Newman 1989; Trevelen and 
Schweikhart 1988). Above all, buyers face increased depen- 
dence and, consequently, higher risks of supply disruption 
and supplier opportunism (Trevelen and Schweikhart 1988). 
Although suppliers gain greater volumes, improved consis- 
tency, and lower transaction costs, the cost of doing busi- 
ness with the customer may also increase (Newman 1989). 
Key suppliers are often required to provide internal cost 
data. Furthermore, they are permanently pressured to invest 
in the relationship. Finally, the main supplier position 
places a greater burden on the vendor to coordinate its own 
activities and subsuppliers (Monczka, Trent, and Handfield 
2005, p. 278). Against this background, vendors must be 
aware that “gaining a greater share of a customer’s business 
does a supplier very little good when that incremental busi- 
ness comes at the cost of reduced profitability” (Anderson 
and Narus 2003, p. 48). 

Our research raised and answered several conceptual, 
measurement, and managerial issues. First, from a concep- 
tual perspective, a thorough investigation of how value is 
created in a supplier relationship was necessary. Building 
on Cannon and Homburg’s (2001) classification of relation- 
ship costs, we suggested an integrated framework that dis- 
tinguishes between two fundamental value-creating dimen- 
sions and three levels at which these drivers operate. 
Following a grounded theory approach, we asked managers 
to identify and describe only those value drivers that allow a 
supplier to set itself apart from competition and gain key 


TABLE 3 


Exploratory Factor Analysis 
Factor 1 Factor 2 Factor 3 
(Sourcing (Core (Customer 
Indicator Process) X Offering) Operations) 
Product1 .84 
Product2 .83 
Product3 .80 
Product4 .86 
Product5 85 
Product6 85 
Service1 .66 
Service2 72 
Service3 10 
Service4 ‘TI 
Delivery1 .72 
Delivery2 .73 
Delivery3 75 
Know-how .64 
Know-how2 83 
Know-how3 70 
Know-how4 .80 
Know-how5 .87 
Time to market1 .74 
Time to market3 ./5 
Time to market4 .80 
Personal1 .81 
Personal2 .84 
Personal3 .83 
Personal4 .80 
Personal5 .83 
Personal6 13 
Personal7 5 
Variance 
extracted (%) 28.4 24.4 23.2 


supplier status. The results of our qualitative study suggest 
nine key differentiators in buyer-seller relationships: prod- 
uct quality, delivery performance, and direct product costs 
at the level of a supplier's core offering; service support, 
personal interaction, and acquisition costs 1n the sourcing 
process; and supplier know-how, time to market, and opera- 
tions costs at the level of the customer's operations. To the 
best of our knowledge, this is the first conceptualization of 
relationship value grounded in managerial practice. 

Second, from a measurement perspective, we developed 
items for each of the differentiators and consolidated them 
into an overall index of relationship value. The results of 
our cross-sectional study indicate that value creation in 
buyer-seller relationships can be not only conceptually 
described but also empirically measured. This is an impor- 
tant finding because scholars anticipated substantial diffi- 
culties in the empirical assessment of relationship value 
(Miller and Törrönen 2003; Parasuraman 1997; Wilson and 
Jantrania 1995). Because we cannot manage what we can- 
not measure, our research represents an essential step 
toward adding managerial relevance to this "problematic 
concept which cannot be ignored" (Wilson and Jantrania 
1995, p. 63). 

We measured relationship value in two complementary 
ways: (1) as a multidimensional construct and (2) as a uni- 

/ 


dimensional construct. To account for its conceptual 
breadth, empirical research should rely on formative multi- 
dimensional scales of relationship value. Reflective models 
of relationship value misconstrue the causal priority 
between the construct and its dimensions. However, they 
are appropriate if relationship value is considered a uni- 
dimensional construct on a high level of abstraction. Both 
measurement approaches share 73% of their variance. Con- 
sequently, researchers may choose between a multidimen- 
sional scale with multiple items and its unidimensional 
counterpart with only few items, depending on their 
research objectives. Relying on the multidimensional scale 
is key for research investigating the value-creating dimen- 
sions of a business relationship. For efficiency reasons, 
however, scholars might prefer the unidimensional scale 
when integrating value into a broader nomological network. 

Third, from a managerial perspective, our research pro- 
vides guidelines regarding each value driver's potential for 
differentiating among suppliers in a business relationship. 
To some extent, each value driver contributes to the expla- 
nation of observed variance in overall relationship value, 
and none of the differentiators should be neglected. How- 
ever, Table 5 shows that the potential for differentiation in a 
buyer-seller relationship varies considerably among the 
various value drivers. 

Our results suggest that relationship benefits display a 
stronger potential for differentiation than do cost considera- 
tions. Whereas relationship costs account for little more 
than 2096 of the variance, relationship benefits explain 
nearly four times as much. This is a noteworthy finding. 
Conventional wisdom posits that customers emphasize cost 
considerations in business markets. Traditionally, the pur- 
chasing literature devotes particular attention to cost fac- 
tors. For example, Anderson, Thomson, and Wynstra (2000, 
p. 310) argue that "[p]urchasing managers are more knowl- 
edgeable about using price and price changes as a basis for 
selecting product offerings than value and value changes." 
In addition, they recommend (p. 325) that “whenever possi- 
ble, suppliers should make resource allocation trade-offs in 
favor of internal cost reductions, which allow the supplier to 
offer lower prices (or to postpone price increases), rather 
than performance enhancements." 

Our findings provide a different picture, illustrating that 
vendor selection criteria are highly context dependent. 
Exploring value creation in ongoing relationships and con- 
ducting value assessments of physical products follow two 
different, yet complementary, logics. In the current study, 
we focused on key supplier relationships as units of analy- 
sis. From this perspective, relationship benefits and costs 
take on two very different roles. Whereas cost factors serve 
as key criteria to get a supplier on the short list of those 
vendors considered for business, relationship benefits dom- 
inate when deciding which supplier to name first among a 
set of available suppliers. Following this line of reasoning, 
cost competitiveness emerges as a necessary but not suffi- 
cient condition to gain key supplier status. In turn, offering 
superior benefits to the customer is essential for winning a 
substantial share of a customer's business. 

In our examination of individual value drivers, we iden- 
tified service support and personal interaction as core dif- 
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TABLE 4 
Convergent and Discriminant Validity 





A: Convergent Validity 


Factor Indicator Loading t-Value AVE a 


Core offering Product .83 32.75 75.096 .96 
Product2 .91 70.72 
Product3 .86 44.60 
Product4 .86 33.03 
Product5 .90 63.07 
Product6 .91 63.79 
Delivery1 .79 24.77 
Delivery2 .86 49.61 
Delivery3 87 51.78 
Sourcing process Service1 83 32.26 73.896 .96 
Service2 a3 30.38 
Service3 84 35.93 
Service4 85 38.61 
Personalt 88 46.76 
Personal2 90 69.03 
Personal3 88 48.51 
Personal4 .90 53.32 
Personal5 .88 52.64 
Personal6 84 38.32 
Personal7 82 26.55 
Customer operations Know-how1 75 23.53 72.6% .97 
Know-howe2 86 37.08 
Know-how3 84 47.61 
Know-how4 87 56.21 
Know-how5 88 41.05 
Time to market1 .86 44.91 
Time to market3 .87 51.38 
Time to market4 .87 53.71 


B: Discriminant Validity 


Core Offering Sourcing Process Customer Operations 
Core offering 75 
Sourcing process 55° .74 
Customer operations .38 .63 73 


Notes: Bold numbers on the diagonal show the AVE. Numbers below the diagonal represent the squared correlations. 


TABLE 5 
Value Drivers’ Potentlal for Differentiation in Key Supplier Relationships 





Relationship Value Dimensions 


Sources of Value Creation Benefits Costs 
Core offering Core benefits: Direct costs: 
11 x .77 = .08 .22 x .17 = .03 
" Sourcing process Sourcing benefits: Acquisition costs: 
.68 x .77 2.52 .44 x .17 = .07 
Customer operations Operations benefits: Operations costs: 
.27 x .77 - .21 .61 x .17 2.10 


Notes: Bold numbers show the variance explalned by the formative value dimensions. We derive them by multiplying the path coefflclents of the 
second-order dimensions by the coefficlents of the respective first-order dimensions in Figure 1. Overall, benefits account for 8% + 52% 


+ 21% = 81% of explained variance. In turn, costs account for 3% + 7% + 10% = 20% of explained vanance. Benefits and costs do not 
add up to 10096 because of rounding errors 
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ferentiators. In the current study, these sourcing benefits 
offer the strongest potential for differentiation, explaining 
more than half of the observed variance (52%). They are 
followed by operation benefits. A supplier’s know-how and 
its ability to improve a customer’s time to market explain 
21% of the observed variance. Operation costs (10% of 
observed variance), core benefits (i.e., product quality and 
delivery performance; 8% of observed variance), and acqui- 
sition costs (7% of observed variance) display a moderate 
potential for helping a firm gain and maintain key supplier 
status. Finally, direct product costs (ie., price) show the 
weakest potential for differentiation in a key supplier rela- 
tionship (3% of observed variance). These findings suggest 
that the core product and its price become less important 
differentiators in customer-supplier relationships. Offering 
value through personal interaction and service, access to 
know-how, and increased time to market has become 
important in securing a key supplier position. 

However, note that managerial practice may well differ 
from our findings at the level of individual firms. For exam- 
ple, Noordewier, John, and Nevin (1990) suggest that repet- 
itively used items and capital goods greatly differ in terms 
of acquisition and possession costs.? Indeed, repetitively 
used items involve significant order-handling and inventory 
management costs. In such a context, suppliers create value 
for customers by taking cost out of the sourcing process. In 
turn, capital goods require a thorough understanding of how 
the equipment helps customers reduce operation costs 
throughout the product's life cycle. In such an environment 
(i.e., in which the interpurchase intervals are much longer), 
managers base their decisions on payback periods and dis- 
counted cash flow analyses (Nordewier, John, and Nevin 
1990). Nevertheless, our results represent a point of depar- 
ture for managerial decision making. Because of the cross- 
sectional nature of the quantitative study, our findings pro- 
vide a best-practice profile for companies striving for key 
supplier status. Our sample consists of winners only; that is, 
our analysis is based on data gathered from suppliers that 
successfully made their way to the top of the supply base. 
Consequently, our results deliver an informative benchmark 
for vendors of routinely purchased products that want to 
gain or maintain key supplier status. 

On a final note, our study also provides interesting 
insights for secondary suppliers. By adopting a "Trojan 
borse" approach, a secondary supplier may break down an 
existing market offering into individua] components and 
focus on a single product or service (Rust, Lemon, and 
Narayandas 2005, p. 313). Having established a foothold in 
a customer account, the supplier then can progressively 
expands its share. À secondary supplier may also develop a 
unique expertise and provide selected customers with a bet- 
ter solution at a lower cost. The supplier then can offer this 
capability for a reduced fee in exchange for an incremental 
share of a customer's business (Anderson and Narus 2003). 
Finally, a backup supplier can take advantage of the grow- 
ing trend toward cross-sourcing among buying organiza- 
tions (Newman 1989). Cross-sourcing not only helps cus- 
tomers mitigate supply risk but also opens a window of 


2We thank an anonymous reviewer for this insight. 


opportunity for backup suppliers to provide customers with 
the same product without needing to endure the burdens 
placed on the shoulders of the main supplier. 


Limitations and Future Research 
Directions 

As in any research project, the choices we made in this 
study 1mply limitations in the interpretation of our results. 
By focusing on understanding how a firm can differentiate 
itself from competition, we limited our assessment to a spe- 
cific unit of analysis. Rather than showing how value is cre- 
ated in buyer-seller relationships overall, our findings refer 
to a particular context and research question. 

First, we asked respondents to compare their main sup- 
plier with a second supplier for routinely purchased prod- 
ucts. Such a focus excludes the examination of other sup- 
plier relations (e.g., value creation by "me-too" suppliers, 
sole supplier relationships). Similarly, our research did not 
investigate the sourcing of other product categories, such as 
capital goods. 

Second, data for our research are based on the cus- 
tomer's perspective of value creation in key supplier rela- 
tionships. À vendor's point of view might be different. 
Additional research could examine potential gaps between 
both parties' value perceptions. 

Third, our research relied only on single respondents. 
Other members of the buying organization might emphasize 
different value drivers in a key supplier relationship. 
Because we cannot exclude potential biases, further 
research could address this limitation usefully. 

Fourth, the choices we made in developing our sample 
frame warrant some caution in the generalization of our 
findings. We sampled respondents among members of the 
ISM. This approach might favor the inclusion of better- 
educated purchasing professionals and larger buying 
organizations. | 

Fifth, this research focused on dyadic business 
exchange relationships. However, buyer-seller relationships 
are embedded in complex networks of relationships (Achrol 
1997; Anderson, Håkansson, and Johanson 1994; Ford 
1990). Thus, further research could examine value creation 
in the broader network surrounding a customer and its key 
supplier. 

Sixth, although we paid careful attention in the assess- 
ment of content and predictive validity, we did not raise the 
issue of nomological validity. Integrating value into the 
nomological network of relationship marketing remains a 
promising opportunity for further research (Hogan and 
Armstrong 2002, p. 19). 

Finally, scholars have called for more longitudinal 
research on buyer-seller relationships (Narayandas and 
Rangan 2004). More must be learned about how value is 
created over time in close supplier-customer relationships. 
Further research using a longitudinal design could address 
this issue. 

These limitations should be kept in mind when consid- 
ering our results and their implications. Nevertheless, our 
findings provide new insights for academics and practition- 
ers alike. 
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APPENDIX 
Operational Measures 


M SD 
Product Support 
Product1 Compared to the second supplier, the main supplier provides us with better product 
quality. 4.55 1.59 
Product2 Compared to the second suppller, the main supplier meets our quality standards 
better. 4.62 1.60 
Product3 Compared to the second suppller, the main suppliers products are more rellable. 4.47 1.59 
Product4 Compared to the second supplier, we reject less products from the main supplier. 4.60 1.69 
Product5 Compared to the second supplier, the main supplier provides us with more consistent 
product quality over time. 4.69 1.60 
Product6 Compared to the second supplier, we have less variations In product quality with the 
main supplier. 4.53 1.68 
Service Support 
Service1 Compared to the second supplier, the main supplier provides us with better services. 4.92 1.52 
Service2 Compared to the second supplier, the main supplier is more available when we need 
information. 4.63 1.54 
Service3 Compared to the second suppller, the maln supplier provides us with more 
appropriate Information. 4.83 1:65 
Service4 Compared to the second suppller, the main supplier responds faster when we need 
Information. 4.74 1.50 
Dellvery 
Delivery Compared to the second supplier, the main supplier performs better in meeting 
delivery due dates. 4.83 1.65 
Dellvery2 Compared to the second supplier, we have less delivery errors with the main 
supplier. 4.74 1.68 
Delivery3 Compared to the second supplier, deliveries from the main supplier are more 
accurate (no missing or wrong parts). 4.77 1.69 
Suppller Know-How 
Know-how1 Compared to the second supplier, the main supplier provides us a better access to 
his know-how. 4.68 1.59 
Know-how2 Compared to the second supplier, the main supplier knows better how to improve our 
existing products. 4.73 1.55 
Know-how3 Compared to the second supplier, the main SPON performs better at presenting us 
with new products. 4.81 1.52 
Know-how4 Compared to the second supplier, the main supplier knows better how to help us 
drive innovation in our products. 4.77 1.69 
Know-how5 Compared to the second supplier, the main supplier knows better how to assist us In 
new product development. 4.37 1.48 
Time to Market 
Time to Compared to the second supplier, the main supplier performs better In helping us 4.48 1.50 
market Improve our time to market 
Time to Compared to the second supplier, the main supplier helps us more in improving our 4.55 1.57 
market2 cycle time. 
Time to Compared to the second supplier, the main supplier helps us more In getting our 4.40 1.58 
market3 products to market faster. 
Time to Compared to the second supplier, the main supplier performs better in helping us 4.61 1.55 
market4 speed up product development. 
Personal Interaction 
Personal1 Compared to the second supplier, it is easier to work with the main supplier. 4.89 1.61 
Personal2 . Compared to the second supplier, we have a better working relationship with the main 
suppller. 5.07 1.52 
Personal3 Compared to the second supplier, there is a better interaction between the main ) : 
suppllers people and ours. 5.07 1.54 
Personal4 Compared to the second supplier, we interact better with the main supplier. 4 91 1.57 
Personal5 Compared to the second supplier, we can address problems more easily with the 
main supplier. 4.83 1.56 
Personal6 Compared to the second supplier, we can discuss problems more freely with the maln 
supplier. 4.69 1.59 
Personal? Compared to the second supplier, the main supplier gives us a greater feeling of 
belng treated as an important customer. 4.72 1.63 
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APPENDIX 





Continued 
ea a are eS 
M SD 
Relationshlp Value 
RELVAL1 Compared to the second supplier, the main supplier adds more value to the 
relationship overall. 4.99 1.51 
RELVAL2 Compared to the second supplier, we galn more in our relationship with the main 
supplier. 4.93 1.45 
RELVALS Compared to the second supplier, the relationship with the maln supplier is more 
valuable. 5.01 1.51 
RELVAL4 Compared to the second supplier, the maln supplier creates more value for us when 
comparing all costs and benefits in the relationship. 5.01 1.49 
Relatlonship Costs 
How do each of the following costs of the main supplier compare with the costs of your second-best suppller? 
Main Main Main Main 
Supplier's Suppller's Main Supplier's Supplier's 
Costs Are Costs Are Supplier's Costs Are Costs Are 
Much Somewhat Costs Are Somewhat Much 
Lower Lower the Same Higher Higher M SD 





















Purchasing 
price 
Ordering 
costs 
Delivery 
costs 
Inventory 
carrying 
costs 
Coordination 
and com- 
municatlon 
costs 
Manufacturing 
costs 
Downtime 
costs 


2.51 1.02 
2.61 72 


2.63 .89 


2.60 1.81 


2.62 .84 
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Markus Christen, Ganesh lyer, & David Soberman 


Job Satisfaction, Job Performance, 
and Effort: A Reexamination Using 
Agency Theory 


The objective of this article is to clarify ambiguities in the literature regarding the relationships among three key 
constructs of work relationships: effort, job performance, and job satisfaction. The relationship between job perfor- 
mance and job satisfaction is of central interest to research in organizational psychology. However, empirical 
research in that area finds that the link between these constructs is weak at best. A negative effect of effort on job 
satisfaction is consistent with agency theory, but there is limited empirical evidence to support this assumption. 
Moreover, some studies have found a positive effect of effort on job satisfaction. Using a model that incorporates 
the main constructs from agency theory and organizational psychology, the current study finds a negative, direct 
effect of effort and a positive, direct effect of job performance on job satisfaction. The authors show that conflicting 
findings in the literature are the result of inconsistency in both the measurement and the definition of constructs 
across studies that do not fully account for all the relationships between constructs. The current findings empha- 
size the need to distinguish clearly between factors that represent employees’ inputs in a work relationship (i.e., 
effort) and those that represent their outputs (i.e., job performance). The article also demonstrates the importance 
of properly accounting or controlling for all key variables to ellminate biases that can arise in empirical research on 





work relationships. 


tional psychology that considers the role of job satis- 

faction in managing effective work relationships. 
Similarly, job satisfaction 1s a widely studied construct in 
marketing research on sales force (Brown and Peterson 
1993), retail store managers (Lusch and Serpkenci 1990), 
and service workers (Boyt, Lusch, and Naylor 2001). This 
research examines the antecedents of job satisfaction and, 
in particular, the effects of job performance, effort, and the 
compensation structure. However, findings in this literature 
about the relationships between job satisfaction and these 
antecedents have been inconsistent and even controversial. 
For example, despite the finding that people derive intrinsic 
value from work, the relationship between job performance 
and job satisfaction has been found to be inconsistent and 
' weak (Brown and Peterson 1993; Iaffaldano and Muchinsky 
1985). Similarly, studies that examine the effect of effort on 
job satisfaction find that it has a positive effect (Brown and 
Peterson 1994). This second finding appears to contradict 
the logic of the equally large literature on agency relation- 
ships in economics and marketing, which is based on the 


T is an extensive body of research in organiza- 
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assumption that effort is costly to an agent and therefore 
reduces the agent’s utility (or job satisfaction). 

The objective of this article is to develop a model of 
work relationships to investigate the relationship between 
job satisfaction and its key determinants, job performance 
and effort. The premise is that a complete understanding of 
job satisfaction and work relationships must be predicated 
on a theory of how effort affects job satisfaction and the 
way that effort affects the relationship between job perfor- 
mance and job satisfaction. To this end, we draw on both 
agency theory and organizational psychology. We consider 
a role of effort that is consistent with a principal—agent 
model (Holmstrom 1979): When compensation and other 
factors are controlled for, effort is a cost for an agent. We 
then embed this cost in a job satisfaction model (Porter and 
Lawler 1968) to clarify the relationships between job satis- 
faction and its key antecedents. A clear understanding of 
these relationships is important to design employment con- 
tracts that optimize firm performance, while providing sat- 
isfactory incentives and compensation for an employee. 

When we consider the three constructs of job satisfac- 
tion, job performance, and effort jointly, we find a strong 
positive effect of job performance on job satisfaction, a 
result that has long been hypothesized in marketing and 
organizational psychology but has received weak empirical 
support (Iaffaldano and Muchinsky 1985). In addition, 
when we account for the moderating effect of job perfor- 
mance, we find that effort has a strong negative effect on 
job satisfaction, a result that supports the theoretical 
assumption of empirical studies of contracts, such as sales 
force compensation plans (e.g., Lal, Outland, and Staelin 
1994). Both omitted variables and construct definition prob- 
lems explain why some previous research has found a posi- 
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tive effect of effort on job satisfaction. The result of our 
model and analysis is consistent with the agency theoretic 
view of effort. At the same time, we recognize the manner 
by which other determinants of job satisfaction mediate the 
impact of effort on job satisfaction. 

Another important factor that must be incorporated into 
a model of work relationships is compensation. Agency 
theory makes a distinction between fixed compensation 
(e.g., salary) and variable compensation that depends on 
output (e.g., bonuses based on individual or firm perfor- 
mance). The agency theoretic prediction is that fixed com- 
pensation should have a significant effect on job satisfaction 
but not on effort, regardless of an employee's risk prefer- 
ence. Our results support this prediction. 

In addition to an examination of fixed salary, our data 
set enables us to examine a common compensation variable 
that has received little attention in the literature: corporate- 
wide profit-sharing plans tied to overall firm performance. 
This type of compensation is interesting because in a large 
organization, it is neither fixed nor affected solely by the 
individual employee's performance. We find that these 
profit-sharing plans have a significant effect on both job sat- 
isfaction and effort. 

This article is relevant to researchers in marketing and 
management who are interested in the relationship between 
job satisfaction and job performance. The results counter 
previous findings that there is no relationship between the 
two factors. Job performance has a direct and positive effect 
on job satisfaction when we properly account for effort. 
Effort has a positive effect on job performance and, thus, a 
positive, indirect effect on job satisfaction. 

Next, the article demonstrates the problems of omitted 
variables and unclear construct definitions in researching 
work relationships within organizations. Findings about the 
relationship among effort, job performance, and job satis- 
faction can be unclear or inconsistent with received theory 
when key constructs are omitted or lumped together. This 
suggests the need for empirical research to distinguish 
clearly between factors that are inputs (i.e., effort) and those 
that are outputs (1.e., job performance). 

We next provide a summary of the relevant literature, 
beginning with a discussion of the relationship between job 
performance and job satisfaction in marketing and organiza- 
tional psychology. We follow this summary with a discus- 
sion of the effect of effort as it is purported in agency 
theory. On the basis of the literature review, we then set 
forth several hypotheses. We déscribe our empirical model 
and discuss the data and estimation approach. After we pre- 
sent the estimation results, we conclude with a discussion of 
the implications of our findings and highlight several 
important issues for further research. 


Literature Review 


The Relationship Between Job Performance and 
Job Satisfaction 

The idea that satisfied employees are more productive held 
through the 1970s. However, it was difficult to obtain sup- 
port for the view that job satisfaction has a significant effect 
on job performance. As a result, the reverse (that an 
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employee's job performance affects his or her job satisfac- 
tion) became the focus of research in the area (Lawler and 
Porter 1967). Although the idea that an employee's job per- 
formance affects his or her job satisfaction is consistent 
with several psychological theories, such as intrinsic moti- 
vation theory (Deci and Ryan 1985), few studies have found 
support for it (Iaffaldano and Muchinsky 1985). Similarly, 
organizational studies of the sales force in marketing invari- 
ably find that the relationship between job performance and 
job satisfaction is weak (Bagozzi 1980; Brown and Peterson 
1993). As Brown and Peterson (1993) note, if the effect of 
job performance on job satisfaction is insignificant, firm 
actions designed to increase job performance should not 
have a direct effect on job satisfaction and related out- 
comes, such as employee turnover. 

Conversely, there seems to be consensus that many 
antecedents of job satisfaction and job performance are 
common (e.g., effort, compensation, quality of supervision, 
clarity of job responsibilities). Consequently, it is essential 
to account for both direct and indirect effects of common 
antecedents to reach the conclusion that the effect of job 
performance on job satisfaction 1s insignificant. 

The literature suggests that the antecedents of job satis- 
faction can be categorized into personal characteristics, role 
perceptions, and organizational variables (Brown and Peter- 
son 1993). A typical job satisfaction model appears in 
Figure 1. 

It is intuitive that effort and compensation are critical 
determinants for the viability of any employment situation. 
An employee will not be willing to exert effort unless he or 
she is paid, and the employer will not be willing to pay 
unless the employee works. A contract needs to be individ- 
ually rational (i.e., both parties must expect to be better off 
by engaging in the work relationship). However, few, if any, 
studies include the employee's effort as an antecedent of job 
satisfaction (Brown and Peterson 1994). One exception is 
that of Clark and Oswald (1996), who use job satisfaction 
as a proxy for utility. They find a negative (albeit weak) 


FIGURE 1 
A Typical Job Satisfaction Model 
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Notes’ Signs Indicate the direction of effects according to the Iitera- 
ture (Brown and Peterson 1993). Parentheses indicate a 
hypothesized relationship that has not been confirmed by 
empirical results. 


effect of effort on job satisfaction. However, because job 
performance is omitted, its role as a potential mediator of 
the effect of effort on job satisfaction is not considered. 

Many studies treat effort as part of job performance, 
which is defined broadly as an aggregate construct of effort, 
skill, and outcomes that are important to the employee and 
outcomes that are important to the firm (e.g., Behrman and 
Perreault 1984; Lusch and Serpkenci 1990; Walker, 
Churchill, and Ford 1977). Similar to the work of Bagozzi 
(1978), a few studies use a narrow definition of job perfor- 
mance based on actual sales or other objective productivity 
measures. However, these studies do not include effort as a 
separate construct. 

We argue that it is important to define effort as distinct 
from job performance. From the perspective of an 
employee, job performance and effort are different. Effort is 
an input to work, and job performance is an output from 
this effort. From a firm’s perspective, effort and job perfor- 
mance may be difficult to distinguish, and effort is often 
inferred from the output produced (the possibility of high 
effort and low output or low effort and high output is often 
not considered). This may explain the inclusion of.effort in 
the definition of job performance. Some studies include 
work motivation as an antecedent to job satisfaction, but 
motivation (“I want to work hard") is not the same as 
exerted effort (“I did work hard and spent a lot of time and 
energy”). 

The implications of either neglecting effort or consider- 
ing it a part of job performance for the empirically observed 
relationship between job performance and job satisfaction 
can be significant. If effort is costly for an employee, ignor- 
ing effort can bias the estimated effect of job performance 
on job satisfaction (because effort should increase job per- 
formance). Failing to control for effort induces a negative 
spurious correlation, which may reduce or even hide a true 
positive effect of job performance on job satisfaction. Simi- 
larly, by including effort in the measure of job performance, 
negative and positive aspects can nullify each other, yield- 
ing an effect for job performance that is again biased toward 
insignificance. 

In summary, the existing job satisfaction research in 
marketing and organizational psychology is likely ham- 
pered by an omitted-variables bias and imprecise definitions 
of job performance. The objective of this article is to under- 
stand this relationship within a framework that overcomes 
these problems. 


The Impact of Effort on Job Satisfaction 


If effort is costly for an employee, it should have a negative, 
direct effect on job satisfaction. This implies that there is a 
conflict of interest between the employer, who wants the 
employee to work hard, and the employee, who wants the 
salary with the minimum possible effort. This conflict is the 
basis for the literature in economics on the relationship 
between principals and agents (Hart and Holmstrom 1987; 
Holmstrom 1979). Nevertheless, aside from the previously 
mentioned study by Clark and Oswald (1996), there is little 
direct empirical evidence that effort is a cost that makes 
employees less happy. Moreover, studies of job satisfaction 
that include effort as an antecedent tend to find that it has a 


positive effect on job satisfaction (Brown and Peterson 
1994). 

An objective of the article is to resolve the inconsis- 
tency between theoretical arguments and empirical find- 
ings. Our interest is to confirm empirically that the relation- 
ship between job satisfaction and effort is negative, because 
it is the basis for much of the research on contracts and job 
design. To accomplish this objective, we rely on prior 
research in economics that has established job satisfaction 
as a good proxy for utility (Clark and Oswald 1996; Fried- 
man 1978) to untangle the direct and indirect effects of 
effort on job satisfaction (e.g., through compensation or job 
performance). The total effect of effort should indeed be 
positive, otherwise an employee would be better off not 
working. 

Our conjecture is that the primary impediment to find- 
ing a negative relationship in previous empirical studies is a 
problem of omitted variables. Given that employment situa- 
tions are typically subject to a problem of moral hazard, 
employers use a combination of outcome-based control 
(e.g., performance-based contracts) and behavior-based 
control (e.g., monitoring) to prevent employees from exert- 
ing minimal effort (Anderson and Oliver 1987), but these 
controls also affect employees’ job satisfaction.! As a result, 
it is essential to account for the effect of these controls 
when estimating the valence and strength of the relationship 
between effort and job satisfaction. For example, in situa- 
tions with incentive pay, which employers often use to com- 
pensate sales people, high effort (though costly) may lead to 
strong job performance and, thus, high compensation. This 
would increase job satisfaction and largely mitigate the neg- 
ative effect of costly effort on job satisfaction. A model that 
does not account for the indirect path (through compensa- 
tion) by which effort affects job satisfaction would find that 
the direct effect of effort on job satisfaction is either 
insignificant or even positive. A key requirement to identify 
the direct effect of effort on job satisfaction is to account for 
the indirect paths through which effort can affect job satis- 
faction (especially those created by the employer’s control 
system). Next, we summarize the preceding discussion with 
a set of hypotheses and then propose a model of job satis- 
faction that incorporates an agency relationship and the key 
antecedents of job satisfaction into a comprehensive 
framework. 


Hypotheses 

The first hypothesis addresses the relationship between job 
performance and job satisfaction. By treating job perfor- 
mance as distinct from effort and accounting for the direct 
effect of effort on job satisfaction, we expect the following 
relationship: 

H,: All else being equal, an employee’s job satisfaction 

increases with his or her job performance. 


! Moral hazard is a contracting problem between a principal and 
an agent when an agent expends effort that only he or she observes 
and the effect of effort on output 1s uncertain. Consequently, the 
agent will not choose an efficient level of effort. 
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In the context of our organizational setting, we show 
how null effects can be found by adopting approaches used 
in previous job satisfaction studies. 

H,: All else being equal, omitting effort as a separate 

antecedent of job satisfaction, either by excluding it from 


the model or by combining it with job performance, 
reduces the effect of job performance on job satisfaction. 


The next three hypotheses are based on the existence of 
a typical agency relationship between the firm and an 
employee. 
Ha: All else being equal, job performance increases with an 
employee’s effort and ability. 
H,: All else being equal, firm performance increases with an 
employee’s job performance. 


A basic assumption of agency theory is that effort is 
costly to an agent. By defining effort as an input and by 
considering indirect paths through which effort affects job 
satisfaction, we hypothesize that effort should be costly for 
an employee: 

H4: All else being equal, an employee's job satisfaction 

decreases with his or her effort. 


Our last hypothesis posits a direct implication of the 
agency model for fixed compensation (i.e., compensation 
that does not change with an employee's output). 


Hs: All else being equal, fixed compensation has a significant, 
positive effect on an employee's job satisfaction but no 
significant effect on the employee's effort. 


2The positive effect of job performance on job satisfaction 
appears at odds with standard agency theory. However, a positive 
effect may be consistent with a model of repeated interactions 
between a principal and an agent. 


Model 


The firm's objective is to maximize financial performance 
within the business unit or department affected by the 
employee's job performance. Similarly, the employee maxi- 
mizes job satisfaction. Business unit performance is 
assumed to be, in part, a result of the employee's job per- 
formance, which is determined by the employee's effort and 
ability. Our data consist of the performance and satisfaction 
of store managers in a retail chain, so the relevant business 
unit for each employee is a store. The firm (in our case, the 
retail chain) controls compensation and promotion opportu- 
nities and can influence other aspects of the job (e.g., 
responsibilities). The idea is that these variables potentially 
influence both the employee's job satisfaction and the 
employee's willingness to work hard. We summarize the 
model in Figure 2. 

The model translates into a system of four equations 
that represent (1) the relationship among store performance, 
SP; the employee's job performance, JP,; and the 
employee's effort, ME, (Equations 1—3) and (2) the relation- 
ship between job satisfaction, JS,, and relevant aspects of 
the employee's work situation, including effort and job per- 
formance (Equation 4): 


(1) SP, = à + Pyp JP, + £x 
(2) JP, = Q + Bug; ME, + B A2MA, + £2» 
(3) ME, = 05 + Bc3C, T Bps3PS, + YX T OY, + Ezp 


and 
(4) JS, = Q4 + Pme4ME, + Pyp4JP, + Bc4C, + Bps4PS; + Yj jJ 
YO Yu + £4, 


where C; is employee i's perception of compensation and 
PS, is a dummy variable indicating participation in the 


FIGURE 2 
Proposed Model of Job Satisfaction and Store Performance 
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profit-sharing plan. Following the literature, we divide the 
aspects of the job into job characteristics, X,,, and role per- 
ceptions, Y,;. Job characteristics include the employee's 
perception of the job’s attractiveness, the job’s autonomy, 
the quality of supervision, and the social climate at the work 
place. Role perceptions refer to the employee’s perception 
of his or her responsibilities in terms of clarity, degree of 
overload, and degree of conflict. The nine B parameters are 
estimated to assess our hypotheses. We subsequently define 
and describe the four endogenous constructs and the key 
exogenous factors (we discuss measurement issues in the 
“Data and Estimation” section). 

The store performance equation (Equation 1) reflects a 
vertical relationship in which the employee (in this.case, the 
store manager) performs different tasks to generate outputs 
for the firm (the retail chain), and SP, is a measure of a 
store’s financial performance. The summary construct, JP,, 
is a measure of the employee’s job performance. We define 
this as a qualitative assessment of a store’s performance 
relative to targets and objectives set for that store (as sum- 
marized by a supervisor). We treat job performance as an 
intermediate outcome variable (between effort and store 
performance), but our framework also recognizes that other 
factors affect store performance. 

The job performance equation (Equation 2) reflects the 
idea that the employee's job performance, JP,, is a function 
of his or her effort, ME,. Consistent with the work of Camp- 
bell and Pritchard (1976), we define effort as the amount of 
energy and time an employee puts into the job. Employees 
are not endowed with the same ability to perform the job, 
MA, (ability refers to an employee’s skill and knowledge 
related to the specific duties of the job). This should also 
have a significant effect on observed job performance. 

The effort equation (Equation 3) reflects the assumption 
that effort, ME,, is influenced by various job factors, some 
of which the firm controls. The set of job factors included in 
Equation 3 consist of compensation, C;; access to the profit 
sharing plan, PS;; four job characteristics, X,; and three role 
perception factors, Y (we elaborate on these characteristics 
and factors subsequently). We used a dummy variable to 
indicate an employee’s participation in the profit-sharing 
plan (29% of the store managers in the sample were 
included in the plan). 

Finally, the job satisfaction equation (Equation 4) is 
based on the assumption that an employee’s job satisfaction 
is a function of his or her job performance, his or her effort, 
and the job characteristics and role perceptions that affect 
his or her effort (see the discussion of Equation 3). 

Our model includes four job factors, X, (J 2 1, ..., 4), 
that are standard in the literature: job autonomy, job attrac- 
tiveness, quality of supervisory feedback, and social cli- 


3When the performance of a firm depends on various other 
exogenous factors, it is important to control for bad (good) out- 
comes, even when a manager's job performance is high (low). 
Therefore, eliminating job performance from the model and using 
only store performance could give a misleading result for the 
effect of effort. If the most talented and hardest working store 
managers were assigned to the worst-performing stores to improve 
them, it might be possible to find a negative effect of effort on 
store performance. 


mate. Together with the compensation variables, they repre- 
sent the job's core characteristics (Fried and Ferris 1986). 
We define “job autonomy" as the degree to which the firm 
provides independence and discretion to the employee in 
fulfilling his or her role. We define “job attractiveness” as 
the degree to which a job is exciting, challenging, and pro- 
vides a sense of accomplishment. We define “supervisory 
feedback" as the extent to which the employee receives 
information about his or her performance. Supervisory 
feedback is important because it often has psychological 
value for an employee. It is also a key element of the firm's 
control system. Finally, we define the variable “social cli- 
mate" as the degree to which there 1s a good working rela- 
tionship among the employees within the store. We use the 
employee's perception of these factors because perceptions 
are the primary drivers of the effort taken and satisfaction 
received (see, e.g., Judge, Bono, and Locke 2000).4 

We include three role perception variables, Y,; (k = 1, 
..., 3): role ambiguity, role conflict, and role overload. 
These are known to have significant, negative effects on 
both job performance and job satisfaction (Brown and 
Peterson 1993). We do not have a direct interest in the effect 
of these variables, but we include them in the model 
because of their acknowledged importance in explaining 
job performance and job satisfaction. We define "role ambi- 
guity" as a combination of uncertainty about the relation- 
ship between action and output and a lack of clear direc- 
tions and behavioral requirements from the supervisor 
(Rizzo, House, and Lirtzman 1970). We define "role con- 
flict" in terms of the congruency of various job responsibil- 
ities (1.e., the degree to which goals, objectives, and respon- 
sibilities of a position conflict with one another; Rizzo, 
House, and Lirtzman 1970). Role conflict can arise from 
inconsistent demands from different role partners (e.g., 
supervisors, coworkers, customers) on the employee; incon- 
gruencies with personal values; conflicts among different 
roles; and conflicts among time, resources, and demands 
(role overload). Because the workload of the employees in 
our data set is high and a distinctive job attribute, we 
include "role overload" as a separate factor. 

To complete the model specification, we include inter- 
cepts, Œy, and error terms, £4, for each of the £ = 1, ..., 4 
equations to account for unobserved factors and random 
measurement error. We also add a set of variables that cap- 
ture store and store manager characteristics for control and 
identification purposes. We provide details in the discussion 
of our estimation approach. Next, we describe the data set 
and discuss measurement and estimation issues. 


Data and Estimation 


Data Set 


We use data that Lusch and Serpkenci (1990) collected for 
their study of the effect of personal difference variables on 


^Effort and job satisfaction may also be affected by promotion 
opportunities. A factor analysis of the data (see the “Data and Esti- 
mation" section) indicates that “promotion” cannot be identified 
independently of compensation. Thus, we exclude promotion as an 
independent factor from the analysis. 
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the job satisfaction of retail store managers. This data set is 
based on a survey of a U.S. grocery retailer with more than 
200 supermarkets. It contains typical measures collected for 
job satisfaction studies and extensive operating and 
accounting information for each store. The stores are uni- 
form in terms of merchandise and layout, and most stores 
have 7000 to 10,000 square feet of selling space. Stores are 
about equally distributed over metropolitan and rural areas. 
Each store is staffed by a store manager and one or more 
assistant managers. A district supervisor monitors the oper- 
ations of 10 to 15 stores and reports directly to senior 
management. 

The data consist of three individual data sets collected 
from three different sources. The retailer provided operating 
and financial data for each store for the most recent 
three years. District managers (supervisors) completed a 
survey to provide an assessment of store managers’ perfor- 
mance, effort, and ability. Store managers completed a sur- 
vey on the quality of their work life, which provides the 
data for the store managers’ assessment of job factors, role 
perceptions, and job satisfaction. This survey also captured 
several of the store managers’ personal and attitudinal 
characteristics. 

For the statistical analysis, there were 188 usable obser- 
vations from 226 stores (18 surveys were not returned, and 
20 were eliminated because of missing data). In addition, 
we eliminated 11 observations from store managers who 
were at the current job for less than one year, which left us 
with 177 observations for estimation. 


Measurement of Constructs 


Job satisfaction. Locke (1976, p. 1300) defines job sat- 
isfaction as “a pleasurable or positive emotional state,” 
which is “a function of the perceived relationship between 
what one wants from a job and what one perceives it is 
offering.” In essence, job satisfaction is an overall state that 
is derived from experiencing a work situation. Because our 
approach embeds a principal—agent structure within a job 
satisfaction model, we use job satisfaction as a proxy for the 
utility from working (derived by the employee). This 
assumption follows other “workplace studies" that have an 
economic orientation (Clark 1997; Clark and Oswald 1996; 
Friedman 1978).5 

There are two general approaches to measure overall 
job satisfaction. The first determines job satisfaction by the 
measurement of satisfaction with different job “facets” 
(e.g., Smith, Kendall, and Hulin's [1969] job descriptive 
index); the second measures overall or "global" job satis- 
faction directly (e.g., Hackman and Oldham's [1975] job 
diagnostic survey). Research shows that in many cases, the 
global approach is superior to the facet approach (e.g., 
Scarpello and Campbell 1983). As a result, we use a global 
three-item measure of job satisfaction. 


SThese studies define a worker's utility function as u = u(y, e, i, 
j), where y is 1ncome, e is effort extended, and 1 and j are sets of 
individual and job parameters, respectively. In contrast to this 
specification, however, we also include job performance as a 
determinant of utility. 
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Store manager effort, ability, and job performance. We 
obtained measures of effort, ability, and job performance 
(for each store manager) from two different scales com- 
pleted by the district supervisor. The Appendix provides a 
summary of the different scale items we used to measure 
the different constructs in our model. As we show in Table 
1, all measures have good reliability properties. Cronbach's 
alpha values range from .81 for ability to .89 for effort. 


Store performance. The economic outcome of interest 
to the retail chain is store profits. To reduce potential 
accounting problems, we use a store's operating profit, 
which we define as store sales less store operational 
expenses for the current year. To control for store size, we 
divide the operating profit measure by available store sell- 
ing space for the same year.” l 

Compensation, job factors, and role perceptions. The 
scales that Lusch and Serpkenci (1990) use to measure a 
store manager’s perception of compensation and job factors 
are derived from the job descriptive index (Smith, Kendall, 
and Hulin 1969). They contain sufficient items to measure a 
store manager's perceptions of the five factors reliably (i.e., 
compensation, job attractiveness, job autonomy, supervi- 
sory feedback, and social climate). Individual store man- 
agers provided answers. Each of the five factors is mea- 
sured by its respective scores on two to four items based on 
a five-point Likert scale, ranging from "strongly agree" to 
“strongly disagree,’ or on a six-point Likert scale, ranging 
from “definitely yes" to “definitely no.’ The scale items 
appear in the Appendix. Table 1 shows that Cronbach's 
alpha values range from .71 for job autonomy and social 
climate to .85 for compensation. As we previously noted, 
we indicated inclusion in the corporate profit-sharing plan 
with a dummy variable. Lusch and Serpkenci (1990) mea- 
sure the three different role factors with a multi-item scale 
(see the Appendix). Cronbach’s alpha for these three factors 
ranges between .69 for role conflict and .80 for role 
overload. 

To ensure that the factors measure unique aspects of a 
store manager’s job situation, we conducted a factor analy- 
sis. An exploratory factor analysis yielded eight factors with 
eigenvalues greater than one. We then conducted a confir- 
matory factor analysis to test the eight-factor solution, and 
we compared it with other possible solutions. Table 2 pro- 
vides a summary of various goodness-of-fit indicators for 
different factor solutions. Although the chi-square test sta- 
tistic indicates that the restrictions underlying the eight- 
factor solution are significant (which is expected given the 


6Effort, ability, and job performance are significantly correlated 
(see Table 1). Nevertheless, a factor analysis of the data strongly 
indicated the presence of three factors. For each construct, we 
selected scale items on the basis of the results of the factor analy- 
sis. Moreover, higher correlation hurts efficiency, but estimates are 
still consistent. 

7Note that unobserved store-specific factors and the retailer's 
strategy can affect both store performance and a store manager's 
effort. This can lead to biased parameter estimates if not properly 
controlled for By measuring job performance relative to store 
objectives, we can account for heterogeneity in profit potential 
across stores. We also include several store characteristics in the 
estimation to mitigate this problem further. 


TABLE 1 
Pairwise Correlation Coefficients and Measurement Properties of Variables 





Variables 





1 2 3 4 5 6 7 8 9 10 11 12 13 14 





1. Compensation 








2. Profit sharing& .02 

3. Attractivoness 35 | —04 

4. Autonomy .38 00 | .42 

5. Supervisor 37 14  .37  .38 

6. Social climate .48 05 .53 45 .36 

7. Role ambiguity —28 —.03 —39 —.46 -46 -.30 

8. Role conflict —.11 05 —13 06 -—11 -—12 .40 

9. Role overload —.28 04 —37 —24 -30 -.24 .40 .35 
10. Ability .13 09 —02  .13 10 —03 .-—07 -—04 -.16 
11. Manager effort .06 10 —02  .17 15  —06  —04 08  —10  .69 
12. Job performance 14 02 —.07 .08 11 —03 -—16 -05 -—17 .66 .66 
13. Job satisfaction 47 10 .58  .43 27 2 —34 -16 -—34 .10 08  .09 
14. Store performance> 06 -.07 -.04 -.13 .01 —08 -04 -.10 -19 35 #19 33 —.01 
M 1.88 29 3.74 413 299 3.54 1.87 2.11 2.75 445 439 4.30 4.53 56.8 
SD 77 .46  .75  .85 .73  .87 d 88 .79  .89 78 1.21 1.19 21.8 
Number of scale items 4 1 4 2 4 2 4 3 3 3 5 3 3 — 
Cronbach's a 85 — .75  .719. .73  Á.729  .79 69 .80 81 S89 88 .67 — 
alndicator variable 
bStore operating profit relative to selling space. 
cSimple correlation 

TABLE 2 
Goodness-of-Fit Statistics for Different Factor Solutions 
Steiger- Chi- Bentler Parsimo- 

Number of Lind Square Jóreskog Comparative nious Bollen's Bollen's 
Factorsa RMSEA (d.f.) GFI Fit Index Fit Index Rho Delta : 
3 .116 998.7 (321) .696 .695 .559 .575 .698 
4 (Overload) .101 918.7 (319) .736 .730 584 .607 .733 
5 (Social climate) .090 810.9 (315) .765 777 .614 .648 .780 
6 (Autonomy) .077 714.5 (310) .793 .818 .638 .685 .820 
7 (Supervisor) .062 628.3 (304) .825 .869 .654 .728 .866 
8 (Conflict) .049 481.6 (297) .875 .912 .685 .764 .904 
9 (Promotion) .048 474.1 (289) .883 .907 .690 .768 .899 


aThe three factors are compensation (compensation, promotion, social climate), work (attractiveness, role conflict, role overload), and organt- 
zation (autonomy, role amblgulty, supervisor). The subsequent rows indicate higher-factor solutions. The factor that is separated from the pre- 
vious solution with fewer factors is In parentheses. 

Notes: RMSEA = root mean square error of approximation, and GFI = and goodness-of-fit index 


sample size), its goodness-of-fit increases significantly 
compared with solutions with fewer factors. In addition, the 
indicators have reasonably high values. 

In summary, we use standard measures for the con- 
structs of interest, and they satisfy typical reliability stan- 
dards. Because our data contain several five-, six-, and 
seven-point Likert scales, we standardized all item 
responses before creating indicators for estimation. 


Estimation Approach 


The model we outlined in the previous section is a triangu- 
lar system of four simultaneous equations with four 
endogenous variables (effort, job performance, job satisfac- 
tion, and store performance), ten exogenous variables (a 
store manager's ability, six job factors, and three role per- 
ception variables), and several store and store manager 


characteristics added for control and identification pur- 
poses. To simplify the exposition, we do not discuss them in 
detail. However, the error terms across the four equations 


8We included the following variables in our system of equa- 
tions: store performance equation (Equation 1): age of the store, 
number of store employees, tenure of the store manager, and a 
dummy variable for a urban store location; job performance equa- 
tion (Equation 2): a series of dummy variables to indicate the 
supervisor; effort equation (Equation 3): a dummy variable for 
urban store location, tenure, and age of store manager and four 
personal difference variables we adapted from Lusch and Serp- 
kenci’s (1990) study; and job satisfaction equation (Equation 4): 
four personal difference variables. Thus, for every equation, the 
number of exogenous variables is larger than the number of 
included endogenous variables, which satisfies the order condition 
of identification. The restriction that effort and job satisfaction do 
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are most likely correlated (i.e., the error covariance matrix 
is not diagonal, that is, the system is not fully recursive). In 
this situation, Lahiri and Schmidt (1978) show that a trian- 
gular system can be estimated in the manner of seemingly 
unrelated regression models (ignoring the simultaneity) 
with maximum likelihood estimation. This approach yields 
consistent and efficient estimates of the error variances, 
which is necessary for parameter estimates to be consistent 
(Greene 1991).9 

One estimation problem is the possibility that access to 
the profit-sharing plan is determined through self-selection 
by the store managers or by the chain. À comparison of 
demographic and personality variables between store man- 
agers with and without the plan suggests that enrollment in 
the plan was random. À second problem is the possibility of 
random shocks that can simultaneously affect job perfor- 
mance and store performance. This can lead to a problem of 
contemporaneous correlation in the store performance 
equation (Equation 1), which we control for by using instru- 
mental variable estimation. Specifically, we use store per- 
formance and other available variables lagged by one and 
two years as instruments. We then use the predicted value of 
store performance to estimate the system of equations. 


Results 


We first present the results for our four-equation model, 
which yield tests of H; and H4-Hg. We then analyze two 
alternative models to replicate several controversial results 


not directly affect store performance ensures that the rank condi- 
tion is satisfied as well. 

?Three-stage least squares estimation can provide more efficient 
results by 1mposing greater constraints on the system. However, 
this makes estimates more susceptible to model specification prob- 
lems. In any event, the results generated with three-stage least 
squares estimation lead to identical conclusions (the magnitudes 
of some estimated parameters are different). 


in job satisfaction studies. These alternative models enable 
us to assess H». 


Effort, Job Performance, and Compensation 


The estimation results for Equations 1—4 appear in Table 3. 
Rows 2-4 show the results for the direct and indirect effects 
of a store manager's effort and job performance, Rows 5—6 
show the results for the compensation variables, Rows 7—10 
show the results for the effects of job characteristics, and 
the bottom three rows show the effect of role perceptions. 

First, the effect of job performance on the manager's job 
satisfaction is positive and highly significant, providing 
strong support for H, (Byp4 = .286, p < .001).19 The data can 
also be used to refute the idea that happy employees are 
more productive. If we reverse this relationship by adding 
job satisfaction to Equation 2 and by removing the effect of 
job performance from Equation 4, we find no significant 
effect of job satisfaction on job performance (B = .098, p = 
.20). 

Second, as we proposed in H3, job performance 
increases with both effort (Big; = .158, p < .05) and ability 
(Bman = .387, p < .001). Consistent with H4, store perfor- 
mance increases with job performance (Bp, = 6.67, p < 
.001). We also conducted a series of tests to assess the 
robustness of our findings with respect to alternative 
assumptions. For several variables, we tested for nonlinear 
effects on job satisfaction. All were insignificant. The posi- 
tive effect of job performance on store performance is 
robust with respect to different store performance measures 
(e.g., net profit, sales per square foot). Because of unob- 


` IOWhen our hypotheses make clear directional predictions, we 
report the statistical significance based on one-tailed t-tests. We 
report the significance of effects of other parameters based on two- 
tailed t-tests. Estimation details for additional analyses we pre- 
sented in this section are available on request. 


TABLE 3 
Estimation Results 


Store Store Manager Store Manager Store Manager 
Performance Job Performance Effort Job Satisfaction 

Parameter (Equation 1) (Equation 2) (Equation 3) (Equation 4) 
Intercept (o) 27.0*** (4.93) —495™ (.211) 078  (.213) —.054 (.051) 
Effort (Bur) .158"* (.074) —.269"** (.061) 
Ability (By) .387*** (.073) 
Job performance (Bp) 6.67*** (1.33) .286*** (.054) 
Compensation (Bc,) —.001  (.087) .168*** (.057) 
Profit sharing (Bps,) .183* (.109) .246*** (.080) 
Job autonomy (7; ,) .249*** (.079) .096* (.053) 
Job attractiveness (yo) .020  (.092) .476*** (.061) 
Supervisory feedback (Yaz) .140* (.080) —.128™ (.060) 
Social climate (Y4) —.204** (.083) .176"** (.056) 
Role ambiguity (8, ,) 093 (.091) —.026 (.060) 
Role confilct (55,) —.024 (.054) —.047  (.031) 
Role overload (ð) —.119* (.070) —.083* (.045) 


*p « .10 (two-talled t-test). 
**p « .05 (two-tailed t-test). 
***p < .01 (two-tailed t-test). 


Notes. Number of observations = 177; total number of parameters estimated (including control variables) = 67. Standard errors are in 


parentheses. 
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served contemporaneous effects, we used an instrumented 
store performance variable in our estimations. The actual 
measure of store performance can artificially inflate the 
measured effect of job performance on store performance. 
We repeated the estimation with actual store performance 
and found this to be the case. To determine whether a man- 
ager’s job satisfaction depends on the performance of his or 
her store (Ostroff 1992), we added store performance to 
Equation 4. We found that the effect of store performance 
on job satisfaction is insignificant (B = .002, p = .21). 

Third, and most important, we found strong support for 
Hs (i.e., effort is a “cost” to the store manager; E 
—.268, p < .001). Finally, the estimate for the effect of com- 
pensation on job satisfaction has the expected positive sign 
and is statistically significant (c4 = .168, p < .01), whereas 
its effect on effort is not significant (Bes = —,001, p = .99), 
as we expected. These two results support H«. We also esti- 
mated the system with an estimate of the store manager's 
annual salary added to Equations 3 and 4 and found that the 
results are unaffected. For job satisfaction, the effect of 
actual salary is only marginally significant. 

As we previously mentioned, our data set provides us 
with the ability to examine the effects of a compensation 
component that is used frequently in firms but has not been 
subject to significant empirical research: corporate-level 
profit-sharing plans. As with other compensation elements, 
access to this plan, which some store managers had, should 
increase job satisfaction (all else being equal). As we show 
in Table 3, we find a positive effect of the profit-sharing 
plan on job satisfaction. (Bps4 = .246, p < .001). The effect 
of this plan on effort is less obvious. According to agency 
theory, there should be no effect, because the effort of an 
individual store manager has only a small impact on corpo- 
rate profitability. Conversely, arguments in the organiza- 
tional psychology literature suggest a positive effect 
because contingent rewards that depend less directly on a 
person's own performance are considered less controlling 
(Deci 1971; Ryan, Mims, and Koestner 1983). We found 
that access to the profit-sharing plan leads to greater effort 
(Bos, = .183, p < .05). Therefore, it would be useful to 
investigate further the mechanism through which such plans 
affect the agent's effort. 

By carefully separating input (store manager's effort) 
from output (store manager's job performance), we find 
support for both the importance of intrinsic rewards that 
store managers obtain through job performance (H4) and 
the assumption that effort per se is a disutility or cost for 
them (H3). Because a store manager's effort increases job 
performance (H3), it has a positive, indirect effect on job 
satisfaction. However, this positive, indirect effect of effort 
(.158 x .286 = .045) is of smaller magnitude than the nega- 
tive, direct effect (-.269). This implies that a store manager 
must receive compensation for his or her effort. 


Other Job Factors and Role Perceptions 


The estimates for the four job factors and the three role per- 
ceptions included in the job satisfaction equation (Equation 
4) have signs that are consistent with existing research, 
except for the effect of the supervisor, which is negative. A 
store manager's job satisfaction increases with job auton- 


omy (314 = .096, p < .1), job attractiveness (954 = .476, p < 
.001), and good social climate (444 = .176, p < .01). In addi- 
tion to the quality of supervisory feedback (444 = —128, p < 
.05), the three role perception factors also decrease job sat- 
isfaction, though at best, their effects are only marginally 
significant (ambiguity: 014 = —.026, p = .67; conflict: 854 = 
—.047, p = .13; overload: 834 = —.083, p < .1). 

For the store manager effort equation (Equation 3), we 
found (expected) positive effects of job autonomy (4,3 = 
.249, p < .01) and the quality of supervisory feedback 
(333 = .140, p < .1). Conversely, job attractiveness has no 
significant effect on effort (454 = .020, p = .83). Notably, we 
found that a better social climate is associated with lower 
effort (944 = —.204, p < .05). This finding indicates that in 
hierarchical organizations, such as a retail store, excessive 
socialization might interfere with the retailer's objectives of 
extracting high effort from the store employees. Although 
the store manager gains satisfaction from positive work 
relationships, having subordinates that are more than just 
employees (i.e., friends) may interfere with the ability to 
manage them. This counters the argument that retailers that 
create close working relationships between employees and 
store management have higher levels of performance 
(Dunne and Lusch 1999). Of the three role perception varia- 
bles, only the effect of role overload is marginally signifi- 
cant (044 = —119, p < .1).!! In general, we find that the 
effect of the role perception variables are weaker than 
reported in other job satisfaction studies (Brown and Peter- 
son 1993; Lusch and Serpkenci 1990). This difference may 
be caused by the exclusion of effort, as was the case in these 
studies (i.e., the disutility from effort could have been par- 
tially attributed to stress factors). 

As we mentioned previously, we included several varia- 
bles that were indicative of store and store manager charac- 
teristics in the estimation for control and identification pur- 
poses. We do not discuss them except for noting that the 
signs of the personal difference variables are consistent 
with Lusch and Serpkenci's (1990) hypotheses. 


Alternativo Models 


We now examine how the results of our analysis are 
affected by changing the definition of job performance and 
excluding effort as an antecedent from the model (as H3 
suggests). To test these ideas, we eliminate effort (i.e., the 
effort equation) from the model and add the job characteris- 
tic and role perception variables to the job performance 
equation. In a first alternate model, we use an "aggregate" 
measure of job performance that includes effort and ability, 
as Walker, Churchill, and Ford (1977) advocate and as 


I'We estimated a model that combined the three role perception 
variables into a single “job tension" factor. We found a marginally 
significant effect on job satisfaction (6 = —.100, p « .1) and an 
insignificant effect on effort (6 = —.096, p = .28). In addition, we 
allowed for direct effects of job factors and role perception varia- 
bles on store performance and found no significant effect and no 
change ın the parameter estimates of interest. However, these fac- 
tors could potentially moderate the effect of effort on job perfor- 
mance. Such a test is left to further research. 
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Lusch and Serpkenci (1990) use. We then estimate this 
same model (without effort) with the “narrow” measure of 
job performance we used in our original model. This repli- 
cates Bagozzi’s (1978) approach. We expect the first alter- 
nate model to replicate the insignificant effect of job perfor- 
mance on job satisfaction that Lusch and Serpkenci (1990) 
report: The negative and the positive elements in the job 
performance measure should cancel each other out. By 
completely eliminating effort, the second version should 
yield a somewhat larger effect of job performance on job 
satisfaction, but it should still be substantially smaller than 
the one we obtained from the fully specified four-equation 
model. 

The estimation results shown in the second and third 
columns of Table 4 support these conjectures. (To facilitate 
the comparison, we repeated the relevant results from Table 
3 in the first column of Table 4.) Using an aggregate mea- 
sure of job performance leads to an insignificant effect of 
job performance on job satisfaction (B = .023, p = .76). This 
result is equivalent to the insignificant effect that Lusch and 
Serpkenci (1990) report. The use of the narrow measure of 


job performance leads to a positive but much smaller effect 
(B = .136, p < .05).12 These results strongly support H> and 
indicate that failing to control properly for the effect of 
effort on job satisfaction leads to results that are biased 
toward insignificance. !3 

Note that the effects of different job factors on job per- 
formance and job satisfaction are different in the alternate 
models. In particular, when we exclude effort from the job 
performance measure, the results show a negative and sig- 
nificant effect of job attractiveness and a negative and mar- 
ginally significant effect of job autonomy on job perfor- 


I7The second alternate model is a nested version of the full 
model. As a result, we can statistically test the implied restrictions 
using a chi-square test. The alternate model can be rejected with a 
high degree of significance (y? = 21.4, degree of freedom = 1). 

13We also estimated a model similar to that of Brown and Peter- 
son (1994), which excluded job characteristics, especially the 
compensation variables. In this case, we found a positive effect of 
effort. 


TABLE 4 
Estimation Results for Alternate Models 





Relevant Results 
Parameter from Table 3 
Job Satisfaction 
Constant —.054 (.051) 
Effort —.269*** (.061) 
Job performance .286*** (.054) 
Compensation .168*** (.057) 
Profit sharing .246*** (.080) 
Job autonomy .096* (.061) 
Job attractiveness .476*** ( 053) 
Supervisory feedback —.128  (.060) 
Social climate .176*** (.056) 
Hole ambiguity —.026  (.060) 
Role conflict —.047  (.031) 
Role overload —.083  (.045) 
Effort/Job Performanceb 
Constant 078  (.213) 
Compensatlon —001  (.087) 
Profit sharing .183* (.109) 
Job autonomy .249*** (.079) 
Job attractivoness 020 (.092) 
Supervisory feedback 140* (.080) 
Social climate —.204** (.083) 
Role ambiguity 093  (.091) 
Role conflict —.024 (.054) 
Role overload —.119"* (.070) 
x2 (d.f.) i 


*p < .10 (two-tailed t-test). 

™p < .05 (two-tailed t-test). 

"**p < .01 (two-tailed t-test). 

aWe left the control variables as In the original model. 


Job Satisfaction Modelsa 





Narrow Job 
Performance Measure 


Aggregate Job 
Performance Measure 


—044  (.066) —072  (.077) 
023 — (.076) .136** (.062) 
.299*** (.085) .239*** (.086) 
163 (121) .166  (.119) 
.285*** (.084) .380*** (.079) 
.304*** (.092) .357*** (.092) 

—081  (.091) —.099  (.089) 
.168** (.083) .190** (.083) 
019 — (.097) —026  (.090) 
.026 (.073) —009  (.054) 
057  (.081) —.040  (.080) 

—190 (331) —.630* (376) 
.090 — (.075) .186*  (.086) 
.289*** (.111) 134  (.126) 
.166** (.074) —101  (.081) 

—088  (.081) —.195** (.081) 
.129* (.078) .052 — (.088) 

—.213*** (.072) —107  (.083) 
0711 (.063) —106  (.090) 

—028  (.072) .045 — (.055) 

—096  (.071) —156* (.080) 
12.9*** (1) 21.4*** (1) 


bBecause the two Job satisfaction models did not Include effort, the various Job characteristic and role perception vanables were directly linked 


to Job performance. 
Notes: Standard errors are indicated in parentheses. 
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mance. Again, these results suggest that omitting effort 
leads to biased estimates. 


Discussion 


This study was motivated by inconsistencies across differ- 
ent studies in the job satisfaction research, including limited 
and contradictory evidence that high levels of effort are 
costly for employees. An examination of the literature sug- 
gests that the inconsistencies most likely arise from differ- 
ences in construct definitions and a failure to account for 
effort properly. Effort and compensation are the most 
important factors in a vertical work relationship, yet few job 
satisfaction studies account for both as separate constructs. 
Furthermore, in marketing, the majority of studies examine 
the job satisfaction of salespeople (i.e., a context in which 
compensation is frequently related to job performance and, 
thus, effort). The failure to account for effort here (and, 
thus, the effort-compensation relationship) also biases esti- 
mates of the relationship between compensation and job 
satisfaction. In addition, we were interested in understand- 
ing the effects of corporatewide profit-sharing plans, which 
is a relatively unresearched element of the compensation 
package in many organizations. 


The Relationship Between Job Performance and 
Job Satisfaction 


The empirical analysis in this article uses data collected for 
a typical job satisfaction study, but the model we propose 
incorporates the basic constructs from an agency relation- 
ship and the key antecedents of job satisfaction. In contrast 
to previous models, this framework accounts for both the 
direct and the indirect effects of effort (through job perfor- 
mance) on job satisfaction and leads to conclusions that are 
significantly different. 

First, we find a significant, positive effect of job perfor- 
mance on job satisfaction. This contrasts with Lusch and 
Serpkenci's (1990) reported negative (but insignificant) 
effect of job performance, using the same data set. We show 
that not accounting for effort biases the estimated effect of 
job performance on job satisfaction, and this can lead to 
incorrect conclusions. Second, by changing the model and 
construct measures to be consistent with prior studies, we 
. replicate small or insignificant effects of job performance 
on job satisfaction and a positive effect of effort on job sat- 
isfaction, as reported in the existing literature. This supports 
the argument that problems with construct definition can 
explain inconsistent findings in the job satisfaction litera- 
ture. This finding also challenges the now widely accepted 
view about the relationship between job performance and 
job satisfaction in organizational psychology and the sales 
force literature that the two constructs do not affect each 
other. In particular, decades of research have failed to find a 
significant or consistent link between job performance and 
job satisfaction (Brown and Peterson 1993; Iaffaldano and 
Muchinsky 1985). 

A positive effect of job performance on job satisfaction 
also has important implications for a firm that wants to 
motivate and retain talented employees. It implies that 
actions to increase job performance can also increase 


employees’ job satisfaction. As a result, benefits such as 
reduced turnover and less absenteeism (a result of higher 
job satisfaction) may be useful for justifying the cost of a 
policy, even when that policy is directed primarily toward 
improving job performance. 


Effort, Job Performance, and Job Satisfaction 


Because job satisfaction is a proxy for utility (Clark and 
Oswald 1996; Friedman 1978), a negative relationship 
between manager effort and job satisfaction follows from 
the principal-agent framework. Nevertheless, the relation- 
ship has proved elusive to empirical validation. Recogniz- 
ing the different possible paths through which an 
employee's effort may affect his or her satisfaction enables 
us to confirm a negative relationship empirically. This find- 
ing, combined with the positive effect of job performance 
on store performance, is consistent with the economic view 
that there 1s an inherent conflict of interest in vertical rela- 
tionships. When accounting for effort, we also find rela- 
tively weak effects of the role perception variables. Factors 
such as role ambiguity and role conflict are considered key 
determinants of job satisfaction in the sales force literature 
(Brown and Peterson 1993). Because these factors capture 
work-related stress, it is possible that they captured the 
omitted effect of effort. 

The positive effect of job performance on job satisfac- 
tion also has important implications for research based on 
agency theory. In a standard agency model (e.g., Lal, Out- 
land, and Staelin 1994), it is assumed that job performance 
can increase job satisfaction only indirectly (tbrough factors 
such as compensation or promotion). Our research demon- 
strates that this standard assumption of agency theory may 
not be justified. The omission of job performance's effect 
on job satisfaction has the potential to bias the effect of 
effort on job satisfaction toward insignificance.!4 For exam- 
ple, when the effect of job performance is eliminated from 
the job satisfaction equation (Equation 4), we obtain a neg- 
ative but insignificant effect of effort on job satisfaction 
(B = —031, p = .24).15 


Corporate Profit-Sharing Plan 


In general, the literature distinguishes between two types of 
income that employees can earn: fixed compensation, 
which is paid to the employee independent of output, and 
variable pay, which is paid to the employee contingent on 
output. In our data, there is a third type of compensation: 
payments that are based on overall corporate performance. 
Because corporate performance is not known in advance, 
the expected payment to the employee under this plan is 
uncertain. In contrast to variable pay, which is directly con- 
tingent on an employee's output, the ability of an employee 
in a large organization, such as a retail chain, to affect the 
level of the expected payoff from a corporate-level plan is 


I4This is analogous to the way that the omission of effort's 
effect on Job satisfaction biases the effect of job performance 
toward 1nsignificance. 

15Clark (1997) does not include job performance and also finds 
a relatively weak negative effect of effort. 
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also low. The findings show that this type of compensation 
has some incentive pay-type effects: It has positive effects 
on both effort and job satisfaction. This holds even though 
increased effort by a store manager is unlikely to affect the 
ultimate payout from the plan significantly. This is useful 
because many firms would like to provide incentives to 
their employees (with performance-based pay) but cannot 
because the impact of an individual employee on output 
cannot be isolated. Most employees in marketing positions 
have a relatively small direct effect on the outcome of their 
brand, business unit, or customer service department. In 
these conditions, profit-sharing plans based on the perfor- 
mance of the overall organization provide an opportunity to 
obtain an effect similar to that of performance-based pay for 
the sales force. Thus, although data limitations prevent us 
from conducting a detailed analysis of why corporate profit- 
sharing plans act the way they do, our results indicate that 
these plans can be of significant value in organizations. 


Summary and Further Research 
This article attempts to clarify ambiguities in the literature 
about the relationships among effort, job performance, and 
job satisfaction. Using a model with variables that are 
important for both agency theory and organizational psy- 
chology, we find a negative, direct effect of effort and a 
positive, direct effect of job performance on job satisfac- 
tion. We show that conflicting findings in the literature are a 
result of inconsistency in both the measurement and the 
definition of constructs across studies that do not fully 
account for all the relationships among constructs. This 
suggests that some results in the sales force literature 
regarding job satisfaction, such as the strong effect of role 
perception variables, should be reexamined. This article 


also demonstrates important omitted variables biases that 
can arise in the empirical analysis of work relationships. 

The most obvious extension is to generalize the findings 
to agency relationships in which the incentives in the com- 
pensation packages are stronger (i.e., there are variable 
components in the compensation that are directly related to 
the agent's output). Restricting the analysis to a single 
retailer provides a benefit by reducing the potential number 
of confounding factors. We also identified an interesting 
finding about corporatewide profit-sharing plans. However, 
we cannot determine its relative effectiveness with the 
available data. An interesting issue for further investigation 
is to determine whether and how different types of employ- 
ees respond to pay that is based on overall corporate 
performance. 

Similar to any empirical study of complex organiza- 
tional issues, this study is subject to several cautions. For 
some of our measures, we use cross-sectional data, and this 
limits our ability to control fully for certain unobserved 
effects. In addition, it limits our ability to test the direction 
of causality. To the extent possible, we make use of covari- 
ates and test for reverse relationships. We have also exer- 
cised care to reduce potential problems pertaining to mea- 
surement errors, sampling bias, and incomplete model 
specification. That being said, the study has two method- 
ological aspects that differentiate it from other studies. 
First, we use individual-level data to test an individual-level 
model, as Lal, Outland, and Staelin (1994) suggest. Second, 
we use multiple sources of information to measure the vari- 
ous constructs, thus reducing the problem of common 
method bias and common source bias. This enables us to 
shed new light on the determinants of job satisfaction and 
provide insights that are important both for researchers in 
this domain and for organizations. 


APPENDIX 


Scale Items to Measure Model Constructs 
et ee 


Compensation 
| am paid fairly for the work | do.a 
My pay is better than that for similar jobs In other flrms.& 


Salary and wage increases are given to those who do a 
good job.a 


Do you feel your pay is as high In comparison with what 
others get for similar work in other companies?b 


Job Attractiveness 


After a day's work, | really feel like | have accomplished 
something.8 


| am In a “dead-end” job. (R)@ 
| am often bored with my Job. (R)e 


Do you find your work challenging, exciting, and glving you 
a sense of accomplishment?> 
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Role Confllct 


Having to decide things that affect the lives of individuals 
that you know.c 


Feeling that you may not be liked and accepted by the 
people you work with.c 


Feeling that you do things on the job that are against your 
better judgment.c 


Role Overload 


Feeling that you have heavy a workload, one that you can't 
possibly finish dunng an ordinary workday.c 


Thinking that the amount of work you have to do interferes 
with how well it gets done.c 


Feeling that your Job tends to interfere with your famlly life.c 


APPENDIX 
Continued 





Job Autonomy 
| have a lot to say about how to do my job.@ 


How satisfied are you with the amount of control you have 
In your work?f 


Supervisory Feedback 
Management is quick to criticize poor performance. (R)8 
No one ever says "you've done a good job.” (R)8 


Do you:feel you do not know what your supervisor thinks of 
you, how he evaluates your performance? (R)? 


Do you feel free to offer suggestions concerning policies 
and procedures affecting your operation?b 
Soclal Climate 
Working for Winn's is like belng a part of a family.8 
The people here are proud to work for Winn's.8 


Role Ambigulty 


Belng unclear on just what the scope and responsibilities of 
your job are. 





Feeling that you are not fully qualifled to handle your job.c 


The fact that you can't get enough information to carry out 
your job.¢ 

Not knowing Just what the people you work with expect of 
you.¢ 


Source: Based on instruments that Lusch and Serpkenci (1990) use. 


Effort 
Takes responsibility in his work.d 
Readily assumes responsibility.d 
Makes an effort to improve his managerial skills.¢ 
Works long hours when necessary.d 
Level of motivation. 


Ability 
Can cope with pressure or strain on the job.d 
Decision-making ability.e 
Knowledge of trade area.e 





Tolerance for pressure.® 


Job Performance 
Fails to meet to target goals set for the store. (R)d 
Achieving the target sales volume.® 
Achieving the target gross profit.e 


Job Satisfaction 
How satisfied are you with your general work situation’?! 


Would you advise a friend looking for a new job to take one 
similar to yours?b 


| [ust hate to get up In the morning to go to work. 


aMeasured on a five-point Likert scale, ranging from “strongly agree" to "strongly disagree " 
bMeasured on a six-point Likert scale, ranging from "definitely yes" to “definitely no.” 

cMeasured on a five-point Likert scale, ranging from “never bothered" to “bothered nearly all the time." 
dMeasured on a five-point Likert scale, ranging from “strongly disagree” to “strongly agree.” 
eMeasured on a seven-point Likert scale, ranging from "very satisfied" to “very dissatisfied.’ 
{Measured on a six-point Likert scale, ranging from "extremely satisfied" to “extremely dissatisfied." 


Notes: R = reverse coded. 


ee 
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The Emergence of Dominant 
Designs 


In many product categories, technological evolution results in the emergence of a single product design that 
achieves market dominance. In this article, the authors examine two questions: Will a dominant design emerge in 
a new product category? and If it does, how long will it be before a dominant design emerges? Thus, the authors 
simultaneously model the probability of emergence of a dominant design and the time of that emergence, condi- 
tional on its emergence. The model incorporates the effects of several product-market characteristics on the prob- 
ability and time of dominant design emergence, including appropriability of the rents associated with the product, 
network effects, size of the products value net, the standards-setting process, radicalness of innovation, and 
research-and-development intensity. The authors use data for 63 office products and consumer durables to esti- 
mate a split-population hazard model for the probability and time of emergence of the dominant design. They find 
that a dominant design is more likely to emerge with weak appropriability, weak network effects, low product radi- 
calness, and high research-and-development intensity. Dominant designs that emerge are likely to emerge sooner 
in product categories in which there is weak appropriability, there are a large number of firms in the value net, the 
standards are set by a de facto process, and there is low product radicalness The proposed model can be used to 





predict both the probability and the time of the emergence of a dominant design in a new product category. 


ticular product's design architecture as one that defines 

the specifications for the entire product category; such a 
design is referred to as the dominant design (Abernathy and 
Utterback 1978; Utterback 1994). The emergence of a 
dominant design in a product category is an important event 
that significantly affects firms' strategies and performance. 
The late emergence of a dominant design can retard market 
growth because potential customers delay purchasing the 
product and wait for a dominant design to emerge (Utter- 
back 1994). A dominant design also shapes future genera- 
tions of products in the category, resulting in "an architec- 
tural franchise" for the firm with a dominant design and 
potentially locking out competitors (Schilling 1998). In this 
article, we examine the probability and time of emergence 
of a dominant design in a new product category. 

There 1s considerable variation in the emergence of 
dominant designs across product categories. For example, 


k: many new product categories, the market accepts a par- 
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the modern fax machine was introduced in 1960, but the 
dominant design, the GIII, did not emerge until 1983 
(Baum, Korn, and Kotha 1995). Conversely, the dominant 
design for DVD players emerged just three years after prod- 
uct introduction in 1996. In other product categories, a 
dominant design has not emerged many years after product 
introduction, and it seems unlikely that a dominant design 
will ever materialize. For example, the camcorder category, 
introduced in 1984, supports multiple designs more than 20 
years later. These designs include the VHS (home video 
system), VHS-C (home video system—compact), super 
VHS, super VHS-C, 8 mm, Hi-8, and digital 8 mm formats. 

Dominant designs have been documented in diverse 
product categories, including VCRs, nuclear reactors, auto- 
matically controlled machine tools, and watches (Utterback 
1994). The factors that influence the emergence of domi- 
nant designs have been identified with conceptual models 
(Lee et al. 1995; Smith 1997; Suaréz 2004), whereas empir- 
ical research has focused primarily on the ex post perfor- 
mance implications of dominant designs (Christensen, 
Suaréz, and Utterback 1998; Suaréz and Utterback 1995). 

The study of dominant designs is highly relevant for 
marketing managers whose strategies must be informed by 
marketplace dynamics. Competition before the emergence 
of a dominant design is between designs, whereas after its 
emergence, competition is within the more circumscribed 
domain of a dominant design. À dominant design not only 
influences the strategies and performance of firms in the 
product category (e.g., DVD players) but also affects firms 
in ancillary categories (e.g., movies and video games for 
DVD players). 

In this article, we examine the effects of product-market 
characteristics on the probability and time of the emergence 
of a dominant design. We address two questions: First, what 
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product-market characteristics influence whether a domi- 
nant design will emerge in a product category? Second, 
conditional on the emergence of a dominant design, what 
product-market characteristics influence the time of that 
emergence? 

Consistent with prior research (Christensen, Suaréz, and 
Utterback 1998), we take a market-facing view and define 
the time of the dominant design’s emergence as when the 
market clearly favors one design over others. Unlike prior 
research, which has implicitly assumed that a dominant 
design will always emerge, our approach incorporates the 
possibility that a dominant design may sometimes never 
emerge. 

We estimate a split-population hazard model (Schmidt 
and Witte 1989; Sinha and Chandrashekaran 1992) of the 
probability and time of emergence of a dominant design, 
which allows for two subpopulations of product categories: 
one in which a dominant design will emerge and one in 
which it will never emerge. We incorporate product-market 
characteristics as predictors of the emergence of dominant 
designs and calibrate the model using data on 63 office 
products and consumer durables. The data are right cen- 
sored for 33 products for which a dominant design had not 
yet emerged by 2003. 

The results show that a dominant design is more likely 
to emerge with weak appropriability of rents associated 
with the product, weak network effects, low product radi- 
calness, and high research-and-development (R&D) inten- 
sity. Dominant designs that emerge do so sooner when there 
is weak appropriability, a large number of firms in the value 
net, de facto standards, and low product radicalness. The 
same factor may influence both the dominant design’s prob- 
ability and its time of emergence, but in different ways. The 
proposed model can be used as a predictive tool to estimate 
both the probability of emergence of a dominant design and 
the time of its emergence, given a set of product-market 
characteristics. 

We organize the article as follows: In the next two sec- 
tions, we provide an overview of dominant designs and pre- 
sent the proposed model and hypotheses. We then describe 
the data, the model estimation procedures, and the results. 
We conclude by discussing the article’s contributions, sum- 
marizing its limitations, and identifying directions for fur- 
ther research. 


Dominant Designs: An Overview 


The concept of a dominant design emerged from studies on 
industrial innovation in the 1970s. In the early stages of 
market evolution, high market and technical uncertainty 
results in a diversity of product designs (Abernathy and 
Utterback 1978; Dosi 1982; Lee et al. 1995; Smith 1997; 
Utterback 1994). At some point, the market favors one 
product’s design (i.e., the future dominant design) over the 
other designs. A dominant design may not necessarily be 
the one that embodies superior technical performance; 
sometimes, it is a satisficing design in terms of technical 
possibilities driven by the accommodation of commercial 
interests among suppliers, users, and competitors (Tushman 
and Rosenkopf 1992; Wade 1995). 
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Evolutionary processes of variation, selection, and 
retention characterize product category evolution (e.g., 
Anderson and Tushman 1990; Tushman and Murmann 
1998). Technological breakthroughs create a rivalry 
between alternative designs, resulting in a period of design 
variation or ferment. Viewed in this evolutionary perspec- 
tive, the emergence of a dominant design is the transition 
point between the periods of variation and selection. 

Empirical studies document the emergence of dominant 
designs in diverse product categories, including typewriters, 
televisions, electronic calculators, automobiles (Utterback 
1994), VCRs (Cusumano, Mylonadis, and Rosenbloom 
1992), cochlear implants (Van de Ven and Garud 1994), fax 
machines (Baum, Korn, and Kotha 1995), cement, glass, 
and minicomputers (Anderson and Tushman 1990). The 
emergence of a dominant design affects firms' market 
shares (e.g., Anderson and Tushman 1990) and survival 
(e.g., Baum, Korn, and Kotha 1995; Christensen, Suaréz, 
and Utterback 1998; Suaréz and Utterback 1995; Tegarden, 
Hatfield, and Echols 1999). 

However, dominant designs do not emerge in all prod- 
uct categories (e.g., camcorders, supercomputers, video 
game consoles). To our knowledge, prior research has not 
examined why dominant designs emerge in some product 
categories but not in others. Thus, we address the question 
in this article. 


Definition of Dominant Designs 


Over the years, the definition of dominant designs has 
evolved from being broad and possibly tautological (i.e., 
being defined by its effects) to being more specific. Table 1 
summarizes alternative definitions of dominant designs in 
the literature. In this article, we seek a definition of a domi- 
nant design that is independent of its effects. Following the 
work of Christensen, Suaréz, and Utterback (1998), we 
define a dominant design as the specification (consisting of 
a single design feature or a complement of design features) 
that defines the product category's architecture. We measure 
the time to a dominant design's emergence as the time 
between the introduction of the first product in the category 
and the time an alternative in the category satisfies our 
definition. 


Dominant Designs Versus Standards 


Some prior research (e.g., Anderson and Tushman 1990; 
Besen and Farrell 1994; Katz and Shapiro 1986; Schilling 
1998) has equated "dominant designs" with "standards," 
using the terms interchangeably. Indeed, such terms as 
"standards wars" have been used to denote the battle 
between designs that results in a dominant design (Shapiro 
and Varian 1999). In this article, we draw a distinction 
between dominant designs and standards. 

Following the convention used in the engineering disci- 
pline and by numerous standards bodies, we use the term 
"standards" to denote the technical specifications for qual- 
ity, reference, compatibility, adaptability, and connectivity 
that are required for the proper functioning of products, 
such as fax machines, computers, and railway tracks 
(Grindley 1995; Krechmer 2000). Standards are an 


TABLE 1 
Alternative Definitions of a Dominant Design in the Extant Literature 


Empirical Method to Identify a 
Dominant Design 


Conceptual paper. 


Source Definition of a Dominant Design 
Abernathy and A dominant design is a single architecture that establishes 
Utterback (1978) dominance in a product category. 


Anderson and 
Tushman (1990) 


A dominant design is a single architecture that establishes 
dominance In a product category. 


A design is dominant if it 
acquires more than 50% market 
share of the product category 
and maintains it for four 
consecutive years. 


Utterback (1994) The dominant design In a product category is the one that No details provided. 
wins the allegiance of the marketplace; it Is the one that 
competitors and innovators must adhere to If they hope to 
command significant market following. A dominant design Is a 
product in a product category that gains general acceptance 
as the standard on technical features that other market players 
must follow If they wish to acquire significant market share. 
Suaréz and The dominant design is a specific path along an industry's Industry experts were used to 
Utterback (1995) design hierarchy that establishes dominance among classify dominant designs In 
competing design paths. typewriters, automobiles, 
televisions, picture tubes, 
transistors, and electronic 
calculators. 
Christensen, A dominant design emerges in a product category when one Industry experts were used to 
Suaréz, and product's design specifications (consisting of a single or a identify the emergence of the 
Utterback (1998) complement of design features) define the product category's dominant design in the rigid disk 


architecture. 


inevitable requirement for these products because of inter- 
dependency among several components or among the prod- 
ucts' users because these products must either connect with 
one another directly (e.g., fax machines) or indirectly (e.g., 
personal computers [PCs]) or require complementary goods 
to meet users' needs (e.g., DVD players). Thus, standards in 
a product category serve a functional purpose that is inde- 
pendent of market acceptance, whereas by definition, mar- 
ket acceptance is an integral aspect of a dominant design. 

Two other aspects of dominant designs differentiate 
them from standards: First, a dominant design often 
emerges from competition among several designs. For 
example, in the home video recorder market, the VHS 
emerged as the dominant design as a result of competition 
between two video standards, Sony's Betamax and Japanese 
Victor Company's (JVC's) VHS. Second, dominant designs 
often consist of numerous standards. For example, the 
dominant design in the CD-player category, the Philips— 
Sony design, has more than two dozen codified standards. 

To summarize, although there are product categories in 
which dominant designs are equivalent to the standard in 
the product category (e.g., DVD in digital home video play- 
ers), there are also categories with multiple standards and 
no dominant design (e.g., for camcorders, there are VHS, 
8 mm, mini-DVD, and so forth). 


drive industry based on the 
technical elements of the 
product category evolution over 
time. 


A Model of the Probability and Time 
of Emergence of a Dominant 
Design 
Technological evolution has long been viewed as if it were 
a random process with a path-dependent trajectory that 
defies systematic modeling (e.g., Ames and Rosenberg 
1977; Rosenberg 1994). However, this view has been chal- 
lenged by studies that model a product category's evolution 
as a function of its product-market characteristics (Agarwal 
and Bayus 2002; Golder and Tellis 1997; Gupta, Jain, and 
Sawhney 1999). In the spirit of this latter work, we suggest 
that a product category’s product-market characteristics 
affect its evolution, influencing both the probability and the 

time of the emergence of a dominant design. 

Extending previous research, we argue that the follow- 
ing product-market characteristics affect whether and when 
a dominant design emerges in a product category: appropri- 
ability, network effects, the number of firms in its value net, 
the type of standards-setting process (de jure versus de 
facto), its radicalness, and its R&D intensity. We suggest 
that some product-market characteristics affect both the 
probability and the time of a dominant design's emergence, 
whereas others affect only the probability or the time of 
emergence. Next, we define these product-market charac- 
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teristics and develop hypotheses of their effects on the prob- 
ability and time of emergence of a dominant design.! 


Appropriability 


Appropriability refers to aspects of the product category 
that govern firms’ abilities to capture innovation rents in the 
product category (Levin et al. 1987; Teece 1986). Levin and 
colleagues (1987) studied more than 100 industries and 
concluded that appropriability is a multifaceted construct 
that embodies the following six aspects: patents to prevent 
duplication, patents to secure royalty income, secrecy of the 
new product development effort, lead time for development 
of the product, learning-curve efficiency, and sales and ser- 
vice effort that underlies the innovation. Appropriability for 
a product category can be represented along a continuum 
that extends from a tight (closed) regime, in which firms 
can appropriate most or all innovation rents, to a weak 
(open) regime, in which firms can appropriate little or no 
rents. 

Tight appropriability leads to disjointed competition and 
localized monopolies with several independent, discon- 
nected market niches that reduce the selection pressures that 
are crucial for the emergence of a dominant design (Ander- 
son and Tushman 1990; Frenken, Saviotti, and Trommetter 
1999). With tight appropriability, product prices may also 
be higher because of limited industrywide learning effects, 
lowering the probability of emergence of a dominant design 
(Levin et al. 1987). In such a situation, developers of com- 
plementary goods may not have the knowledge or the 
incentives to develop complementary goods, reducing 
selection pressures and delaying the emergence of a domi- 
nant design. 

The advantages associated with a weak appropriability 
regime have gained prominence with the emergence of the 
open-source movement in software development (Garud 
and Kumaraswamy 1993; Lecocq and Demil 2002; West 
2003). Weak appropriability enables knowledge sharing 
among firms and the coevolution of networks of cooperat- 
ing competitors (Brandenberger and Nalebuff 1996). With 
weak appropriability, all firms in the network, or the “value 
net" as we call it, can share in the innovation rents. Such 
rent-sharing reduces firms' incentives to innovate indepen- 
dently of the future dominant design, both increasing the 
probability of the emergence of a dominant design and has- 
tening the time to emergence (Gallini 1985). For example, 
the emergence of the IBM PC as the dominant design in 
PCs resulted, in part, because of its open architecture with 
extensive public documentation (Khazam and Mowery 
1994). Likewise, the emergence of the DVD format 
resulted, in part, because of the open format and extensive 
public documentation. 

Integrating these arguments, we expect that weak 
appropriability in a product category increases selection 
pressures and positively affects both the probability of the 


lThe model also includes two control variables—the year of 
first product introduction and the presence of a recessionary envi- 
ronment during first product introduction—for which we do not 
develop hypotheses but present the results subsequently. 
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emergence of a dominant design and the time of the emer- 

gence. Thus, we expect the events "emergence" (e) and 

“time of emergence" (t) to be conditionally dependent, 
given appropriability. 

Hie: The weaker the appropriability of the product category, 

the more probable is the emergence of a dominant design. 

Hc The weaker the appropriability of the product category, 


the shorter is the time to the emergence of a dominant 
design. 


Network Effects 


Positive network effects exist when a customer's utility 
from a product increases as the number of customers who 
use the product (or compatible products) increases (Katz 
and Shapiro 1986). When the utility of a product to each 
user depends on the number of other users, the product 
exhibits direct network effects (e.g., fax machine, tele- 
phone). Indirect network effects arise when the link 
between consumer utility and the number of users in the 
network occurs through the increased availability of com- 
plementary products (e.g., movies on DVD for DVD play- 
ers). Àn increase in the number of users of the focal hard- 
ware product increases the availability of complementary 
goods, which in turn increases the utility that customers 
derive from the focal product. 

Some aspects of strong network effects suggest that 
dominant designs are more likely to emerge and emerge 
early. For example, a dominant design in a product category 
with strong network effects assures potential adopters that 
the dominant design's network will be the largest, resulting 
in its further adoption and ultimately establishing its market 
supremacy. Networked markets are typically "winner take 
all" so that a dominant design may have an early, large mar- 
ket share (Schilling 2002; Shapiro and Varian 1999). Such 
markets may tilt toward one design and do so rapidly 
because of selection pressures to choose a design early 
(e.g., Besen and Farrell 1994). 

Other aspects of strong network effects suggest that 
dominant designs are less likely, and if they do emerge, they 
emerge later. Prospective customers of products with direct 
network effects may adopt a “wait-and-see”’ attitude, result- 
ing in "excess inertia,” which reduces the probability of 
emergence (Farrell and Saloner 1986; Goldenberg, Libai, 
and Muller 2002). Such excess inertia may also exist in 
product categories with indirect network effects in which 
the lack of complementary goods may cause a "chicken- 
and-egg" coordination problem that delays the emergence 
of a dominant design (Gupta, Jain, and Sawhney 1999). For 
example, in the HDTV market, Gupta, Jain, and Sawhney 
(1999) find that complementors' inactions delayed the 
emergence of a dominant design. Srinivasan, Lilien, and 
Rangaswamy (2004) report that the survival duration of a 
pioneer is negatively influenced by network effects (i.e., 
pioneers that introduce products with stronger network 
effects have shorter survival duration than pioneers that 
introduce products with weaker network effects), support- 
ing the existence of excess inertia in product categories with 
network effects. Given the opposing effects of network 
effects, a priori, we do not hypothesize a directional effect 
of network effects on either the probability or the time of 


emergence of a dominant design; rather, we observe it 
empirically. 


H>,: The stronger (weaker) the network effects in the product 
category, the less (more) probable 1s the emergence of a 
dominant design. 


Ha: The stronger (weaker) the network effects in the product 
category, the longer (shorter) 1s the time to the emergence 
of a dominant design. 


Value Net 


A firm's linkages with its suppliers and producers of com- 
plementary products are important sources of relational 
rents for the firm and utility for its customers (Dyer and 
Singh 1998). The product's value net consists of suppliers 
and producers of complementary goods that deliver utility 
either directly or indirectly to the customer (Stabell and 
Fjledstad 1998). Tushman and Rosenkopf (1992) and Wade 
(1995) suggest that the interdependencies within a value 
net, or what they call a “technological community," play a 
major role in the trajectory of technological evolution. Our 
interest is whether and how the size of tbe value net affects 
the emergence of a dominant design in a product category. 

Because the value net typically consists of firms with 
different, frequently competing objectives, firms may ini- 
tially prefer different designs, and thus we do not expect the 
value net of a product category to have the cohesion neces- 
sary to create selection pressures to affect the probability of 
the emergence of a dominant design; however, if the emer- 
gence of a dominant design seems inevitable, we expect the 
product's value net to affect the time to emergence, as we 
describe subsequently. 

The more firms that are in the value net, the greater is 
the incentive for each firm to support a dominant design (if 
it emerges) because of higher resultant revenues (Amit and 
Zott 2001), thus reducing the time to the emergence of a 
dominant design. Consider the DVD player: Firms in the 
value net included the DVD player manufacturers (e.g., 
Matsushita, Sony, Philips), film studios (e.g., Paramount, 
Disney), film producers (e.g., Dreamworks, Pixar), movie 
rental chains (e.g., Blockbuster, Hollywood Video), and 
media manufacturers (e.g., Memorex, TDK). A dominant 
design permits firms in the value net to suspend further 
investments in product development, to compete within the 
dominant design, and to achieve supply-side and demand- 
side efficiencies (Utterback 1994). As a result, as the num- 
ber of firms in the value net increases, the greater is the rev- 
enue generation in the value net, providing the financial and 
Institutional pressure to establish a dominant design quickly 
(Wade 1995). 

However, the more firms that are in the value net, the 
greater is the agency costs (e.g., transaction and coordina- 
tion costs) for various firms in the value net, which could 
potentially delay a dominant design's emergence (Bergen, 
Dutta, and Walker 1992). Although we acknowledge this 
latter, opposing effect, we expect that the incentives for rent 
generation and sharing dominate the potential higher trans- 
action costs. 


Hz: The greater the number of firms that are ın the value net 
of the product category, the shorter 1s the time to the 
emergence of a dominant design. 


Note that according to our hypothesis, we expect the events 
of emergence and time of emergence to be conditionally 
independent, given the value net of the product category. 


Standards-Setting Process 


A review of the standards-setting process suggests that this 
process can be broadly classified into two types: de jure and 
de facto standards setting (Besen and Farrell 1994; David 
and Greenstein 1990; Farrell and Saloner 1988, 1992). In a 
de jure standards-setting process, standards are established 
through formal processes by standards organizations, which 
may include independent bodies, such as the International 
Telecommunications Union and the Institute of Electric and 
Electronic Engineers, or government bodies, such as the 
National Institute of Standards and Technology. In a de 
facto standards-setting process, standards for the product 
category are set by market forces. The operating system for 
PCs 1s an example of a product category that initially had 
several de facto standards (e.g., CP/M, MS-DOS). Because 
standards-setting processes, regardless of whether they are 
de jure or de facto, involve considerable uncertainty (e.g., 
compatibility, connectivity) among several firms with fre- 
quently competing interests, we do not expect the 
standards-setting process to affect the probability of the 
emergence of a dominant design, but we expect it to affect 
the time to the emergence of a dominant design (if it 
emerges). 

De jure standards-setting processes, which typically 
include manufacturers, suppliers, and complementors, 
exclude end users and tend to support current or known 
technologies over emergent or new ones (Sirbu and 
Zwimpfer 1985). Because there are many member partici- 
pants in the standards organization with potentially conflict- 
ing interests, it is more difficult to achieve consensus, 
delaying the emergence of a dominant design (David and 
Greenstein 1990). The fax machine standard was set 
through a de jure standards-setting process by several mem- 
bers of the Consultative Committee for International Tele- 
graph and Telephone located in several different countries, 
and it required 23 years for the emergence of a dominant 
design. 

In contrast, de facto standards-setting processes are 
characterized by aggressive market competition without 
intervention by standards bodies. Possible unilateral control 
of de facto standards by one firm, whose design may likely 
be the future dominant design, could also alleviate con- 
sumer uncertainty about a war of attrition between stan- 
dards and shorten the time to the emergence of a dominant 
design (Shapiro and Varian 1999). Consistent with these 
arguments, Farrell and Saloner (1988) find that de facto 
standards-setting processes are faster than de jure 
processes. 


H4,: Compared with a de jure standards-setting process, a de 
facto standards-setting process in a product category 
results in a shorter time to the emergence of a dominant 
design. 


Hadicainess 


Radical products (e.g., instant photography, microwave 
ovens) involve new technologies that offer significant 
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advances in both technology and consumer benefits 
(Chandy and Tellis 2000). Radical products, which offer a 
low initial performance-to-price ratio, face limited market 
acceptance (Christensen 1997; Utterback 1994). The evolu- 
tion of radical products is characterized by high uncertainty, 
and numerous product variants are developed to identify 
viable technological trajectories and consumer preferences 
(Sorenson 2000). Not surprisingly, a radical product is 
refined by several firms in sequential iterations into a 
“market-ready” form (Basalla 1988). The limited market 
acceptance for rudimentary versions of the radical product 
may dampen selection pressures for a dominant design’s 
emergence, thus making it less likely to emerge (Tushman 
and Murmann 1998). For example, the microwave oven, 
which Raytheon developed in 1946 and Tappan commer- 
cialized in 1955, was a radical product that faced poor ini- 
tial market acceptance. Independent development efforts by 
several U.S. and Japanese firms over 20 years were neces- 
sary before the introduction of market-worthy designs. 

The evolution of a radical product occurs over several 
years and involves the sequential development of several 
technology platforms (Rosenberg 1994). The development 
of variants, each with different marketplace support, leads 
to local, idiosyncratic market niches with independent prod- 
uct development efforts, thus delaying the emergence of a 
dominant design. Thus, we expect a negative effect of radi- 
calness on the probability of the emergence of a dominant 
design and a positive effect on its time of emergence. 


Hse. The greater the radicalness of the product category, the 
less probable is the emergence of a dominant design. 

Hs: The greater the radicalness of the product category, the 
longer is the time to the emergence of a dominant design. 


R&D Intensity 


Product categories differ in their R&D intensity, that is, the 
depth and breadth of knowledge required to design and 
commercialize a product (John, Weiss, and Dutta 1999). In 
R&D-intensive product categories, widespread integration 
of interdisciplinary technologies across diverse firms may 
be necessary to achieve the technology fusion that is critical 
for product development (Iansiti and West 1997). Thus, a 
dominant design can provide a clear pathway in what may 
otherwise be a noisy and confusing technological 
environment. 

Product categories that are R&D intensive are character- 
ized by high levels of knowledge spillovers, not only across 
firms in the category but also across related categories (Tsai 
2005). Knowledge spillovers in such product categories 
spur advances in multiple market niches, increasing product 
diversity, which in turn creates the variation that is crucial 
for the emergence of a dominant design (Wilson, Weiss, and 
John 1990). Moreover, product categories with higher R&D 
intensity also experience frequent technological changes, 
resulting in rivalries between alternative designs (Adam, 
Trajtenberg, and Fogarty 2000; Lotz 1998), which increases 
the selection pressures for the emergence of a dominant 
design. However, given the noisy and confusing technology 
environment in R&D-intensive product categories that per- 
sist even as the dominant design emerges, we do not expect 
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R&D intensity to affect the time of emergence of the domi- 
nant design. 


Hg: The greater the R&D intensity of the product category, 
the more likely 1s the emergence of a dominant design. 


Table 2 summarizes the hypotheses for the probability and 
the time of the emergence of a dominant design. 


Method 


Split-Population Hazard Model 


To allow for the possibility that a dominant design may 
never emerge, we use a split-population hazard model, 
which segments the population into two subpopulations, 
one that will experience the event (the emergence of a 
dominant design) and one that will never experience it. The 
model simultaneously estimates coefficients for both the 
probability that the event will ever occur and the event's 
time, conditional on its occurrence. The split-population 
approach has been typically applied to problems of criminal 
recidivism (e.g., Schmidt and Witte 1989). In marketing, 
Sinha and Chandrashekaran (1992) use a split-population 
model of the determinants of banks' technology adoption, 
allowing some banks to be eventual nonadopters.? 

For each of the N product categories i, we observe the 
duration t, for products for which a dominant design 
occurred in the interval [0, t] and the vector X, of covari- 
ates. Let C, be an indicator that denotes whether a dominant 
design occurred in (0, T) for the ith product category; C, = 1 
if the observation is complete, and C, = O if it is censored. 
The data are characterized by Type I censoring (right cen- 
soring) so that each product category has a fixed censoring 
time t > 0; t, is observed if t, < t; otherwise, t, > t. Thus, 
P(C, = 1, t = t) = f(t) for the completed observations, and 
P(C, = 0) = P(t, > t[X,) = 1 — F(t) = S(t) for the censored 
observations. The log-likelihood function is 


N à 
a) LL = V (C, inff(t,)] + (L - CaSO) 


-— 


1=l[ 


Let A= {... A, ...} be an unobserved vector, where A, = 
1 if the ith product category witnesses the emergence of a 
dominant design and A, = 0 if it will never witness a domi- 
nant design. We specify the probability of a dominant 
design’s emergence as a function of the independent vari- 
ables X,: 6(X,) = P(A, = 1) = 1/[1 + exp(aX,)], and P(A, = 
0) = 1 — 6(X,). For product categories in which the domi- 
nant design emerges (C, = 1), we also observe the time of its 
emergence, whereas when a dominant design does not 
emerge (C, — 0), either a dominant design will never occur 
(i.e., A, = 0) or a dominant design will eventually emerge, 
but beyond the observation period, in which case t » t. 
Thus, if a dominant design emerges during the observation 


period, 


2For details on the statistics for the split-population model, see 
Schmidt and Witte (1989); for an application in a marketing con- 
text, see Sinha and Chandrashekaran (1992). 


TABLE 2 


Summary of Hypotheses and Results 


Probability of Emergence of 


Time of Emergence of 
a Dominant Design 





Varlable a Dominant Design 

Appropriability H,,: The weaker the appropriability of the 
product category, the more probable Is the 

emergence of a dominant design. 
(Supported) 

Network effects Həə: The stronger the network effect in the 

product category, the less (more) probable 
is the emergence of a dominant design. 
(Less likely supported) 
Value net — 
Standards-setting process — 


Radicalness of Innovation 


Hase: The greater the radicalness of the 
product category, the less probable is the 
emergence of a dominant design. 


(Supported) 


Ht The weaker the appropriability of the 
product category, the shorter is the time to 
the emergence of a dominant design. 
(Supported) 


Ha The stronger the network effect in the 
product category, the longer (shorter) is 
the time to the emergence of a domlnant 
design. (Not supported) 


Ha: The greater the number of firms in the 
value net of the product category, the 
shorter is the time to the emergence of a 
dominant deslgn. (Supported) 


H4: Compared with a de Jure standards- 
setting process, a de facto standards- 
setting process In a product category 

results in a shorter time to the emergence 
of a dominant design. (Supported) 


Hs: The greater the radicalness of the 
product category, the longer is the time to 
the emergence of a dominant design. 
(Supported) 


R&D Intensity Hege: The greater the R&D intensity of the 
product category, the more probable is the 
emergence of a dominant design. 
(Supported) 
(2) P(C, 21, t= t) = P(A, = Df(tIX,, A, = 1) 


= 6(X, )f(t,1X,, A, = 1), 

and for censored observations, 

(3 PC, = 0) =1-8(X,) + &(X))l - FIX, A, = DI 
= 1 — 6(X,) + 6(X, )S(7). 


The log-likelihood function in Equation 1, when expressed 
as a function of the covariates X,, 1s 


N 
(4) LL = V (C, [bX pft IX, A, = 0] 


i=] 


+ (1 — Ci)In[1 — &(X,) + êX, S). 


To estimate the effects of the covariates on the probabil- 
ity and the time of the emergence of a dominant design, we 
used the accelerated failure time specification, exploring 
alternative distribution functions f(-) for the base hazard. We 
implement the model within LIMDEP 8.0 (Greene 2002). 


Data 


We identified two product categories, office products and 
consumer durables, which have been extensively studied in 
prior research on market evolution (e.g., Agarwal and 


Bayus 2002; Golder and Tellis 1997). We used the historical 
method to collect data on the products (Golder 2000). We 
obtained information on the product categories from articles 
published in scholarly journals, books, and online business 
databases. We were able to collect complete information for 
63 product categories (see Appendix A). All the data are 
from public sources. 


Measures 


Emergence and time of dominant designs. As we dis- 
cussed previously, before the emergence of a dominant 
design, there are several technological changes in the prod- 
uct category. However, after a dominant design emerges, 
competition moves from competition between alternative 
product designs to competition within the dominant design. 
Guided by the distinctive characteristics surrounding the 
emergence of a dominant design, strategy researchers 
(Christensen, Suaréz, and Utterback 1998) have retrospec- 
tively identified a dominant design on the basis of when one 
design achieves market acceptance over other competing 
designs. We use this method to identify the emergence of 
dominant designs. 

Three graduate students used archival sources (e.g., 
Consumer Reports, trade magazines, computing magazines) 
to develop histories of the evolution of product categories 
(for brief histories of two such categories, see Appendix B). 
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We then used two other graduate students to identify 
whether and when a dominant design emerged. 

Following the convention for survival models (Lawless 
2003), we used two variables to code the emergence of a 
dominant design. The first denotes whether a dominant 
design emerged in the product category. If a dominant 
design emerged, the variable was coded as 1, but if a single 
design specification was not favored by the market (e.g., 
video game consoles, camcorders), the variable was coded 
as 0. We observed dominant designs in 30 product cate- 
gories. For the 33 product categories in which a dominant 
design had not yet emerged in 2003, the data were right 
censored. We define the second variable as the number of 
years between the introduction of the product category and 
the emergence of a dominant design. There were discrepan- 
cies between the two raters for 3 of the 30 product cate- 
gories for the time measure, which we resolved using inputs 
from industry experts. 

Given its centrality in our study, we used two checks of 
the measure of a dominant design. First, we used 11 indus- 
try experts to verify the year of the dominant design's emer- 
gence obtained from archival sources. The experts included 
five editors of trade journals (e.g., Disk Drive Report, Com- 
puter Reseller News, Future Image Report), two representa- 
tives of trade organizations (Consumer Electronics Associa- 
tion, DVD Forum), and four executives (e.g., Gartner 
Group, IDC). In most cases (26 of the 30 product categories 
in which a dominant design emerged), there was unanimous 
agreement between student raters and industry experts. In 
four product categories (fax machine, floppy drive, main- 
frame computer, and microprocessor chip), the dates for the 
dominant design's emergence differed by, at most, two 
years, which we reconciled by consulting with the industry 
experts. 

Second, following the work of Anderson and Tushman 
(1990), we validated the measure of a dominant design with 
an objective measure of its market share performance. We 
obtained archival measures of market share for the domi- 
nant design for 24 of the 30 products in which a dominant 
design emerged. Using Anderson and Tushman's method, 
we coded the year in which the dominant design's market 
share exceeded 50% for four consecutive years. For these 
24 products, the dominant design's time measure correlates 
highly (p = .86) with the time to the 50% market share 
threshold, reassuring us of the reliability of our measure. 

Because there are no objective measures for appropri- 
ability, network effects, value net, and radicalness, we used 
raters to develop measures for these product characteristics. 
When feasible, we validated these measures with data from 
secondary sources. 

Appropriability. Levin and colleagues (1987) conceptu- 
alize a product's appropriability as protection of R&D 
advantages for firms in the product category, covering the 
following six aspects: (1) patents (or copyrights) to prevent 
duplication, (2) patents (or copyrights) to secure royalties, 
(3) secrecy of product development effort, (4) lead time for 
product development, (5) speed in moving down the learn- 
ing curve in product development, and (6) sales and service 
support. Because the sixth aspect is not related to innova- 
tion, we do not include it in our measure of appropriability. 
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We used two groups of expert raters to measure appro- 
priability: academic experts (six professors from three busi- 
ness schools who are recognized experts on organizational 
innovation) and industry experts (two sets). Following the 
work of Levin and colleagues (1987), we asked the experts 
to rate (on a seven-point scale) the products on appropri- 
ability. We averaged the ratings separately for the two 
groups of raters. The higher a product category's rating on 
the scale, the weaker is its appropriability. 


Network effects. Following the work of Srinivasan, 
Lilien, and Rangaswamy (2004), we used raters to measure 
a product's network effects as a continuous, two- 
dimensional variable, representing both direct and indirect 
effects. Again, we used two groups of raters (different from 
the raters used for the other measures): academic experts 
(four professors at two business schools who are recognized 
experts on organizational innovation) and MBA students 
(25 students with an average of three years' work experi- 
ence). We asked the raters to rate direct and indirect net- 
work effects separately for each product category on a 
seven-point scale, ranging from 1 (“no effects") to 7 (“very 
strong effects"). We computed network effects for each 
product from each rater by adding the scores for direct and 
indirect network effects and averaging separately for each 
group of raters. 


Radicalness. We used Chandy and Tellis's (2000) radi- 
calness measure, which has two dimensions: whether a new 
product incorporates a substantially different core technol- 
ogy (technology radicalness) and whether a new product 
provides substantially higher customer benefits relative to 
existing products (benefits radicalness). Both dimensions 
were assessed on nine-point scales. Six academics, all 
experts in organizational innovation, and ten graduate engi- 
neering students provided the radicalness ratings. We pro- 
vided each rater with a brief history of each product cate- 
gory. We obtained the measure of radicalness for each 
product by adding each rater's ratings for technology and 
benefits radicalness and averaging the ratings separately for 
the two groups of raters. 


Number of firms in the value net. Three graduate stu- 
dents, who prepared detailed case histories of the 63 prod- 
ucts, also gathered details on the value net of the product 
category (Amit and Zott 2001). Two other graduate students 
(different from those who identified the dominant design) 
counted the number of firms in the value net. There was 
complete agreement between the two raters for 57 products 
(90% of the cases). The counts for the remaining 6 products 
were resolved by a discussion between the raters and one of 
the authors. 


Standards-setting process. The two graduate student 
raters (those who counted the number of firms in the value 
net) also independently identified whether the standards- 
setting process was de facto (coded as 1) or de jure (coded 
as 0). There was unanimous agreement between the raters 
on the standards-setting process, which was reconfirmed 
with three academic experts. 


R&D intensity. Consistent with prior research (Agarwal 
and Bayus 2002), we measured R&D intensity by the aver- 


age R&D expenditure as a percentage of sales for each 
product category at the three-digit Standard Industrial Clas- 
sification level.3 


Recession at the time of product introduction. We mea- 
sured recessionary environment using a categorical variable 
based on whether there was a recession in the U.S. econ- 
omy in the year of first product introduction. We defined 
recession as two consecutive quarters of decline in gross 
domestic product. 


Ratings from multiple raters. For the three explanatory 
variables of appropriability, network effects, and radical- 
ness, multiple raters rated all the products. There are two 
ways that ratings from multiple raters may bias the esti- 
mates of the model. First, there is the potential for differ- 
ences in the reliability of ratings across the raters (Van 
Bruggen, Lilien, and Kacker 2002). To account for this bias, 
we computed a weighted mean of the raters’ ratings; the 
weight assigned to a rater was the reciprocal of the rater’s 
absolute distance from the unweighted mean rating com- 
pared with other raters, as Van Bruggen, Lilien, and Kacker 
(2002) propose. Second, there may be a measurement error 
bias because the rating instrument may have induced each 
rater to provide a rating that differs from its “true” value. 

We report results using the rater-bias adjusted ratings 
from academic experts, which accounts for differences in 
reliability of the ratings from multiple raters. As we report 
subsequently, we test for robustness of these estimates 
using ratings obtained from the other raters. In addition, we 
also implement a simulation procedure that examines the 
robustness of the parameter estimates to potential measure- 
ment errors. 


3We obtained data for the R&D intensity from the Web 
(National Science Foundation, Division of Science Resources 
Studies 1999). 


Results 


Descriptive Statistics 


In 33 (52%) of the 63 product categories, no dominant 
design emerged by 2003. The average time to the dominant 
design’s emergence for the 30 product categories in which a 
dominant design emerged was 6.50 years. Table 3 contains 
the descriptive statistics and correlation matrix of the vari- 
ables. Correlations are within acceptable limits (highest = 
.56, between network effects and the size of the value net), 
and the variance inflation factors were lower than ten (aver- 
age = 1.48; maximum = 1.94), suggesting that multi- 
collinearity is not a threat to the validity of the findings 
(Belsley, Kuh, and Welsch 1980). 


Model Selection and Results 


We estimated the model in Equation 5 using three distribu- 
tion functions (lognormal, log-logistic, and Weibull) that 
accommodate a changing hazard rate. Although the general 
pattern of results is similar across the models, the model 
estimated with the log-logistic function fits best. We report 
the results for that model in Table 4 (Column 1). The 
parameters of the log-logistic function (A = .23; © = .31) 
suggest that the hazard function is mound shaped, increas- 
ing for the first six years and then decreasing. Although 
dominant designs emerged in only 30 of the 63 product cat- 
egories (48%) during the period of observation, the hazard 
model suggests that, on average, there is a 60% chance that 
a dominant design would eventually emerge in the office 
products and consumer durables categories (see last row of 
Table 4). 

We first discuss the effects of the control variables. The 
year of product introduction has no effect on the probability 
of emergence (b = 5.63, not significant [n.s.]) or on the time 
of emergence of a dominant design (b = 4.30, n.s.), whereas 
the presence of a recessionary environment (b = .70, p < 
.01) delays the emergence of a dominant design. In general, 
the results support the hypotheses. With respect to the prob- 


TABLE 3 
Descriptive Statistics and Correlation Matrix of Key Variables (n = 63) 





Range of 

Varlable Variable M (SD) 
1. Time to the 

emergence of a 

dominant design 

(n = 30) 2-20 6.50 (4.96) 
2. Appropriabiltty 1-7 3.20 (1.94) 
3. Network effects 2-14 8.52 (3.12) 
4. Number of firms 

in the value net 1-5 2.21 (1.10) 
5. Standards-sething 0—1 54—de facto 

process 85% 
6. Radicalness 2-18 11.89 (2.11) 
7. R&D Intensity 1.67—9.00 7.40 (2.76) 
*p «.10 
**p « .05 
p< .01 


2 3 4 5 6 7 
—Aa 
1.00 
—.46** 1.00 
—.18 56*** 1.00 
—.04 —.24* —. i7 1.00 
11 —.04 —.09 —.11 1.00 
—.03 —.17 —.05 .50*** 18 1.00 


aCorrelations with the left-censored duration variable are not meaningful, and therefore we do not report them. 
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ability of the emergence of a dominant design in a product 
category, as we hypothesized in H,,, we find that appropri- 
ability has a positive effect (b = 1.51, p < .01), suggesting 
that the weaker the appropriability, the more likely is the 
emergence of a dominant design. As we hypothesized in 
Hee, the greater the R&D intensity, the more likely is the 
emergence of a dominant design, though we obtain only 
marginal significance for this result (b = .81, p < .10). Con- 
versely, as we hypothesized in H», and Hse, respectively, 
network effects (b = —.60, p < .05) and radicalness (b = 
—1.47, p « .05) negatively affect the probability of emer- 
gence of a dominant design. 

With respect to the time of emergence, as we hypothe- 
sized in Hip H4, and H4, respectively, weak appropriability 
(b = —21, p < .05), the number of firms in the value net of 
the product (b = —35, p < .01), and a de facto standards- 
setting process (b = —.88, p < .01) hasten the emergence of a 
dominant design. As we hypothesized in Hs, radicalness 
(b = .28, p < .01) delays the emergence of a dominant 
design. We find no support for Hy, for the effect of network 
effects on the time of the emergence of a dominant design 
(b = .08, n.s.). 

Next, we performed two model comparisons to examine 
the explanatory power of the split-population hazard model. 
First, we compared the proposed split-population hazard 
model with all explanatory variables for both the probabil- 
ity and the time of emergence with a baseline model that 
excluded the explanatory variables. À likelihood ratio test 
of the difference between the model with no explanatory 
variables and the proposed model (y? = 51.92, d.f. = 12, p< 
.01) indicated that the proposed split hazard log-logistic 
model outperformed this baseline model. Second, we com- 
pared the proposed model with a single-population model 
(i.e., incorporating only time; see Column 2 of Table 4). 
The proposed complete model outperformed the single- 
population model (x? = 49.14, d.f. = 5, p < .01). In addition, 
the parameter estimates change sign, and only appropriabil- 
ity has a significant, though opposite, effect in the single- 
population model. Such changes in sign may be expected 
when a single-population model is a misspecification of an 
underlying split population (Sinha and Chandrashekaran 
1992). 


Robustness Tests 


Model specification. We performed additional analyses 
to examine the robustness of the model specification. First, 
we examined the effect of stepwise addition of variables to 
our models (though we do not report the results here). For 
both the probability and the time submodels, the coeffi- 
cients are consistent in terms of their signs in the stepwise 
models with those we report in Table 4, reconfirming that 
multicollinearity is not a threat to the validity of our find- 
ings. Second, we tested for the effects of two additional 
control variables (both dummy coded): whether the candi- 
date designs in the product category had a dominant firm 
sponsor, which we measured by sales revenue, at the time 
of product introduction and whether the candidate designs 
in the product category were backward compatible with a 
previous generation product, as judged by two graduate stu- 
dent raters. The analysis indicated that these two variables 


affected neither the probability nor the time of the emer- 
gence of a dominant design, supporting the robustness of 
the model specification. 


Censoring date. As we discussed previously, the censor- 
ing mechanism here is Type I, for which the maximum like- 
lihood estimation of the hazard function provides consistent 
estimates (see Lawless 2003). To explore whether the 
results are sensitive to sampling variations due to the use of 
2003 as the censoring date, we reestimated the model with 
alternative censoring dates: 2002, 2001, 2000, 1999, and 
1998. We obtained consistent results, though, as may be 
expected, the significance levels of the parameter estimates 
varied across the different censoring dates. We report the 
results for the censoring dates of 2002 and 1998 in Columns 
3 and 4 of Table 4. 


Sampling variations. Although the sample size com- 
pares well with prior research, it is small; therefore, we per- 
formed a bootstrap analysis to examine the sensitivity of the 
results to sampling variations. The bootstrapping analysis 
(Column 5 of Table 4) indicated that the results were robust 
to sampling variations. 


Use of multiple raters. To cross-validate the estimates 
obtained from the ratings of appropriability, network 
Effects, and radicalness provided by the academic raters, we 
reestimated the model using the rater-bias adjusted ratings 
obtained from students and industry experts. These esti- 
mates were consistent with those we report in Table 4. 

Effect of measurement errors. To assess the effects of 
measurement errors in ratings, we conducted a simulation 
(with 30 randomly generated samples) by assuming that the 
average ratings provided by the raters across different prod- 
ucts are normally distributed in the population with mean 
and variance equal to the overall mean and variance (across 
all raters), respectively (i.e., X ~ N[uy, Oy]), for each vari- 
able measured with error.* For the simulations, we set uyg 
equal to its observed mean in the sample, and oy = s (k — 
1), where så is the observed variance in the sample and k is 
the number of raters. For each variable and for each rater, 
we drew a random number from N(0, Oy) and added that 
value to the rating of each product provided by that rater. 
We then computed the combined ratings across the raters as 
Van Bruggen, Lilien, and Kacker (2002) propose. We reesti- 
mated the split hazard model, once for each of the 30 ran- 
dom data sets generated, to obtain the mean and standard 
deviation of the estimates across the 30 simulations. The 
reestimated models provided results that were consistent 
with those we report in Table 4 (further details of the simu- 
lation procedure are available on request). 


Direct and indirect network effects. We also separately 
examined the impact of direct and indirect network effects 
on the probability and time of emergence of a dominant 
design (though we do not report it here). The hypothesized 


4We pursued this approach because there 1s currently no avail- 
able theory for the assessment of measurement error biases in 
continuous-time accelerated failure time models with multiple 
constructs, multiple populations, and right censoring (Augustin 
2005) 
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model (Bayesian information criterion value = 143.49) out- 
performed the disaggregated network effects model 
(Bayesian information criterion value = 151.83). In the dis- 
aggregated model, direct network effects have a weak effect 
(p < .01) on the probability and time of emergence of a 
dominant design, whereas indirect network effects have no 
effect on either. We conjecture that our inability to account 
separately for the direct and indirect network effects on the 
emergence of dominant design may be because both direct 
and indirect network effects coexist (p = .25, p « .01) and 
jointly determine the dynamics of dominant design 
emergence. 


Predictive Validity 


Emergence of dominant design. Because the data are 
right censored at 2003, we may not observe the emergence 
of a dominant design in a product category during the 
observation period, because (1) a dominant design may 
never emerge in that category or (2) a dominant design may 
emerge after 2003. We selected a subset of product cate- 
gories introduced in or before 1984 (n = 42) and used the 
model to predict the probability of emergence of the domi- 
nant design. The cutoff date of 1984 provides 20 years of 
observations after product introduction, or three standard 
deviations beyond the average time for dominant design 
emergence that we observed in our data. Thus, we can be 
reasonably certain that a dominant design is unlikely to 
emerge in those product categories after 2003. 

The split-population log-logistic hazard model predicts 
(1) the emergence of a dominant design for 14 categories, 
12 of which had witnessed the emergence of a dominant 
design by 2003, and (2) no emergence of a dominant design 
for 28 categories, 18 of which had not witnessed the emer- 
gence of a dominant design, resulting in a false-positive rate 
of 1496 and a false-negative rate of 3696. By means of the 
criterion that Morrison (1969) advocates for evaluating the 
predictive performance of discriminant analysis models, the 
overall proportion of correct classification is .71, which 
compares favorably with the proportional chance Coro of 
0; Coro = [o]? + [1 — o]?, where [a] is the proportion of 
categories for which a dominant design has actually 
emerged and [1 — œ] is the proportion for which a dominant 
design has not emerged. A chi-square test of misclassifica- 
tion indicates that the proposed split hazard model predicts 
dominant design emergence well (y? = 12.92, d.f. = 1, p < 
.01). We obtained similar results in the predictions of the 
emergence of a dominant design when we used alternative 
cutoff years of 1981 (x2 = 13.17, d.f. = 1, p < .01) and 1987 
(x2 = 10.32, d.f. = 1, p < .01). 


Time of emergence of a dominant design. 'To evaluate 
the model’s ability to predict the time of emergence of a 
dominant design, we applied jackknifing, using data for the 
30 products for which there was a dominant design. We 
held out one target product, reestimated the model on the 
other 29 products, and then used the estimated parameters 
to predict the emergence of a dominant design for the 
remaining products. We computed the mean absolute devia- 
tion (MAD), defined as (I/D)27 -= [D — D,,|, where D,a D 


1p? 1p» 
and n denote the actual time to the emergence of a dominant 
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design for product i, the predicted time of the emergence of 
a dominant design for product i, and the number of products 
(n = 30), respectively. 

The MAD using the predicted model is 2.49 years com- 
pared with a MAD without the model of 3.43 years, an 
improvement of 27%. A post hoc examination of absolute 
deviations indicates that elimination of two products (color 
television and audiocassette player) reduces the MAD for 
the predicted model to 2.20 years compared with a MAD 
without the model of 3.40 years, an improvement of 35%. 
Our small sample size precludes the determination of why 
these two products have such high leverage on model per- 
formance. Overall, the two predictive validity tests demon- 
strate that the split-population hazard model predicts both 
the probability and the time of the emergence of a dominant 
design well. 


Discussion 


We investigated the emergence and the time of emergence 
of a dominant design in a product category as a function of 
its product-market characteristics, allowing for the possibil- 
ity that a dominant design may never emerge in some prod- 
uct categories. The model specifies the effects of product- 
market characteristics both on the probability of the 
emergence of a dominant design in a product category and 
on the time of the emergence of a dominant design, condi- 
tional on its emergence. 


Theoretical Contributions 


Our findings contribute to the literature in marketing that 
explores product and market evolution. The findings sug- 
gest that a split-population model that allows for the non- 
emergence of a dominant design in some product categories 
offers a better way to model the factors that influence the 
emergence of dominant designs than a single-population 
hazard model. The split-population model can be extended 
to other product and market evolution processes, such as 
product takeoff, to account for the possibility that some 
products may never take off (e.g., Sony minidisk, quadra- 
phonic sound). 

We find that dominant designs may never emerge in 
some product categories, a result that runs counter to an 
Implicit assumption in the existing literature that a domi- 
nant design will eventually emerge in all categories. A prod- 
uct category’s characteristics differentially influence the 
probability and time of emergence of a dominant design. 
The facilitating role of weak appropriability on both the 
probability and the time of emergence of a dominant design 
suggests that appropriability has opposing effects on firms’ 
quests for securing rents from innovations (Teece 1986). On 
the one hand, weak appropriability results in a loss of rents 
because other firms can imitate the innovator’s design. On 
the other hand, weak appropriability is associated with a 
greater probability of emergence and the earlier emergence 
of a dominant design, increasing rents for all firms in the 
market. The negative effect of network effects on the proba- 
bility of the emergence of a dominant design reinforces 
recent empirical research (e.g., Goldenberg, Libai, and 


Muller 2002; Srinivasan, Lilien, and Rangaswamy 2004) 
that suggests that for office products and consumer 
durables, the excess inertia of network effects appears to 
outweigh its installed base effects. 

The negative effect of the number of firms in the value 
net on the time to the emergence of a dominant design sug- 
gests that if there are more firms appropriating rents in a 
product category, a dominant design is likely to emerge 
sooner. These findings are somewhat counter to other evi- 
dence on the adverse effects of coordination costs of 
interorganizational relationships. For the product categories 
in this study, the added rents from the value net appear to 
outweigh the added transaction costs, thus hastening the 
emergence of a dominant design. 

Likewise, the results indicate that a dominant design is 
likely to emerge sooner in product categories in which the 
standards-setting process is de facto (versus de jure). This is 
consistent with prior theoretical work in economics (e.g., 
Farrell and Saloner 1988) and points to the efficiency of 
market mechanisms in forcing a quicker emergence of a 
dominant design. 

Dominant designs are less likely to occur and, if they 
occur, take longer to emerge for more radical product cate- 
gories. However, the results for R&D intensity suggest that 
dominant designs are more likely to occur in R&D- 
intensive categories; the higher investments in the product 
categories perhaps provide the necessary variation in 
designs, which in turn create pressures for the selection of a 
dominant design. 


Managerial Contributions 


The study’s findings have some clear implications for man- 
agers. First, the results suggest that managers should realize 
that a dominant design may never emerge in many product 
categories, especially those with tight appropriability, 
strong network effects, and high product radicalness. Some 
of these characteristics are strategic decision variables. For 
example, weak appropriability significantly influences both 
the probability and the time of emergence of a dominant 
design. In addition, market-based de facto standards-setting 
processes (versus de jure standards-setting processes) 
appear to be efficient for the timely emergence of dominant 
designs. Managers who wish to increase the probability of 
their product becoming a dominant design and/or speed up 
the emergence of dominant designs might consider strate- 
gies to weaken appropriability and also encourage de facto 
standards-setting processes. For example, IBM and Intel’s 
decision to share know-how for blade servers may be an 
attempt to hasten the emergence of a dominant design using 


a de facto standards-setting process (The Wall Street Jour- 
nal 2004). 

Second, the model can be used to predict the probability 
and tume of emergence of dominant designs. These predic- 
tions can be useful to managers in the ex ante planning of 
marketing strategies. For example, of the products in our 
data set that have not yet had dominant designs emerge, the 
product category with the highest probability is satellite 
radio (p = .86), and the product category with the least 
likely probability is instant messenger (p = .01). It will be 
interesting to observe whether these predictions turn out to 
be accurate. 


Limitations and Further Research 


This study has some limitations that present opportunities 
for further research. We primarily focused on the time to the 
emergence of the initial dominant design in a new product 
category. Some researchers (e.g., Anderson and Tushman 
1990; Tushman and Anderson 1986) have conceptualized 
the emergence of dominant designs as a sequence of tech- 
nological discontinuities that result in several dominant 
designs. Indeed, in some product categories (e.g., operating 
systems for PCs, spreadsheet software), the initial dominant 
design was replaced by a subsequent design. To extend the 
findings from this study, further research could address the 
emergence of multiple, sequential dominant designs, per- 
haps through a repeated-events, split-population hazard 
modeling approach. 

Given our focus on the development of a cross-product 
category model in which objective measures of some vari- 
ables were not available, we used retrospective measures for 
some constructs, obtaining them from knowledgeable 
raters. With improved record keeping, future researchers 
could use measures that are not subject to the possible hind- 
sight biases and measurement errors associated with these 
measures. Alternatively, researchers could consider focus- 
ing on a specific product category (e.g., Shankar and Bayus 
2003), which may enable the use of objective measures 
within the product category. 

The emergence of a dominant design may also be influ- 
enced by the strategic characteristics and actions of firms, 
including their collateral assets (Teece 1986), product port- 
folios, and strategic maneuvering (Cusumano, Mylonadis, 
and Rosenbloom 1992). Further research could extend our 
study by incorporating such firm-level factors. 

We hope that we have contributed in some distinct way 
to the theory base that helps explain the emergence of domi- 
nant designs. We also hope that our approach and modeling 
framework are helpful for managers who need to devise 
strategies that reduce the uncertainty about the dynamics of 
evolution of markets for technology products. 
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Product Categories (n = 63) 


APPENDIX A 





Product Category 


1. 3.5” floppy drive 
2. AM stereo 
3. Answering machine 
4. Antivirus software 
5. Audiocassette player 
6. Automatic teller machine 
7. Cable modem 
8. CAD software 
9. Camcorder 

10. Camera phone 

11. CD player 


12 


49. 
. Portable file document software 
. Presentation graphic software 

. Projection television 


53 


54. 
55. 


SUSHESSZRSSSISREBR 


. CD-ROM drive 


Cellular phone 
Color television 


13 
14 
15. Cordless telephone 

16. Database software 

17. Desktop publishing software 
18. Dial-up modem 56 K 

19. Digrtal camera 

20. Dot matrix printer 

21. DSL modem 

22. 
23 
24 
25 
26 
27 
28 
29 
30 


DVD player 


. Electric toothbrush 

. Fax machine 

. File-zipping software 
. Flash memory 

. Flatbed scanners 

. Food processor 

. Graphics software 

. HDTV 

31. 
. Inkjet printer 


Home microwave oven 


Instant messenger 


. Instant photography 


Internet service providers 


. Internet telephony 


Laser printer 
Mainframe computer 
Microprocessor chip 
Mini audio disc 


. Notebook computer 
. OCR software 
. Operating system for PCs 


Pager 
PC 


. Personal digital assistant 


Personal finance software 
Photocopiers 
Pocket calculator 


Satellite radio 
Single-use camera 
Spreadsheet software 


56. Super audio CDs 
57. Video game console 


58. 


Home video recorders 


59. Web camera 
60. Web server software 
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Year of 
Product 
Introduction 


1979 
1982 
1962 
1982 
1962 
1967 
1995 
1982 
1984 
2000 
1982 
1983 
1979 
1951 
1975 
1981 
1984 
1979 
1991 
1964 
1996 
1996 
1960 
1960 
1986 
1988 
1978 
1972 
1990 
1987 
1955 
1984 
1998 
1948 
1980 
1997 
1984 
1946 
1971 
1992 
1980 
1974 
1977 
1974 
1975 
1993 
1983 
1950 
1971 
1993 
1986 
1973 
1998 
1986 
1979 
1999 
1971 
1975 
1991 
1995 


Time to the 


Emergence of 
a Dominant 
Design 


1984 
1986 


1969 
1998 


Specificatlons of 
a Dominant Design 


Sony's design 
Motorola's C-Quam System ' 


Philips’s design 
DOCSIS specifications 


Philips-Sony's design 
Sony's design 


National Television System Committee 


Server Query Language (SQL) 
Adobe Pagemaker 
56KFlex 


ESC/P from Epson 
G.Lite 
DVD design from Zenith 


all 


Adobe Photoshop 
Standard Definition Televislon 


Polaroid 


IBM 360 
Intel 4004 


MS DOS 


IBM PC 
Intuit 
Xerox 914 


Adobe PDF 
Harvard Presentation Graphics 


Lotus 1-2-3 


JVC VHS 
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Continued 
Time to the 

Year of Emergence of 

Product a Dominant Specifications of 
Product Category Introduction Design a Dominant Design 
61. Word-processing software 1979 1983 Wordstar 
62. Work station 1980 1986 Sun's Unix 
63. Zip drives 1995 1997 lomega 





Appendix B 
Brief Histories of Two Product 
Categories 


Camcorders (Dominant Design Did Not Emerge) 


In January 1982, several leading consumer electronics firms 
announced a universal standard for camcorders, the 8 mm 
video format. In March 1983, 127 video-related companies 
agreed on standards to ensure compatibility of all 8 mm 
camcorders. Ten months later, Eastman Kodak Company 
introduced the world's first 8 mm camcorder in January 
1984, followed by Sony in the fall of 1984. 

In January 1985, several Japanese firms, including 
Canon, Fuji, Pioneer Electronic, Kyocera, Aiwa, Sanyo, and 
Ricoh, licensed Sony's 8 mm camcorder technology. In 
February 1985, JVC launched a rival VHS camcorder for- 
mat, compatible with its VHS home video recorders but 
incompatible with Sony's 8 mm camcorders. JVC's VHS-C 
format received a big boost when several firms, including 
Matsushita, Toshiba, Hitachi, Mitsubishi, Philips, and 
Sharp, licensed the VHS-C technology from JVC. 

In the period between 1985 and 1990, both the 8 mm 
and the VHS-C formats enjoyed fluctuating popularity in 
the market, and neither format gained market dominance. In 
addition, several other camcorder formats, including super 
VHS, super VHS-C, and digital 8 mm, were introduced in 
the early 1990s, each of which acquired niche market posi- 
tions. Given this sequence of events, a dominant design did 
not emerge in the camcorder market at that time or since 
then, despite several firms having agreed on a standard two 
years before the commercial introduction of camcorders. 


DVD Players (Dominant Design Emerged) 


In an attempt to avoid the VHS-Betamax “format war" of 
the 1970s, in the mid-1990s, consumer electronics manu- 
facturers, led by Sony, Toshiba, and Panasonic, in conjunc- 
tion with movie studios, led by Warner and Columbia (a 
division of Sony), worked together in the DVD consortium. 
The objective of the DVD consortium was to establish the 
DVD standard as an open format. In addition, the DVDs 
were encoded with the Dolby Digital sound process to be 
compatible with virtually all home theater electronics. 

However, the DVD format for DVD players was not 
unchallenged. In September 1997, Circuit City introduced a 
competing format called the Digital Video Express (DIVX), 
which was partially compatible with DVD. DIVX players 
could play all DVDs, but DVD players could not play 
DIVX discs, which were priced on par with DVD discs. 
Early adopters did not know it, but at the time of the DIVX 
announcement, Circuit City had neither the hardware nor 
the software for product demonstration. 

When Circuit City finally launched DIVX in the fall of 
1998, it faced an uphill battle, and studio support for DIVX 
never materialized. By May 1999, approximately two mil- 
lion DVD players had been shipped to retailers, compared 
with approximately 165,000 DIVX players. At the same 
time, there were 3317 DVD movie titles available on the 
DVD format, whereas only 471 titles were available on the 
DIVX format. In June 1999, Circuit City withdrew DIVX 
from the market, and the DVD format emerged as the domi- 
nant design. 
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Franziska Vólckner & Henrik Sattler 


Drivers of Brand Extension Success 


The research presented in this article addresses the issue of the significance and relative importance of the deter- 
minants of extension success by simultaneously investigating ten success factors. The empirical analysis consid- 
ers the direct relationships between success factors and extension success, the structural relationships among 
investigated factors, and moderating effects. The authors find that fit between the parent brand and an extension 
product is the most important driver of brand extension success, followed by marketing support, parent-brand con- 
viction, retailer acceptance, and parent-brand experience. The authors also find several important structural rela- 
tionships among the investigated success factors (e.g., marketing support — fit — retailer acceptance — exten- 
sion success). Finally, the interaction terms of fit with the quality of the parent brand and with parent-brand 
conviction are statistically significant, albeit of relatively low importance. 


rand extensions—that is, use of established brand However, there are at least two key issues that have 
names to launch new products (e.g., use of the name received little or no attention in prior work: First, little is 
Mr. Clean for a new detergent)—represent one of known about the relative importance of the success factors 


the most frequently used branding strategies. Extending in explaining brand extension success because each previ- 
brands both within and beyond the original product cate- ous study investigated the effects of only a small fraction of 


gory is deemed to be profitable because, in general, it is all relevant success factors at one time (usually two to four 
assumed that brands that are already known and recognized factors). Second, previous studies tested only the direct 
require lower new product introduction expenses, such as relationship between brand extension success (dependent 


advertising, trade deals, or price promotions (Collins-Dodd variable) and potential success factors (independent vari- 
and Louviere 1999; Tauber 1988). Nevertheless, the success ables). They did not take into account that some success 
of brand extensions is uncertain. Failure rates of brand factors may constitute dependent variables in other struc- 
extensions in many fast-moving consumer good (FMCG) tural relationships; that is, previous studies did not examine 
product categories are approximately 8096 (Ernst & Young a series of structural relationships. The failure to account 
and ACNielsen 1999; Marketing 2003). Therefore, potential for potential relationships among success factors may cause 


determinants of brand extension success (i.e., success fac- faulty interpretation of the significance and relative impor- 
tors of brand extensions) have emerged as an important tance of the success factors under investigation. 

focus of research inquiry to provide insights that may help Against this background, this article makes two primary 
managers reduce the failure rates of brand extensions (e.g., contributions to brand extension research by presenting a 


Aaker and Keller 1990; Bottomley and Doyle 1996; Dacin large-scale empirical study: First, we address the issue of 
and Smith 1994; Swaminathan, Fox, and Reddy 2001). The the significance and relative importance of the determinants 


findings of previous studies provide important insights into of extension success by simultaneously investigating ten 
the factors that influence brand extension success. In success factors. Second, we apply a structural equation 
particular, they reveal the kinds of success factors that play analysis to test several conceptual models of the determi- 
an important role in the success of an extension product, at nants of extension success. Our analysis considers the direct 
least under certain conditions. For example, despite some relationships between success factors and extension suc- 
mixed results, prior studies show that the quality of the par- cess, the structural relationships among investigated factors, 
ent brand and the fit between the parent brand and extension and moderating effects. Furthermore, an advantage of struc- 
product categories are highly important brand extension tural equation analysis is the ability to account for measure- 
success factors (Bottomley and Holden 2001). Overall, ment errors. The contributions of our study are especially 
approximately 15 determinants of extension success have relevant for marketing practice. To improve brand extension 
been proved significantly relevant (p « .10) in at least one success, it is imperative for managers to know which of the 
empirical study (for a more detailed discussion, see the sub- large number of potentially relevant success factors should 
section "Determinants of Brand Extension Success" in the receive the most attention and how they should allocate 
"Conceptual Framework" section). resources to the relevant factors. 


We consider brand extensions within a comprehensive 
nomological net by developing a conceptual framework for 
Franziska Vólckner is Assistant Professor of Marketing and Management our empirical research that builds on the results of prior 
(e-mail: voelckne @ econ.uni-hamburg.de), and Henrik Sattler is Professor research and the managerial wisdom derived from expert 
of Marketing and Management (e-mail: uni-hamburg @henriksattler.de), judgments. We specify and empirically test four alternative 
Institute of Marketing and Retalling, University of Hamburg. The authors models. The focus of this article is not on theory develop- 
are grateful to John Roberts and the four anonymous JM reviewers for ment; rather, we pursue an empirical approach to studying 
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findings from published research and beliefs of managers 
into a comprehensive model of brand extension success. 

Having outlined the motivation behind this article, in 
the next section, we develop the conceptual framework for 
our empirical research. Then, we describe the research 
design and discuss some methodological issues. We present 
the results of our study and conclude our article by outlin- 
ing the implications of the findings, limitations of the study, 
and opportunities for further research. 


Conceptual Framework 


Overview of the Conceptual Framework 


Our conceptual framework identifies potentially relevant 
success factors and specifies the system of relationships 
among these factors. Although we refer to the various rela- 
tionships as hypotheses, this article focuses on testing the fit 
of alternative models and does not focus on testing various 
specific hypotheses. We opted to use the term “hypotheses” 
for the relationships in the models because it is the common 
term used in the literature. 


Figure 1 provides an overview of our conceptual frame- 
work. The potentially relevant determinants of brand exten- 
sion success can be assigned to four groups: (1) parent- 
brand characteristics, (2) the extension's marketing context, 
(3) the relationship between the parent brand and the exten- 
sion product, and (4) the extension's product category char- 
acteristics. Figure 1 proposes that the success of a brand 
extension is influenced by the direct effects of determinants, 
mediating effects, and moderating effects. The following 
discussion presents the details of the constituent elements 
of our framework. 


Determinants of Brand Extension Success 


This section presents the results of a systematic literature 
review that resulted in identification of a broad variety of 
determinants of extension success.! The review included the 
following journals between 1985 and 2001 (the year of data 


lOur research does not consider the effect of ingredient brand- 
ing strategies that has been investigated in recent empirical studies 
(e.g., Desai and Keller 2002). 


FIGURE 1 
Overvlew of Conceptual Framework 


' Direct Effects — Model 1 


PB Characteristics 
Quality (strength) of the PB 
eHistory of previous BE 

«PB conviction 

«PB experience 


BE's Marketing Context 
eMarketing support 
eHetailer acceptance 


Relationshlp Between the PB 
and the BE 
eFit 


eLinkage of the utility of the PB to 
product attributes of the original 
product category 


BE's Product Category 
Characteristics 
ePerceived rlsk 
«Consumer innovativeness 





Mediating Effects — Model 2 

Derived from Prior Research: 

eQuallty of the PB — retailer acceptance 
eMarketing support — retailer acceptance 
«History of previous BE — PB conviction 
ePB experience — PB conviction 
eMarketing support — fit 


Derived from Managerial Wisdom: 

«PB conviction — quallty of the PB 

eHistory of previous BE quality of the PB 
eHistory of previous BE — retailer acceptance 
Fit > retailer acceptance . 
ePercelved risk —^ consumer innovatlveness 


Moderating Effects — Model 3 

Derived from Prior Research: 

Quality of the PB x fit” 

«Quality of the PB x history of previous BE 
eHistory of previous BE x fit 


Derived from Managerial Wisdom: 
eFIt x PB conviction 
eMarketing support x retailer acceptance 


Notes: PB = parent brand; BE = brand extension Model 1: direct effects; Model 2. direct and mediating effects; Model 3: direct, mediating, and 


moderating effects. 


Drivers of Brand Extension Success / 19 


r 


collection): International Journal of Research in Marketing, 
Journal of Consumer Research, Journal of Marketing, Jour- 
nal of Marketing Research, Management Science, Market- 
ing Letters, and Marketing Science. In addition, we 
included articles that were frequently cited in these journals 
in the review. We limited our review to empirical studies 
that investigate the effect of certain success factors (e.g., 
quality of the parent brand) on some kind of extension suc- 
cess measure (typically, customer evaluations of brand 
extensions linked, for example, to the perceived quality of 
the extension). Most of the studies were based on consumer 
surveys that used hypothetical brand extensions (i.e., exten- 
sions not introduced to the market). Respondents were 
asked to rate independent (i.e., success factors) and depen- 
dent (1.e., success of the extension; e.g., the perceived qual- 
ity of the extension) variables on rating scales. Aaker and 
Keller’s (1990) fundamental study represents a typical 
example. Few studies used market data as dependent vari- 
ables (e.g., market share or stock market value; Lane and 
Jacobson 1995; Reddy, Holak, and Bhat 1994; Smith and 
Park 1992). 

We identified 15 determinants of extension success that 
had been proved significantly (p < .10) relevant in at least 
one empirical study. We excluded the following three deter- 
minants because they had only minor influence on brand 
extension success in prior studies: (1) difficulty in produc- 
ing a product from the extension’s product class (e.g., Bot- 
tomley and Doyle 1996; Bottomley and Holden 2001; 
Nijssen and Bucklin 1998; Sunde and Brodie 1993), (2) 
consumers’ knowledge of the extension’s product class 
(e.g., Smith and Park 1992), and (3) company size (Reddy, 
Holak, and Bhat 1994). Likewise, we excluded the order 
and direction of previous brand extensions (Dawar and 
Anderson 1994) and consumers’ mood (Barone, Miniard, 
and Romeo 2000) because measuring these two factors is 
extremely complex within the scope of a consumer survey. 
Furthermore, on the basis of the results of a comprehensive 
pretest, we found that only a few consumers could correctly 
state the order and direction of previous brand extensions of 
real brands. This finding suggests that the order and direc- 
tion of previous extensions play only a minor role in con- 
sumers’ evaluations of a new brand extension product. 

In summary, the literature review resulted in ten success 
factors that may influence brand extension success. These 
factors are included in our empirical study (see the second 
column of Table 1). Given that we considered the results of 
45 empirical studies that cover 15 years of extensive 
research effort, it seems reasonable to assume that the ten 
identified success factors cover a substantial subset of all 
potentially relevant success factors. The term “potentially 
relevant” seems adequate because there are many examples 
of mixed results for a particular factor (e.g., Dacin and 
Smith [1994] and Reddy, Holak, and Bhat [1994] find 
mixed results on the number of previous brand extensions). 
In other words, a certain factor that was significant in one 
study was not significant in another study. 

In addition to prior work, interviews with brand man- 
agers and researchers can also be useful in broadening the 
scope of knowledge about brand extension evaluations. 
Therefore, we used expert knowledge as an additional infor- 
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mation source to build further confidence in the relevance 
of identified brand extension success factors. There is an 
important tradition in marketing research of incorporating 
judgmentally derived information into marketing models 
(Leeflang and Wittink 2000), especially managerially esti- 
mated parameter values (e.g., Sandor and Wedel 2001). We 
interviewed 21 brand managers and researchers. We set the 
requirement that all experts needed to be highly knowledge- 
able in brand extension issues and in the German FMCG 
market. All brand managers hold senior management posi- 
tions in major German FMCG companies. The researchers 
stem from marketing departments at several German uni- 
versities. We conducted 21 semistructured interviews 
regarding the relevance of success factors and the system of 
relationships among them. During the interviews, we gave 
participants a description of the ten success factors, which 
we identified in the literature review, and asked them to 
evaluate the factors’ relevance. Almost all experts agreed 
that, in general, the ten success factors are relevant for 
brand extension success. Finally, we asked the experts to 
add success factors that might play a major role for brand 
extension success. Most of them did not name any addi- 
tional factors. Within the group of experts that mentioned 
additional determinants of extension success, only one type 
of factor was mentioned more than once, namely, profitabil- 
ity of the market as one of the characteristics of the exten- 
sion’s market. Although it is highly plausible to assume that 
this factor is important for new product success, we decided 
not to explore it in our study, because our analysis focused 
on consumers’ evaluations of brand extensions. We also 
needed to restrict the number of success factors in our study 
to keep the evaluation task manageable for respondents. 

‘Table 1 (Hj—H;9) presents the success factors that we 
considered in our study and their corresponding hypotheses. 
We abridged the original wording of the hypotheses 
because of limited space. We treat multiple measures of the 
same factor (e.g., fit or history of previous extensions) as 
one determinant of extension success. Table 1 quotes only 
one source per hypothesis. The sources listed provide a the- 
oretical foundation for the postulated hypotheses. There- 
fore, we dismissed a detailed discussion of the ten direct 
effects. However, we provide a detailed explanation of the 
mediating and moderating effects because not all of them 
build on the results of prior research. 


Mediating Effects 


Mediating effects derived from prior research. Prior 
research has focused on the relationship between extension 
success and its potential determinants. On the basis of our 
systematic literature review, we identified four studies (see 
Table 1) that investigated the impact of a certain success 
factor (independent variable) on another success factor 
(dependent variable). Nevertheless, the issue of how struc- 
tural relationships among the success factors affect exten- 
sion success is still not clear, because none of the studies 
considered extension success a component of the estimated 
model. 

To begin with, retailers' utility for brand names is based 
on associations, such as retail performance expectations and 
positioning characteristics. Strong brands have more posi- 


TABLE 1 


Hypotheses Regarding Success Factors of Brand Extensions 


Main Effects: Direct Effects of the Success 


Factors 


Parent-Brand Characteristics 


H; Quality (strength) of the parent brand 
Hə History of previous brand extensions 


H4 Parent-brand conviction 


H4 Parent-brand experience 


Extension’s Marketing Context 
Hs Marketing support 


Hg Retailer acceptance 


Relatlonship of Parent Brand to Extension 


Product 
H7 Fit between parent brand and 
extension product 


Hg Linkage of the utility of the parent 
brand to product attributes of the 
original product category 


Extension's Product Category 
Characteristics 
Hg Perceived risk 
Hio Consumer Innovativeness 


Expected Sign Source 
Mediating Effects: Dependence Relationships Between Success 
Factors 
H,, Quality of the parent brand — retailer acceptance + Collins-Dodd and Louviere (1999) 
H4? Marketing support — retailer acceptance + Collins-Dodd and Louviere (1999) 
H43 History of previous brand extensions — parent-brand 
conviction + DelVecchlo (2000) .— 
H,4 Parent-brand experience > parent-brand conviction + Kirmani, Sood, and Bridges (1999) 
H4s Marketing support fit between parent brand and extension 
product 4 Klink and Smith (2001) 
Moderating Effects 
Ha. Quality of the parent brand x fit + Aaker and Keller (1990) 
Hə Quality of the parent brand x history of previous brand 
extensions + Keller and Aaker (1 992b) 
Hə, History of previous brand extensions x fit + Dacin and Smith (1994) 





A Brand Extension Is More Successful ... 


If the quality of the parent brand is high. 
If the history of previous brand extensions 
is successful. For example, 
eHigh number of previous brand 
extenslons. 
eHigh variability among product types 
offered by the parent brand. 
eLow variance in quality among previous 
brand extensions. 
If parent-brand conviction Is high. 


If parent-brand experience is high. 


If the marketing support is high. For 
example, 
eAdvertising support. 


eFirm’s marketing competence. 


If the retailer acceptance is high. 


If the fit between the parent brand and the 
extension is high. For example, 
eHigh global similarity. 
eHigh ability of the owner of the parent 
brand to make a product in the 
extenslon product class. 
«High relevance of the extended 
associations for the extension product. 
The less the utility of the parent brand is 
linked to product attributes of the original 
product category. 


If the perceived risk ls low. 
If consumer Innovativeness is high. 


Source 


Smith and Park (1992) 


Dacin and Smith (1994) 
Boush and Loken (1991) 


Dacin and Smith (1994) 
Kirmanl, Sood, and Bridges 
(1999) 
Swaminathan, Fox, and 
Reddy (2001) 


Heddy, Holak, and Bhat 
(1994) 
Reddy, Holak, and Bhat 
(1994) 
Nijssen (1999) 


Aaker and Keller (1990) 


Aaker and Keller (1990) 


Broniarczyk and Alba (1994) 
Rangaswamy, Burke, and 
Ollva (1993) 


Nijssen and Bucklin (1998) 
Klink and Smith (2001) 
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tive associations of sales volume, competitive clout, and 
traffic-generating appeal. Retailers also expect that manu- 
facturers are unlikely to place a strong brand name on an 
inferior product and risk diluting its brand equity. There- 
fore, the quality or strength of a parent brand is expected to 
have a positive influence on retailer acceptance of a new 
extension product (H,,; Collins-Dodd and Louviere 1999). 
Furthermore, in general, retailers appreciate manufacturers’ 
consumer advertising because it increases consumers’ prod- 
uct awareness, thereby preselling the product and reducing 
retailers’ selling costs. Therefore, greater marketing support 
in terms of consumer advertising should generate more 
retailer acceptance (H47; Collins-Dodd and Louviere 1999). 
Prior research has also found support for a positive impact 
of the history of previous brand extensions and parent- 
brand experience on parent-brand conviction. It might be 
expected that consumers' exposure to the brand name 
increases as the number of previous brand extensions 
increases. If we assume that experiences are positive, 
greater exposure to the brand name may generate greater 
liking for and trust in the brand name.? Therefore, the his- 
tory of previous extensions might have a positive effect on 
brand conviction (H44; DelVeechio 2000). Related, direct 
experience with the parent brand may increase the personal 
relevance of the brand and generate greater liking for the 
brand. This phenomenon has been labeled the “mere owner- 
ship effect" Direct experience might also cause greater 


2Although negative brand experiences that make consumers dis- 
like the brand are possible, such experiences are likely to be rare; 
otherwise, brands would not survive 1n the marketplace. 


parent-brand knowledge, stronger brand associations, and 
stronger autobiographical memories, all of which might 
result in higher levels of parent-brand conviction (H44; e.g., 
Kirmani, Sood, and Bridges 1999). Finally, research on 
typicality suggests that repeated exposure to an extension 
product may elevate consumers' perceptions of fit. Greater 
exposure to the brand extension helps consumers identify 
more shared attributes between the parent brand and the 
extension. Thus, higher levels of marketing support in terms 
of advertising lead to higher levels of perceived fit (H5; 
e.g., Klink and Smith 2001; Lane 2000). 


Mediating effects derived from managerial wisdom. 
During the interviews with brand managers and researchers, 
we gave them a description of the ten success factors and 
asked them to point out possible structural relationships 
among the factors and to give a reason for each. The experts 
specified a total of 13 structural relationships. However, just 
one or a few respondents pointed out several of these rela- 
tionships. We decided to exclude the relationships that were 
specified by only five or less experts (2596) to avoid any 
influence of personal subjective opinions on model devel- 
opment. As a result, we identified 8 structural relationships 
that were specified by at least six or more participants (see 
Table 2, Panel A). 

In general, the experts provided highly plausible reasons 
for the hypothesized relationships among success factors. A 
total of 13 participants expected a positive effect of parent- 
brand conviction on perceived quality of the parent brand 
(Hs). They reasoned that consumers who like or trust a par- 
ent brand should have more favorable beliefs about brand 
attributes and greater confidence in their beliefs than con- 


TABLE 2 
Judgmentally Derived Mediating and Moderating Effects of the Success Factors 





A: Personal Interviews with Brand Managers and Researchers 


Expected Percentage 
Mediating Effects: Dependence Relationships Between Success Factors Sign of Experts 
H4g Parent-brand conviction — quality of the parent brand + 62 
H,7 History of previous brand extensions — quality of the parent brand + 52 
Hig History of previous brand extensions — retailer acceptance + 38 
Hg Fit between parent brand and extension — retailer acceptance + 29 
Hog Perceived risk of unknown brands — consumer innovativeness — 29 
See Table 1 
H44 Quality of the parent brand — retailer acceptance + 48 
H42 Marketing support — retailer acceptance + 43 
H4& Marketing support — fit between parent brand and extension product + 38 


B: Group Discussion with Brand Managers and Researchers 


Expected 
Moderating Effects Sign 
Ho4 Fit x parent-brand conviction + 
Həs Marketing support x retailer acceptance + 
See Table 1 
Hə; Quality of the parent brand x fit + 
Ho», Quality of the parent brand x history of previous brand extensions + 
+ 


Hə, History of previous brand extensions x fit 
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sumers who dislike the brand. Moreover, strong and favor- 
able brand associations enable the brand to be strategically 
differentiated and positioned in the consumer’s mind. High 
levels of parent-brand conviction reflect favorable predispo- 
sitions toward the brand and may function as a “filter” for 
how a consumer evaluates the specific attributes of the 
brand (including perceived quality), all of which would lead 
to a positive relationship between parent-brand conviction 
and parent-brand quality. Within this context, 11 experts 
also expected a positive influence of the history of previous 
brand extensions on perceived quality of the parent brand 
(H7). It might be expected that consumers’ exposure to the 
brand name and their likelihood to try one of the brand's 
products increase as the number of previous brand exten- 
sions increases. Product trial provides new information 
about a brand. Information generated from product trial typ- 
ically results in increased brand recall and stronger brand 
attitudes, which in turn have a powerful and positive impact 
on parent-brand evaluation, unless product experience is 
negative (Swaminathan, Fox, and Reddy 2001). Moreover, 
several experts agreed that the history of previous brand 
extensions has a positive impact on retailer acceptance of a 
new extension product (Hg). They assumed that a success- 
ful history of previous extensions typically results in favor- 
able beliefs about extension attributes, such as retail perfor- 
mance expectations and positioning characteristics. They 
also assumed that a successful history of brand extensions 
might increase retailers' confidence in and favor of their 
evaluations of subsequent brand extensions. Retailers may 
perceive the parent brand as highly competent in making 
and introducing extension products, which would also result 
in a positive relationship between the history of previous 
extensions and retailer acceptance. Continuing with possi- 
ble effects on retailer acceptance, 6 participants expected a 
positive impact of perceived fit on retailer acceptance (Hg). 
Retailers want to avoid creating poor assortment percep- 
tions among customers. Not listing a new extension prod- 
uct, which is obviously affiliated with the parent brand (in 
terms of perceived fit between the extension product and the 
parent brand), may create the perception of an incomplete 
assortment. Therefore, high levels of fit might enhance the 
perceived importance of listing the new extension product 
to maintain store selection and quality reputations, which 
initiates a positive relationship between fit and retailer 
acceptance. Finally, 6 experts expected that the perceived 
risk of unknown brands would have a negative impact on 
consumer innovativeness (H20). They believed that the 
desire or willingness to try new experiences should 
decrease as the level of perceived risk increases. Diffusion 
theory holds that people respond differently to new prod- 
ucts. To some extent, the response differences reflect differ- 
ences in risk-taking propensity. Later adopters tend to be 
more risk averse than earlier adopters (Rogers 1983), which 
supports the notion that high levels of perceived risk are 
associated with a low desire to try new products. 


Moderating Effects 


Moderating effects derived from prior research. In addi- 
tion to the ten main effects variables (H,;—Hj9, Table 1) and 
the ten indirect effects (H;;—Hoo, Tables 1 and 2), a third 


group of effects, namely, moderating or interaction terms, 
may play an important role in determining extension suc- 
cess. Given the results of prior studies, there are three inter- 
actions between the success factors under investigation that 
are expected to influence extension success (see Table 1, 
H21—-H23). 

The premise for extending an existing brand name is 
that consumers use their beliefs about the parent brand to 
draw conclusions about an extension product. The degree to 
which brand associations are transferred to the extension 
depends on the level of perceived fit between the brand and 
the extension product. Therefore, a positive interaction is 
expected between the fit variable and the quality of the par- 
ent brand (e.g., Aaker and Keller 1990); that is, the positive 
effect of the quality of the parent brand on extension suc- 
cess increases as the level of perceived fit increases (H3;). 
Consumers also tend to be much more confident about their 
judgments of parent-brand quality if they are based on a 
large sample of instances (i.e., products that bear the parent 
brand name) rather than on a comparably small one. The 
“database” from which consumers draw information for 
forming judgments about the brand extension increases 
with the increase in the number of products affiliated with 
the brand (e.g., Dacin and Smith 1994). Consequently, the 
positive effect of higher levels of parent-brand quality on 
extension success increases as the number/success of previ- 
ous brand extensions increases (Hz). Consumers’ skepti- 
cism arising from low fit can also be thwarted if a company 
has historically demonstrated that it provides consistently 
good products across several product categories (e.g., Dacin 
and Smith 1994). Therefore, the negative effect of low lev- 
els of fit on extension evaluations diminishes as the number/ 
success of previous brand extensions increases (H>3). 


Moderating effects derived from managerial wisdom. 
We used expert knowledge to gain further insight into 
potentially relevant interaction terms. We acquired expert 
knowledge by conducting a second expert survey during a 
group discussion. The group consisted of three senior brand 
managers (from major German FMCG companies) and 
three researchers (from marketing departments of German 
universities). We presented the experts with ten success fac- 
tors that we identified in the literature and asked them to 
identify highly relevant interaction effects between these 
factors. The six experts were required to come up with a 
solution during the group discussion that represented a con- 
sensus of all their opinions. This procedure resulted in five 
interaction terms: the aforementioned moderating effects 
(H,);—H3), an interaction between fit and parent-brand con- 
viction (H34), and an interaction between marketing support 
and retailer acceptance (Has; see Table 2, Panel B). The 
experts provided highly plausible reasons for the interaction 
terms. For example, the degree to which brand trust and 
likeability are transferred to an extension product depends 
on the level of perceived fit between the brand and the 
extension, similar to the interaction between fit and the 
quality of the parent brand. Higher levels of perceived fit 
allow for better transfer of brand associations (i.e., affect 
and beliefs). Therefore, a positive interaction between the 
fit variable and parent-brand conviction is expected (H34). 
Likewise, the experts agreed that the positive effects of high 
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levels of consumer advertising on extension success depend 
(to an extent) on the distribution support of the extension 
(H-5). Consumer advertising is expected to increase con- 
sumers' product awareness and utility, thus generating con- 
sumer pull. The degree to which increased awareness and 
product utility translate into purchases of the extension 
product depends (to an extent) on the availability of the 
item on the shelf. Thus, consumer advertising positively 
influences extension success, provided that the product is 
available in retail stores, which brings about a positive 
interaction between the extension’s marketing support and 
retailer acceptance. 


Model Specification 


As we previously mentioned, the primary purpose of this 
study is to determine empirically the mediating and moder- 
ating effects between a broad set of determinants of brand 
extension success and the extent to which these determi- 
nants affect the success of brand extensions. Therefore, we 
integrated the identified direct, indirect, and moderating 
effects into alternative models. First, we estimated a main 
effects-only model (M1). Model M1 contains the direct 
effects of success factors on extension success that have 
been proposed in academic literature (H,;—Hj 9; see Table 1). 
Second, we specified Model M2, which considers the direct 
effects we investigated in M1 and the identified structural 
relationships among the determinants of extension success 
(H,—H 9; see Tables 1 and 2). Third, we introduced moder- 
ator effects in Model M3, which contains the direct and 
indirect effects we investigated in M2 and the identified 
moderator effects (H,—H54). We included the moderator 
variables by using a step-by-step procedure that resulted in 
two significant moderator effects (M3a and M3b). Finally, 
Model M4 explores the moderating effect of consumers' 
familiarity with the extension product by means of a two- 
group analysis. 


Research Design 


We tested the proposed models using a broad variety of real 
extensions and real parent brands in the German FMCG 
industry. We conducted two pretests to identify the parent 
brands and extension products that we finally used in the 
main study. We began with a set of 50 brands, taken from a 
list of the 100 brands on the German FMCG market, that 
generate the highest sales revenues. We limited our sample 
of brands to those that had launched at least five extensions 
within the past four years. We asked respondents (n = 40) in 
the first pretest to rate their familiarity with these brands. 
We chose the 25 brands with the highest awareness ratings. 
We concentrated on well-known brands in our study 
because brand extensions are particularly popular in the 
marketplace for well-established brands (e.g., Swami- 
nathan, Fox, and Reddy 2001; Tauber 1988). 

We randomly selected five real brand extensions intro- 
duced between 1998 and 2001 for each of the 25 brands 
'from the German LPV New Products Database.? (When we 


3The initialism LPV stands for "Lebensmittelpraxis Verlag," an 
information service for the German FMCG industry. 
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designed our study, the LPV database contained all the new 
products introduced in Germany since January 1996.) We 
conducted a second pretest with a convenience sample of 
171 participants to warrant that the brand extensions finally 
used in our study varied in terms of awareness and (per- 
ceived) extension success. The participants answered a 
series of questions about their familiarity with the extension 
products and their perceived quality of the extensions. We 
selected 22 parent brands with three extensions each on the 
basis of the pretest results. This process yielded 66 brand 
extensions for the main study. 

We used multiple-item measures for extension success 
and for success factors. We developed a questionnaire using 
variables and rating scales that closely coincide with those 
used in previous studies (see the Appendix). We conceptual- 
ized extension success as perceived extension quality (e.g., 
Aaker and Keller 1990; Bottomley and Doyle 1996; Bro- 
niarczyk and Alba 1994; Klink and Smith 2001) and mea- 
sured it with three questions that ascertained (1) the per- 
ceived overall quality of the extension (—3 = “very low,” and 
+3 = “very high"), (2) the market position of the extension 
product (extent to which participants agreed/disagreed with 
the statement "[extension product] belongs to the top 3 
products in the market" [-3 = "strongly disagree,” and +3 = 
“strongly agree"]), and (3) the perceived overall quality 
relative to competing products in the market (—3 = “below 
average," and +3 = “above average"). 

We took measures of success factors from typical previ- 
ous studies. For example, we measured the perceived fit by 
asking participants to rate the following kinds of items: (1) 
the overall similarity of the brand extension to the parent 
brand (—3 = “not very similar," and +3 = “very similar”), (2) 
the perceived ability of the company to make a product in 
the extension product class (“Would the people, facilities, 
and skills used in making the original product be helpful if 
the manufacturer were to make the extension product?" 
[-3 = “not at all helpful,” and +3 = “very helpful" ]), and (3) 
the relevance of the brand-specific associations in the exten- 
sion product category (Broniarczyk and Alba 1994). The 
Appendix provides an overview of all the measures we 
used. 

The questionnaire was administered to people living in 
Germany by means of door-to-door interviews. Respon- 
dents participated voluntarily without any compensation 
and were selected according to a quota sampling procedure 
with a representative structure for Germany in terms of age, 
gender, and number of household members. In addition, we 
required respondents to know the brands that appeared on 
the questionnaire. Students majoring in marketing con- 
ducted the interviews. They attended detailed training ses- 
sions before going into the field. Interviewers obtained 
respondents' names and telephone numbers for validation 
purposes. We randomly contacted 596 of the respondents to 
confirm that the interviews were completed as planned. A 
total of 2426 respondents completed the questionnaire, and 
7278 cases were obtained because each questionnaire had 
scales for 3 of the 66 brand extensions (2426 x 3 = 7278). 
We excluded respondents for whom extension categories 
were not relevant, which reduced the number of cases to 
6668. We used a combination of listwise deletion and the 


informative imputation procedure (Ford 1976) to handle 
missing values (.2%). 

We used STREAMS 2.50 (Gustafsson and Stahl 2000) 
to, handle the hierarchical structure (i.e., "repeated mea- 
sures" nature) of our data, applying two-level structural 
equation modeling within a framework of maximum likeli- 
hood estimation (e.g., Muthén 1989, 1990). STREAMS 
provides an interface to the estimation program AMOS 4.0 
(Arbuckle and Wothke 1999).4 We performed outlier analy- 
sis and excluded influential outliers before we estimated 
each of the specified structural equation models (e.g., 
Baumgartner and Homburg 1996, p. 148; Bollen 1989; 
Mullen, Milne, and Doney 1995). 

The sufficiently large sample enabled us to split the data 
randomly into two parts (Sample 1 and Sample 2), facilitat- 
ing the cross-validation of findings (Byrne 2001). Sample 1 
served as the calibration sample, which we used to test the 
hypothesized models. We subsequently tested the replicabil- 
ity of each model on the basis of Sample 2 (the validation 
sample). 


Hesults 


We present the results of our study in three sections. First, 
we briefly describe the results of the assessment of the 
multiple-item measures we used. Second, we analyze dif- 
ferent models with the direct, indirect, and moderating 
effects of the ten success factors shown in Tables 1 and 2. 
Third, we illustrate the managerial relevance of our findings 
by applying our “best” model to several hypothetical exten- 
sion decisions and comparing it with an intuitive model 
advocated by managers. 


Measurement Model Assessment 


In our examination of the reliability and validity of our 
measures, we computed Cronbach's alpha for all scales. 
The alphas varied from .80 to .92. We then subjected the 
items to exploratory and confirmatory factor analyses. We 
ascertained the quality of the construct measurement by 
examining the sign, size, and significance of the estimated 
factor loadings and the magnitude of measurement error. 
All the factor loadings were significant (t > 30.54), and they 
were highly related to their respective constructs, in support 
of convergent validity. Explained variance ranged from 
52.9% to 84.9%. We computed two reliability indexes to 
ascertain how well the constructs are measured by their 
indicators (e.g., Bagozzi and Yi 1988; Fornell and Larcker 
1981; Steenkamp and Van Trijp 1991). All constructs dis- 
played satisfactory levels of reliability, as indicated by 
individual-item reliabilities in excess of .70 and composite 
reliabilities ranging from .62 to .85. These figures are com- 
parable to figures obtained in many marketing studies (e.g., 
Edison and Geissler 2003; Laroche, Bergeron, and Gouta- 
land 2001). Finally, we conducted a test for construct dis- 
criminant validity. We calculated the shared variance esti- 
mate for each success factor and compared it with the 


4Covariance matrices are available on request. 


square of the phi coefficient that represents the correlation 
between pairs of success factors (Fornell and Larcker 
1981). Each of the shared variance estimates exceeded the 
square of the corresponding phi coefficient, which provided 
evidence of discriminant validity among all success factors. 
They represent distinct constructs in the structural equation 
models. In summary, the measures fit well with the data, 
and we used them in the following process to estimate and 
test the models previously described. 


Estimating and Testing Models with Direct, 
Indirect, and Moderating Effects 


Table 3 reports the parameter estimates and goodness-of-fit 
indicators of Models M1, M2, and M3. We use overall fit 
indicators to assess how well the models fit the data (e.g., 
Hu and Bentler 1995). All fit indicators are within a satis- 
factory range and show that the models account for a sub- 
stantial amount of variance. 

A comparison of the path coefficients across models 
denotes that there are almost no differences, indicating that 
the estimates are robust. Furthermore, we cross-validated 
our findings by comparing the estimates across the calibra- 
tion and the validation sample as a means to assess the 
robustness of the postulated model. The test involved con- 
straining the path coefficients and factor loadings and the 
variance—covariance of the constructs to be equal across the 
two samples. We then evaluated whether the resulting 
change in chi-square was significant (p < .01), with the 
appropriate number of degrees of freedom. We observed no 
statistically significant differences. This indicates that the 
models are robust when other samples are used. 


Effects of parent-brand characteristics. All the models 
provided statistically significant support for the positive 
role of the parent brand’s quality and consumers’ brand con- 
viction in determining brand extension success. We found 
notable differences for the history of previous brand exten- 
sions and parent-brand experience between the model that 
reflected only direct effects (M1) and the models that 
included indirect (M2) and moderator (M3) effects. When 
we examined only direct effects, parent-brand experience 
had no influence on extension success. Nevertheless, M2 
and M3 reveal that there is a powerful and highly signifi- 
cant relationship between parent-brand experience and 
parent-brand conviction (standardized path estimate = .64, 
p < .01). In other words, parent-brand experience has a sig- 
nificant and positive impact on extension success due to its 
positive effect on parent-brand conviction, even in the 
absence of any significant direct relationship between 
parent-brand experience and extension success. Therefore, 
ignoring the indirect effects of a variable results in faulty 
interpretation of the relevance of this factor for brand exten- 
sion success. In this case, the estimated influence of the 
parent-brand experience on the success of a brand extension 
increases from .00% (nonsignificant direct effect in M1) to 
10.39% (M2). 

Model M1 similarly provides a biased estimate for the 
effect of the history of previous brand extensions on brand 
extension success. In M1, the direct influence of the history 
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TABLE 3 
Structural Parameter Estimates and Goodness-of-Fit Indicators 








Estimates 

Paths M1 M2 M3a M3b 
H, Quality of the parent brand — extension success + 10** +.11™ +.11™ RITS 
Hə History of previous brand extensions — extension success —.04* n.s. n.s. n.s. 
Hs  Parent-brand conviction — extension success T.25** +.30** +.30** +.30** 
H4  Parent-brand experience — extension success n.s. n.s. n.s. n.s. 
Hs Marketing support — extension success +.13™ +.14™ +.13** +.13* 
Hg Retailer acceptance — extension success +.29™ +.30™ +.30™ +.30™ 
Hy Fit extenslon success +.45** +.46™ +.47™ +.47™ 
Ha Linkage of the utllity of the parent brand to specific product 

attributes — extension success n.s. n.s. n.s. n.s. 
Hg Perceived risk of unknown brands — extension success +.08** +.09** +.08** +.08** 
Hio Consumer Innovatrveness — extension success +.05** +.06™ +.06** 4.07** 
H4, Quallty of the parent brand — retailer acceptance — n.s. n.s. n.s. 
H4» Marketing support — retailer acceptance — +.56** +.56™ 4.56"* 
H43 History of previous brand extensions — parent-brand conviction — 13" .13** .13** 
H44, Parent-brand experience — parent-brand conviction — .64** .64** .64** 
H,5 Marketing support fit — +.40™ +.40** +.40™ 
H4g Parent-brand conviction — quality of the parent brand — T./4** +.74™ +.74™ 
H,7 History of previous brand extensions — quality of the parent brand — +.05* +.04* +.04* 
Hig History of previous brand extensions — retaller acceptance — +.11™ +.11™ T.11** 
H4g Fit between parent brand and extension — retailer acceptance — T.23** +.23™ +.23™ 
Hag Perceived risk of unknown brands — consumer Innovativeness — —.25™ -.25"" “= 25" 
Ho, Fit x quality of the parent brand — extension success — — +.07™ — 
Ho4 Fit x parent-brand conviction — extension success — — — +.08** 


History x fit; marketing support x retaller acceptance; perceived risk of unknown brands x quality; history x quality: n.s. 


Total Effects of the Success Factors (l.e., Significant Direct Effect + Significant Indirect Effects) 
[Percentage Relative Importance In Brackets] 


Quality of the parent brand 

History of previous brand extensions 
Parent-brand conviction 
Parent-brand experience 

Marketing support 

Retailer acceptance 

Fit 

Perceived risk of unknown brands 
Consumer innovativeness 

Linkage of the utility of the parent brand to product attributes 
Fit x quality of the parent brand 

Fit x parent-brand conviction 


NFI 

NNFI 

CFI 

RMSEA 

p value of test of close fit 
n (calibration sample) 


*Significant at the p « .05 level 
**Significant at the p < .01 level 


10 [7.19] .11 [4.76] .11 [4.62] .11 [4.60] 
—.04 [2.88] .09 [3.90] .09 [3.78] .09 [3.77] 
.25 [17.99] .38 [16.45] .38 [15.97] .38 [15.90] 
n.s. [00] .24 [10.39] .24[10.08] .24 [10.04] 
.43 [9.35] .52[22.51] .52[21.85] .52 [21.76] 
.29 [20.86] .30 [12.99] .30 [12.61] .30 [12.55] 
.45 [32.37] .53 [22.94] .54[22.69] .54 [22.59] 
.08 [5.76] .08 [3.46] .07 [2.94] .06 [2.51] 
05 [3.60] .06 [2.60] .06 [2.52] .07 [2.39] 
n.s. [.00] ns. [00] ns. [.00] n.s.  [.00] 
— — 07 [2.94] — 
— — — 08 [3.35] 
.93 .92 .92 .92 
.93 .92 .92 .92 
.95 .93 .93 .93 
.045 .048 .046 .046 
1.000 .995 1.000 1.000 
3068 3068 3068 3068 


Notes: n s = not significant NFI = normed fit Index, NNFI = nonnormed flt Index, CF] = comparative fit index, and RMSEA = root mean square 


error of approximation 


of previous brand extensions on extension success is signifi- 
cant but negative. Thus, H4 is not supported.» However, 


*The negative direct influence might be attributed to some kind 
of parent-brand dilution effect A history of brand extensions may 
cause increasing incongruity in the parent brand's product range. 
A review of the data indicates that respondents assessed nearly 
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Models M2 and M3 reveal that this factor is a significant, 
positive determinant of brand extension success as a result 


one-half of the selected parent brands as having only a moderately 
clear brand image (seven-point rating scale: —3 — "very unclear," 
and +3 = “very clear"; for ten brands, means range from —1 to .5). 


of its significant and positive effects on parent-brand con- 
viction, quality of the parent brand, and retailer acceptance. 


Effects of the extension's marketing context. All models 
support the direct effect on brand extension success associ- 
ated with marketing support of the extension. However, 
Model M1 again provides a biased estimate of the impact of 
this success factor on extension success. Models M2 and 
M3 reveal that marketing support has both a direct impact 
on extension success and three indirect effects on extension 
success through the following paths: 


1. Marketing support — fit — extension success. 
2. Marketing support — fit — retailer acceptance — extension 
SUCCESS. 


3. Marketing support — retailer acceptance — extension 
success. 


Accordingly, marketing support has a substantial total 
effect of .52 on the success of a brand extension. Conse- 
quently, a considerable change emerges in the relative influ- 
ence of this variable compared with Model MI, which 
ignores structural relationships among the success factors. 
The relative influence of marketing support on the success 
of a brand extension increases from 9.35% (M1) to 22.51% 
(M2). In general, the total effect of any variable X, on any 
variable X, represents the comprehensive impact of X, on 
X5, when the entire system of structural relationships is 
considered among the variables under investigation. As with 
the total effect of the marketing support variable, the total 
effects of all other success factors can be computed to deter- 
mine their relative importance for the success of a brand 
extension. Similarly, all models find strong support for the 
significance of the effect related to retailer acceptance. 
However, Table 3 shows that M1 overestimates the rele- 
vance of this factor for the success of an extension product. 


Effects of the relationship between the parent brand and 
the extension product. The linkage of the utility of the par- 
ent brand to specific product attributes is statistically 
insignificant in all models, whereas the fit between the par- 
ent and the extension category emerges as the most impor- 
tant factor (sign as expected). Although the significance of 
this factor was not unexpected (given the results of prior 
research; e.g., Bottomley and Holden 2001), the over- 
whelming impact of this factor compared with other brand 
extension success factors is a notable result, especially 
because we used real brand extensions. Not too long ago, 
researchers repeatedly presumed that the overwhelming 
impact of the fit could be attributed to the use of hypotheti- 
cal brand extensions (e.g., Klink and Smith 2001). How- 
ever, the results of our study show that the fit still has a sub- 
stantial effect on brand extension success, even under 
exclusive use of real brand extensions. Note that we applied 
a widely accepted and comprehensive measure of perceived 
fit (see the Appendix). The total effects reveal a consider- 
able change in the relative influence of perceived fit on 
brand extension success compared with Model M1, which 
ignores structural relationships among the success factors. 
Relative importance of the fit between the parent brand and 
the extension product decreases from 32.4% to 22.6% 
(though fit still remains the most important success factor). 


Effects of the extension's product category characteris- 
tics. Perceived risk of unknown brands and consumer inno- 
vativeness emerged as significant determinants of extension 
success in all models. However, these factors exhibited only 
minor relevance. 


Moderator effects. We applied the single-indicator tech- 
nique to incorporate the effects of moderating variables 
(Ping 1995, 1998). We found statistically significant effects 
for only two interaction terms (see Table 3), namely, the 
interaction terms of fit with the quality of the parent brand 
and with the parent-brand conviction. However, the contri- 
bution of the two interaction terms was small compared 
with the main effects of the success factors (direct and indi- 
rect effects of the variables). This means that moderating 
effects play a relatively minor role and that the incremental 
variance explained by the interaction terms is small. Simul- 
taneous estimation of both interaction effects supports their 
statistical significance. However, the various fit indicators 
are significantly lower than for M2, and they suggest that 
the model does not represent an acceptable fit to the data 
(normed fit index = .87, nonnormed fit index = .87, compar- 
ative fit index = .88, and root mean square error of approxi- 
mation = .058; p value of the test of close fit = .000). 

. Finally, the degree to which a consumer is familiar with 
the extension might be an important moderating variable 
because we used real brand extensions. For example, 
parent-brand characteristics (see Table 1) and the fit 
between the parent brand and the extension product should 
have a much greater effect when the extension is relatively 
unfamiliar. We applied a subgroup analysis (e.g., Jóreskog 
1971; Ping 1998) to test the moderating effect of con- 
sumers' familiarity. The procedure involved dividing our 
sample 1nto two groups of cases on the basis of different 
levels of familiarity (low and high) and estimating M2 in 
each subgroup without coefficient equality constraints. We 
measured consumers' familiarity with the extension by ask- 
ing respondents how often they used the extension product 
(—3 = “not at all,’ and +3 = “very frequently"). We divided 
our sample into two groups on the basis of a mean split. The 
low- (high-) familiarity subgroup comprised respondents 
below (above) the mean. Table 4 reports the parameter esti- 
mates and goodness-of-fit indicators of the two-group 
model (M4). 

In many ways, the results of the two-group analysis sup- 
port the analysis that is based on the total sample. The esti- 
mates are quite robust. We find notable differences between 
the two groups for only three variables. The quality of the 
parent brand and the marketing support of the extension 
have a greater impact when the extension is relatively unfa- 
miliar. This seems reasonable because respondents, who are 
unfamiliar with the extension product, will rely more heav- 
ily on external cues (e.g., brand name, marketing support) 
when evaluating the extension. The fit between the parent 
brand and the extension product has a greater effect when 
consumers are relatively familiar with the extension. This 
result is notable. We expected that the fit would have a 
greater impact within the group of consumers who were rel- 
atively unfamiliar with the extension product. However, 
high familiarity might help consumers identify or create 
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TABLE 4 


Moderating Effect of Consumers’ Brand Extension Famillarity (M4) 








Estimates 
Low High 
Familiarity Familiarity Total 

Paths (Below Mean) (Above Mean) Sample 
H, Quality of the parent brand — extension success +.20** +.11** +.11** 
Hə History of previous brand extensions — extension success n.s. n.s. n.s. 
H4 Parent-brand conviction — extension success +.31™ +.31** +.30** 
H4 Parent-brand experience — extension success n.s. n.s. n.s. 
Hs Marketing support — extension success dal. +.05* +.14™ 
Hg Retaller acceptance — extension success +.25** +.26** +.30** 
H7 Fit extension success +.46** +.53** +.46** 
Hg Linkage of the utility of the parent brand to specific 

product attributes — extension success n.s. n.s. n.s. 
Hg Perceived risk of unknown brands — extension success +.08™* +.09** +.09** 
Hj) Consumer innovativeness — extension success +.09** +.06* +.06** 
H4; Quality of the parent brand  retaller acceptance n.s. n.s. n.s. 
H4» Marketing support — retailer acceptance +.56™" +.39** +.56** 
H,3 History of previous brand extensions — parent-brand conviction +.14™ T.16** A Ca 
H,, Parent-brand experience — parent-brand conviction +.60™ T.61** .64** 
H4& Marketing support fit +.30** +.32** +.40** 
H4g Parent-brand conviction — quallty of the parent brand +.73"* T.72** +.74** 
H,7 Hlstory-of previous brand extensions — quality of the parent brand +.05* +.08** +.05* 
H4g History of previous brand extensions — retailer acceptance T.12** T.18** T.11** 
H4g Fit between parent brand and extension retailer acceptance 4:.19** T.29** T.23** 
Hog Perceived risk of unknown brands — consumer Innovativeness —.26** —.25** —.25** 

Total Effects of the Success Factors (i.e., Significant Direct Effect + Significant Indirect Effects) 
[Percentage Relative Importance in Brackets] 
Quality of the parent brand .20 [8.20] .11 [4.98] .11 [4.76] 
History of previous brand extensions 10 [4.10] .11 [4.98] 09 [3.90] 
Parent-brand conviction .46 [19.74] .39 [17 65] 38 [16.45] 
Parent-brand experlence .27 [11.07] .24 [10.86] 24 [10.39] 
Marketing support .50 [20.49] .35 [15.84] .52 [22.51] 
Retailer acceptance .25 [10.25] .26 [11.76] 30 [12.99] 
Fit .51 [20.90] .61 [27.60] .53 [22.94] 
Percelved rlsk of unknown brands 06 [2.46] .08 [3 61] .08 [3.46] 
Consumer innovativeness 09 [3.69] .06 [2.71] .06 [2.60] 
Linkage of the utility of the parent brand to specific product attributes n.s. n.s. n.s. 
Two-Groups Model Total Sample 

NFI .91 .92 
NNFI .91 .92 
CFI 92 .93 
RMSEA .033 .048 
p value of test of close fit 1.000 .995 


*Significant at the p < 05 level. 
""Significant at the p < .01 level. 


Notes: n = 3853 (below mean)/2275 (above mean). n.s. = not significant. NFI = normed fit index, NNFI = nonnormed fit index, CFI = comparative 


fit Index, and RMSEA = root mean square error of approximation. 


more shared attributes between the extension and the brand 
schema (e.g., Klink and Smith 2001). As a result, higher 
levels of familiarity may increase the impact of fit on con- 
sumers' extension evaluations. 

To determine whether the differences between the two 
groups were significant, we constrained the model coeffi- 
cients to be equal between subgroups, and we reestimated 
the model. We compared the resulting chi-square statistic 
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with the chi-square statistic from the first estimation. We 
found a statistically significant chi-square difference 
between the two groups, which suggests that the differences 
are significant. 

Model comparison. We compared the performance of 
the estimated models using nested and nonnested model 
comparisons (Leeflang et al. 2000) based on the results of 
the cross-validation. Model M2 emerged as the "best" 


model. The chi-square difference for M2 was substantially 
lower than that for the other models; that is, applying the 
constrained model (constrained path coefficients, factor 
loadings, and variance—covariance of the constructs) had 
the least negative consequence for Model M2. Mere addi- 
tion of parameters should improve model fit. Therefore, it 
was important to consider the degrees of freedom when we 
compared the performance of the alternative models. The 
chi-square difference divided by the difference in the 
degrees of freedom was lower for M2 than for the other 
models. Therefore, incorporating indirect effects into M1 
(resulting in M2) substantially improved prediction, 
whereas the incremental improvement by the interaction 
terms, which we introduced after accounting for the indirect 
effects of the ten success factors, was small. 


Empirical Application 


To illustrate‘ the managerial relevance of our findings, we 
applied our best-fitting Model M2 to several hypothetical 
extension decisions in an additional empirical study and 
compared it with an intuitive model that managers who are 
not aware of M2 might use. We knew from our first expert 
survey that, in general, nearly all 21 managers agreed that 
the ten success factors implemented in M2 were relevant for 
brand extension success. However, the experts were not 
able to agree on the relative importance weights for the ten 
success factors. Thus, an intuitive model that managers 
might use would assume equal importance weights for all 
success factors (i.e., M1 with equal importance weights). 
Indeed, many managers with whom we spoke seemed to 
use some kind of a heuristic technique with equally 
weighted success factors to arrive at expected performance 
judgments. 

In our application, we considered a manager who 
needed to decide whether to introduce several new products 
by extending established brands. We focused our illustrative 
cases on 12 typical, realistic brand extensions that had not 
been introduced yet (e.g., aspirin, adhesive bandages, Mon 
Cheri ice cream). The 12 brand extensions stemmed from 
six well-known FMCG brands in Germany. To apply M2 in 
this context, we needed data from a consumer survey on the 
levels of the ten brand extension success factors (seven- 
point rating scales) under consideration, in addition to the 
estimated parameters in Table 3. We administered a corre- 
sponding questionnaire to a convenience sample of 119 
marketing students to obtain the required data. The applica- 
tion of M2 to the collected data provided estimates of the 
expected overall extension success and the incremental 
influence of each driver of brand extension success. This 
kind of information helps managers decide whether to 
introduce a proposed extension. 

On the basis of the collected data, we assessed expected 
performance of the hypothetical extension products by tak- 
ing (1) the average of the ten success factors (i.e., M1 with 
equal importance weights) and (2) the weighted average of 
the ten success factors using the empirically determined 
importance weights (M2). We scaled the importance 
weights so that they added up to 100%. Therefore, an exten- 
sion product with the lowest rating on all success factors 


earns an expected performance score of 1, and an extension 
product with the highest rating on each factor earns a score 
of 7. We found notable differences between M2 and the 
intuitive model that managers advocated. Differences in 
expected performance scores achieved up to .5 points on a 
seven-point rating scale, which can be viewed as 
substantial. 

Furthermore, the assumption of equal weights for the 
success factors resulted in faulty interpretation of the incre- 
mental effect of the success factors on the expected perfor- 
mance of an extension product. The estimated path coeffi- 
cients of the various success factors are a possible measure 
for the incremental effect of a certain success factor on 
brand extension success. However, elasticity is the usual 
measure for the incremental effect of a variable. Therefore, 
using (1) a model with equal weights of the success factors 
and (2) the empirically determined importance weights 
(M2), we calculated the resulting increase in expected per- 
formance (expressed as a percentage) if a certain success 
factor is increased by 10%. The resulting elasticity mea- 
sured the incremental effect on expected performance 
(expressed as a percentage) of a 1096 increase in a specified 
success factor. Table 5 reports the calculated elasticities for 
all success factors averaged across the extension products. 
There are substantial differences between the two models 
for nearly all elasticities. For example, a 1096 increase in 
marketing support results in a 2.5296 increase of expected 
performance in Model M2 versus a 1.14% increase in a 
model with equal weights. This means that the assumption 
of equal weights substantially underestimates the effect of 
marketing support on expected performance. The incremen- 
tal effects that are associated with certain variables are the 
key factors in the decision of how to allocate resources to 
them. Thus, an intuitive model with equal weights can 
result in largely inappropriate allocation of resources. 


Summary 


We tested several conceptual models that considered brand 
extensions within a comprehensive nomological net by 
building on the results of prior research and managerial wis- 
dom derived from expert judgments. Our simultaneous 
analysis of a broad variety of success factors revealed nine 
factors that significantly influence brand extension success. 
Fit between the parent brand and the extension product, 
marketing support, parent-brand conviction, retailer accep- 
tance, and parent-brand experience were particularly major 
contributors in driving brand extension success. These fac- 
tors deserve managerial attention. In addition, we found 
several important structural relationships among the investi- 
gated success factors. An analysis that fails to account for 
these relationships causes faulty interpretation of the sig- 
nificance, the relative importance, and the incremental 
influence of success factors. In addition, we found statisti- 
cally significant effects for two interaction terms: the inter- 
action of fit with the quality of the parent brand and with 
parent-brand conviction. However, the contribution of the 
two interaction terms is small compared with the main 
effects of the success factors. We illustrated the managerial 
relevance of our findings in an additional empirical study. 
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TABLE 5 
Comparing M2 with an Intuitive Model (M1 with Equal Weights) 





Model M2 

Success Factor (a) Elasticity 
Quality of the parent brand 52 
History of previous brand extensions .31 
Parent-brand conviction 1.64 
Parent-brand experience .99 
Marketing support 2.52 
Retailer acceptance 1.26 
Fit 2.26 
Linkage of the utility of the parent 

brand to specific product attributes .00 
Percelved risk of unknown brands 27 
Consumer innovativeness 24 


Equal Weights 
(b) SD (c) Elasticity (d) SD (ay(c) 
.18 1.10 .33 A7 
.08 .82 .23 .38 
.59 1.01 .31 1.62 
.18 .98 .13 . 1.01 
.59 1.14 .21 2.20 
35 1.02 33 1.24 
1.23 1.05 .62 2.15 
.00 1.14 .19 .00 
.04 .81 13 34 
.03 .94 .09 25 


e eee e o ee a 

alncremental effect (percentage) on expected performance of a 10% increase in the success factor. For example, an increase In marketing 
support of 10% results In a 2.52% increase of expected performance in Model M2 versus a 1.14% increase In a model with equal weights (i.e., 
equal weights substantially underestimate the effect of marketing support on expected performance). 


Discussion 


Managerlal Implications 
Success factors that managers should care about. From 
a managerial perspective, it is especially important to know 
the relevant determinants of extension success. Our analy- 
ses help managers divide the large number of potentially 
relevant success factors into essential factors (i.e., fit, mar- 
keting support, retailer acceptance, parent-brand conviction, 
and parent-brand experience) and less relevant or unimpor- 
tant factors (i.e., history of previous extensions, consumer 
innovativeness, linkage of the utility of the parent brand to 
specific product attributes, and moderating effects). Man- 
agers should pay the greatest attention to the five success 
factors that substantially influence brand extension success. 
Consequently, the question arises of how to influence these 
factors to increase the likelihood of extension success. Our 
research uncovers several important mediating effects that 
may serve as a Starting point for the identification of 
promising ways to influence certain success determinants 
and, consequently, the likelihood of extension success. The 
following points focus on the issue of how managers can 
influence the five most important success factors identified 
in our study: 
*Fit between parent brand and extension product: Appropriate 
selection of the parent brand and extension product category 
can directly affect fit. Extension advertisements can also 
increase the salience of crucial brand associations that help 
consumers infer extension features and benefits and thus 
understand how an extension fits. Consumers will likely infer 
judgments of fit and consistency when an advertisement illus- 
trates how parent-brand attributes improve the extension’s 
ability to provide benefits. Repeated exposure to advertise- 
ments that evoke appropriate parent-brand associations also 
helps consumers establish linkages between the brand and the 
extension product, thus positively influencing perceived fit 
between the extension product and the extending brand (Lane 
2000). 
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°Parent-brand conviction and parent-brand experience: 

Although extension products do not guarantee success on the 
basis of the brand name alone, our results indicate that 
parent-brand characteristics, such as consumers’ parent-brand 
experience and conviction, play an important role in driving 
brand extension success. Managers frequently cannot influ- 
ence these two factors in the short or medium term, because 
both factors reflect specific characteristics of the parent 
brand. However, our results indicate that building customer- 
based brand equity and/or acquiring strong brands are favor- 
able strategies because they represent a prerequisite for the 
successful leveraging of an existing brand to reap some finan- 
cial benefit. Moreover, managers who have access to a port- 
folio of parent brands might consider influencing parent- 
brand conviction and parent-brand experience by selecting an 
appropriate brand that performs favorably in terms of these 
factors. 


*Retailer acceptance: There are various possible ways to 
influence retailers’ acceptance of a new product. For exam- 
ple, previous research has shown the positive effects of trade 
and slotting allowances on retailer acceptance of new prod- 
ucts (e.g., Collins-Dodd and Louviere 1999). Promotional 
allowances reduce retailers’ costs of telling consumers that 
the extension product can be obtained in a given store, thus 
strengthening interstore competition. Consumer advertising 
also builds demand and therefore should have a positive 
effect on retailers' decisions to accept new listings. 
eMarketing support. The marketing support that the extension 
product receives plays a critical role for FMCGs in determin- 
ing the success of the new product. This factor is of specific 
interest for managers because it is under a company's direct 
control and, in general, can be influenced 1n the short term. 
However, the financial well-being of the company represents 
a boundary condition regarding the total support that new 
introductions may receive. 


Relative importance of success factors. Another set of 
implications for managers is related to the incremental 
influence of each success factor on the performance of the 
extension product. Thus far, little has been known about the 
relative importance of success factors in explaining brand 


extension success. Therefore, managers might intuitively 
assume equal importance weights for all success factors. 
Our results indicate that relative importance varies substan- 
tially across the success factors. To illustrate the managerial 
relevance of this finding, we applied the model advocated 
by managers and our best model (M2) to 12 hypothetical 
extension decisions. Our illustrative cases show that the 
assumption of equal weights for the success factors results 
in faulty prediction of extension success and in faulty inter- 
pretation of the incremental effect of the success factors on 
expected performance of an extension product. This is a 
serious problem because it is imperative for managers to 
know the magnitude of effects so that they can analyze the 
reasons for brand extension failure/success and properly 
allocate resources to the various success factors that are 
under a firm’s control. Managers need to be aware that sim- 
ple heuristic techniques, which ignore the differences in 
importance weights among the success factors, are likely to 
cause notable deviations from marketplace performance. 
Our results demonstrate that managers need to pay attention 
to the varying incremental effects of success determinants 
when evaluating and managing extension products. 


Limitations and Further Research 


Measurement of the relative importance of brand extension 
success factors is a complex issue. It has been one of the 
key issues in prior research, but this work has provided lim- 
ited insights into the relative importance of success factors 
because it has only analyzed a small fraction of success fac- 
tors at one time. Against this background, our study is more 
appropriate for understanding relative importance. It syn- 
thesizes prior work and simultaneously examines the direct 
and indirect effects of a large number of potentially relevant 
success factors. However, note that the range and variance 
of brand extension success factors, among others, drive 
relative importance. We selected a broad variety of parent 


brands and extension products to ensure that the brand 
extensions used in our study varied with respect to con- 
sumers’ judgments of success factors. The selected parent 
brands belong to major firms on the German FMCG mar- 
ket. The extension products covered the ten most important 
food and nonfood product categories of ACNielsen’s well- 
known classification. The manipulation check of success 
factors confirmed a large empirical variance. In this context, 
we note that the majority of previous studies also did not 
vary consumers’ judgments of the success factors by means 
of an experimental design. A notable exception might be the 
fit variable that was often manipulated in previous work by 
the selection of low-fit and high-fit extensions. We realized 
the latter in our study and thus selected parent brands and 
extension products appropriately. Nevertheless, it might be 
useful for validation purposes if future work were to repli- 
cate our study in a laboratory setting using hypothetical par- 
ent brands and extension products that enable explicit 
manipulation of success factors. However, use of hypotheti- 
cal parent brands represents a serious threat to the external 
validity of the results (e.g., Boush and Loken 1991). 

Furthermore, we chose to analyze only FMCGs. It 
would be interesting to investigate the extent to which our 
findings could be generalized to other fields, such as con- 
sumer durables or services. Further research is also needed 
to investigate the generalizability of our results across mul- 
tiple categories in FMCG. 

In conclusion, fit should become a focus of further 
research, given the importance of the fit variable found in 
our study and in previous studies (e.g., Bottomley and 
Holden 2001). There is no universally accepted conceptual- 
ization of fit, which raises the more general question, What 
constitutes fit? The answer to this question is especially 
important for understanding how managers can manipulate 
perceived fit. 








APPENDIX 
Measures 
Success Factor/ltem Source 
Quallty of the Parent Brand 
Percelved overall quality of the flagship product (-3 = “inferlor,” and 3 = “superior’) Aaker and Keller 
Extent to which participants agreed/disagreed with the following statements (—3 = “strongly (1990); Sheinin and 
disagree,” and 3 = “strongly agree"): Schmitt (1994) 


[Brand name] offers high-quality products. 


The quality of [brand name] products is far above average. 


History of Previous Brand Extensions 


Number of products affillated with the brand (—3 = "very few products,” and 3 = “a lot of 


products") 


Success of the products affiliated with the brand (—3 = "not at all successful,” and 3 = "very 


successful") 


Degree of similanty between the products affiliated with the brand (—3 = “not very similar,” and 


3 = “very similar") 


Dacin and Smith 
(1994); Keller and 
Aaker (1992a); 
Sheinin and Schmitt 
(1994) 


' Participants were instructed to answer questions about products previously introduced and not 
the new extension product. This clarified that the question referred to all products previously 
introduced under the brand name and not to the new extension product. 
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aA EE 


Success Factor/Item 


Source 


au————————————————————————————ÓÓOÓÓÓÓÁÓÓOÓOÓÓOÓOÓODMEEXDELLLLLLLLOLGLZIL.LLLLULLIHLLILiizÉóÁb»ássmÁewiaónh!zádgí'áàihgqÁuazidRRR UU hel 


Parent-Brand Conviction 
Extent to which participants agreed/disagreed with the following statements (—3 = “strongly 
disagree; and 3 - "strongly agree"): 
In evaluating a new [product category] product, | could trust [brand name]. 
[Brand name] is a likeable brand. 
| relate to [brand name]. 


Parent-Brand Experlence 
Frequency of using the parent brand (—3 = "not very often,” and 3 = "very often") 
Frequency of purchasing the parent brand (-3 = “not very often,” and 3 = "very often") 
Intention to buy the parent brand in the future (-3 = "not very likely” and 3 = “very likely”) 


Marketing Support 
Extent to which participants agreed/disagreed with the following statements (—3 = "strongly 
disagree,” and 3 = “strongly agree”): 
[Extension product] is well supported in terms of advertising. 
[Extension product] recelves competent marketing support. 


Retaller Acceptance 
Extent to which participants agreed/disagreed with the following statements (—3 = "strongly 
disagree,” and 3 = “strongly agree"): 
[Extension product] is well supported in terms of distribution. 
[Extenslon product] is available In many supermarkets. 


Fit 

Global similarity between the parent brand and the extension product (—3 = “not very similar” 
and 3 = "very similar") 

Would the people, facilities, and skills used in making the original product be helpful If the 
manufacturer were to make the extension product? (-3 = “not at all helpful,” and 3 = "very 
helpful") 

Extent to which parent-brand-specific associations are relevant In the extension category: 
Step 1: stating of brand associations; Step 2: relevance of these associations In the 

extension category (— 3 = “not at all relevant” and 3 = “very relevant"). 


Linkage of the Utility of the Parent Brand to Product Attributes of the Original Product 
Category 
Extent to which participants agreed/disagreed with the following statement (-3 = “strongly 
disagree,” and 3 = “strongly agree"): 
[Brand name] Is closely tied to the attributes of the original product category. 
My associations with [brand name] are closely tied to the attributes of the original product 
category. 


Perceived Risk of Unknown Brands 
Extent to which participants agreed/disagreed with the following statements (-3 = "strongly 
disagree,” and 3 = "strongly agree”): 


If | buy an unknown brand, | would feel very uncertain of the level of quality that | am getting. 
| prefer buying a well-known brand, because | need the reassurance of an established brand 


name. 
| prefer buying a well-known brand, because the risk that my needs will not be met Is low 
compared with an unknown brand. 


Innovativeness 
Extent to which participants agreed/disagreed with the following statements (—3 = “strongly 
disagree,” and 3 = “strongly agree”): 
Overall, ! enjoy buying the latest products. 
| like to purchase new products before others do. 
Overall, it Is exciting to buy the latest products. 


DelVecchio (2000); 
Kirmani, Sood, and 
Bridges (1999) 


Broniarczyk and Alba 
(1994); Swaminathan, 
Fox, and Reddy 
(2001) 


Nijssen (1999); Reddy, 
Holak, and Bhat 
(1994) 


Nijssen (1999) 


Aaker and Keller 
(1990); Bottomley and 
Doyle (1996); 
Broniarczyk and Alba 
(1994) 


Rangaswamy, Burke, 
and Oliva (1993) 


DelVecchlo (2000) 


Hem, Chernatony, and 
Iversen (2001); Klink 
and Smith (2001); 
Midgley and Dowling 
(1978) 
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Sonya A. Grier, Anne M. Brumbaugh, & Corliss G. Thornton 


Crossover Dreams: Consumer 
Responses to Ethnic-Oriented 
Products 


The authors develop and evaluate a framework for investigating and understanding ethnic product crossover, that 
Is, when a product intended for one ethnic minority group gains significant penetration among consumers outside 
the referent ethnic group. In three studies, the authors investigate how a product's characteristics, the promotion 
and distribution decisions made for the product, and consumers' propensity for diversity influence the product's 
likelihood of crossing over from the intended ethnic target market to mainstream white consumers. Product char- 
acteristics interact with both other marketing decisions and consumers' diversity-seeking tendencies to influence 
whether consumers will be interested in ethnic products and the social context in which they are willing to con- 
sume them. The authors discuss the implications of the findings for theory and practice and provide directions for 
further research that include consideration of the product's ethnic embeddedness, the context in which the product 
will be consumed, and consumers' diversity-seeking tendencies. 


arketers of ethnic-oriented niche products fre- 
quently strategize to create crossover of their prod- 
ucts into larger markets for economies of scale and 
commercial viability. However, when such products are 
closely identified with the specialized target market they 
serve, successful crossover to a mainstream market can be a 
challenge. Nevertheless, some formerly ethnic-oriented 
niche products successfully cross over to the mainstream 
market. For example, 60% of hip-hop and rap, traditionally 
black music, is purchased primarily by white youths 
(Hughes 2002); salsa surpassed ketchup as the top-selling 
condiment in the United States in 1998 (Mellgren 2001); 
90% of sushi chef students at a Los Angeles academy are 
not Japanese; and more than 1000 U.S. grocery stores have 
sushi counters (Walkup 2003). In contrast, however, some 
ethnic-oriented niche products do not cross over despite 
marketers' intentions for them to do so. For example, 
annual lists of top-ten television programs for black and 
white consumers demonstrate strongly segregated audi- 
ences: Blacks watch shows targeted to a black audience, 
and whites watch shows intended for a white or mainstream 
audience (Gibbs 1999). Despite script and cast changes in 
response to marketing research, producers could not induce 
significant crossover among Anglo moviegoers for Price of 
Glory, a Rocky-like movie about a Hispanic fighter (Bannon 
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2000). Why do some ethnic-oriented products cross over 
but others do not? 

We examine the crossover of ethnic-oriented products 
into the larger mainstream market. Specifically, we address 
three key research questions: 


1. How do an ethnic-oriented product’s characteristics influ- 
ence its appeal among consumers of the referent ethnic 
minority and the mainstream ethnic majority? 

2. How do other marketing activities influence the appeal of 
and intended behaviors toward ethnic-oriented products 
among mainstream ethnic majority consumers? 

3. How do consumers differ in their willingness to purchase 
and consume ethnic-oriented products? 


We address these questions by first developing a framework 
for understanding ethnic-oriented product crossover. Then, 
in three studies, we examine several aspects of this frame- 
work, including how ethnic congruence, familiarity, diver- 
sity orientation, and social context influence consumers’ 
attitudes and purchase intentions toward ethnic-oriented 
products. We discuss the implications of our findings in 
relation to our framework and describe how our results 
inform our practical understanding of when crossover is and 
is not likely to occur. We also discuss how the framework 
may be applied to the understanding of crossover potential 
across product categories, consumption situations, and 
other social identity factors (e.g., age, gender). 


A Framework for Ethnic Crossover 


Crossover occurs when a product designed for one target 
segment meets with attraction and acceptance in another 
segment (Garafalo 1993; Gibbs 1999). Crossover has tradi- 
tionally been defined as the movement of a product from 
margin to mainstream, but not vice versa (Garafalo 1994). 
We focus on ethnic crossover, whereby a product that 
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reflects the cultural experience of an ethnic minority group 
(an ethnic-oriented product) gains significant penetration 
among consumers outside the referent ethnic group.! Many 
marketers, though not all (cf. Clunis 2004), want to develop 
authentically ethnic-oriented products that reflect their ref- 
erent culture well (i.e., that do not “sell out") but that also 
appeal to an audience that 1s sizable enough to ensure prof- 
itability (Andruss 2004; Guess 2004). However, the product 
attributes and other marketing decisions that give products 
their ethnic character may signal to mainstream and other- 
ethnic consumers that the products are not intended for 
them (Grier and Brumbaugh 1999). Alternatively, the prod- 
uct itself may be viewed favorably, but attitudes toward its 
consumption may be unfavorable. The purchase and con- 
sumption contexts surrounding an ethnic-oriented product 
likely affect whether it will cross over, and individual dif- 


ISeveral definitions are in order. We use the term “ethnicity,” 
though 1n some cases the term "race" may be more accurate (our 
participants self-identified on the basis of racial categories of 
black or white). We are aware that the two terms have distinct 
meanings (i.e., ethnic group membership includes race but 1s not 
limited to this factor). Nonetheless, because some of our stimuli 
operationalizations are based on ethnic categories (African Ameri- 
can), we use the term "ethnicity." In addition, this is conceptually 
consistent with prior research. The term “mainstream” refers to the 
numeric and social majority within a society, and the term "other- 
ethnic" refers to an ethnic minority group other than the referent 
ethnic group for the product in question. Throughout our studies, 
which take place in the United States, the referent ethnic minority 
group is black, the mainstream group is white, and other-ethnic 
groups include Hispanic and Asian American. 


ferences in preferences and beliefs about the referent ethnic 
group influence consumers' inclinations toward or away 
from ethnic-oriented products. 

Our conceptual framework (see Figure 1) proposes that 
consumer characteristics and consumption context are 
important moderating effects on how ethnic-oriented prod- 
uct characteristics (including the impact of other marketing- 
mix decisions on inferred product characteristics) will be 
interpreted and embraced (or rejected) by mainstream or 
other-ethnic consumers. Important mediating processes 
influence how these product characteristics will be used to 
form attitudes, which influence subsequent behavioral 
intent and, ultimately, actual behavior. 

This framework is based on attitude—intention—behavior 
models, which are frequently used to understand consump- 
tion intentions and behaviors across a variety of domains 
(Kozup, Creyer, and Burton 2003; Martin and Stewart 2001; 
Sheppard, Hartwick, and Warshaw 1988). Four additional 
key constructs, which we discuss in this section, are rele- 
vant for understanding crossover behaviors within this 
framework: characteristics that convey a product’s ethnic 
orientation, consumer characteristics that influence how the 
product cues are interpreted, mediating processes that con- 
sumers use to form attitudes and behavioral intentions from 
product cues, and features of the context in which the prod- 
uct will be purchased and consumed. 


Product Characteristics: Ethnic Congruence and 
Embeddedness 


Some target market-specific product attributes preclude 
outright the product's intended use by non-target market 
members (e.g., feminine hygiene products' use by men). 


FIGURE 1 
Framework for StudyIng Ethnic-Orlented Product Crossover 
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Similarly, some ethnic-oriented products function well only 
for members of the referent ethnic group (e.g., hair care 
products for blacks, Mandarin-language media for Chinese 
Americans). Consequently, crossover is not feasible for 
such products. 

However, most ethnic-oriented products are not so con- 
strained, and limits to crossover result from consumers’ 
beliefs that the products are not intended or appropriate for 
them. Prior research shows that consumers respond less 
favorably to spokespeople of another ethnic group than to 
spokespeople of their own ethnic groups (e.g., Green 1999; 
Whittler 1991; Whittler and DiMeo 1991; Williams, Qualls, 
and Grier 1995) and suggests that people prefer products 
associated with their own ethnic group over products asso- 
ciated with other ethnic groups (cf. Whittler 1989). 

Cues such as the ethnicity of a spokesperson or of the 
characters pictured on a package convey the ethnic orienta- 
tion of a product. However, ethnic orientation is about not 
only the ethnic group associated with the product but also 
the degree to which the product is associated with that 
group, or its “ethnic embeddedness” (Williams 1995). 
Advertisements that combine ethnic characters with ethnic- 
consistent cues, such as depictions of ancestral heritage; 
culture-specific assimilation experiences; and other 
cultural-bound traditions, practices, and symbols, are highly 
ethnically embedded (Williams 1995). Such advertisements 
tap into the ethnic experience of ethnic-congruent con- 
sumers and engender favorable responses among target 
market members (Appiah 2001b; Brumbaugh 2002). In 
contrast, advertisements that are low in ethnic embedded- 
ness include ethnic models combined with elements (e.g., 
background, language, themes) of mainstream American 
culture. 

Products themselves may also contain ethnic-specific 
cues that influence the degree to which they are perceived 
as ethnically embedded. In addition, other marketing strate- 
gies, such as distribution in ethnic-frequented channels and 
promotion in ethnic-targeted media, may further signal to 
mainstream and other-ethnic consumers that an ethnic- 
oriented product is not intended for them and may reduce 
its potential for crossover. For example, tortillas, a product 
traditionally associated with Mexican ethnic culture, that 
are produced by machine, packaged with English-language 
labels by a well-known U.S. company, and distributed 
through national grocery store chains are far less ethnically 
embedded than tortillas that are rolled by hand by a small 
Mexican company, packaged with Spanish-language labels, 
and sold only in bodegas in the southwestern United States. 
Although the product category is associated with a specific 
ethnic group, other cues about its production, brand, coun- 
try of origin, and distribution enhance (as in the latter exam- 
ple) or lessen (as in the former) its ethnic embeddedness. 


Consumer Characteristics: Distinctiveness and 
Diversity Seeking 


Characteristics of consumers influence how they interpret 
and respond to ethnic cues. Ethnic distinctiveness is one 
source of difference. Research on distinctiveness demon- 
strates that members of distinctive ethnic groups (groups 
that are a numeric minority and for whom ethnic group 


membership is salient and self-defining) respond more 
favorably to targeted appeals than do members of ethnically 
nondistinctive groups (groups that are a numeric majority 
and for whom ethnic group membership is neither salient 
nor self-defining) because of heightened levels of felt simi- 
larity that arise from ethnic congruence between product 
and consumer (Aaker, Brumbaugh, and Grier 2000; Grier 
and Deshpandé 2001; McGuire et al. 1978). For members 
of ethnically nondistinctive groups, ethnicity simply does 
not provide a basis for or information about attitudes toward 
targeted advertisements or products (Aaker, Brumbaugh, 
and Grier 2000). 

Another characteristic relevant for ethnic crossover is 
consumers' propensity to seek out cultural or ethnic-related 
diversity in people, products, and services, a characteristic 
we call "diversity seeking.” Related to but distinct from the 
brand-focused variety seeking ("the tendency of individuals 
to seek diversity in their choices of services or goods" 
[Kahn 1995, p. 139], diversity seeking captures con- 
sumers' predilection for consumption experiences that are 
related to cultures other than their own, including those 
defined by ethnicity. Consumers who are not members of a 
particular ethnic group may be attracted to ethnic-oriented 
products because they seek variety outside the usual main- 
stream fare (Kahn 1993), because they have an interest in 
the specific culture (Appiah 2004), or because they are 
interested in different ethnic cultures in general (Halter 
2000). 

For example, Hallmark markets a collection of greeting 
cards called "Common Threads," whose messages reflect a 
variety of world cultures, emphasizing global community 
and diverse cultural expression (Halter 2000). Hallmark 
research found that "Common Threads" buyers were over- 
whelmingly white consumers who also tended to purchase 
multicultural or ethnic-oriented clothing, jewelry, and deco- 
rative home items. This segment is likely similar to Ray and 
Anderson's (2000) “Cultural Creatives.” This group of 
people consists of 2696 of adult Americans who are con- 
cerned with self-actualization, spirituality, and self- 
expression and who like things that are foreign and exotic. 

A small but growing body of academic research has 
begun to explore inclinations toward cultural diversity. 
Miville and colleagues (Fuertes et al. 2000; Miville et al. 
1999) developed a scale that measures people's tolerance of 
people who are different from themselves. Douthitt, Eby, 
and Simon (1999) developed a scale that assesses people's 
receptivity to dissimilar others. However, no academic 
research has empirically examined a diversity-seeking trait, 
whereby some people are highly interested in and proac- 
tively seek out people, places, products, and experiences 
that are culturally different from their own. 


Consumption Context 


Prior research shows that the contexts in which products are 
purchased and consumed have a profound impact on con- 
sumption beliefs and behaviors. For example, contextual 
factors dictate whether a product will be consumed at all 
(Cote, McCullough, and Reilly 1985) and, if so, what prod- 
uct attributes will be weighted more heavily in choice 
(Quester and Smart 1998). Social beliefs and norms influ- 
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ence consumption more for products that are consumed in 
public than for those that are consumed in private (Bearden 
and Etzel 1982), and perceived visibility of a product 
increases these social influences (Fisher and Price 1992). 

The ethnic orientation of the context in which a product 
will be purchased or used has also been shown to influence 
its consumption. People are more likely to consume ethnic- 
consistent products when the consumption context is ethni- 
cally relevant than when it is associated with the main- 
stream or another ethnic group (Stayman and Deshpandé 
1989). The presence of others of the same ethnicity 
1ncreases the likelihood of ethnicity-consistent product con- 
sumption (Xu, Lotz, and Almeida 2004). Situations that 
heighten a person's ethnic-group salience lead to greater 
ethnicity-consistent consumption than situations that do not 
(Forehand and Deshpandé 2001). 

Thus, it is highly likely that the specific context in 
which an ethnic-oriented product is used will influence the 
likelihood of crossover. Public consumption may increase 
the social risks of ethnic-oriented products for mainstream 
or other-ethnic consumers (Bearden and Etzel 1982; Fisher 
and Price 1992). When given a choice, mainstream or other- 
ethnic consumers may be more likely to use ethnic-oriented 
products in private consumption contexts (e.g., alone at 
home) than in public consumption contexts (e.g., with oth- 
ers at a public retail space). Similarly, they may be more 
likely to use ethnic-oriented private goods than ethnic- 
oriented public goods (Ratner and Kahn 2002; Richins 
1994). 

The nature of the other people with whom a person 
might consume ethnic-oriented products may further facili- 
tate (e.g., 1f the others are of the referent ethnic group or are 
known for their tolerance of it) or inhibit (e.g., if others are 
not of the referent ethnic group or are known for their intol- 
erance of it) the consumption of such products (Wakefield 
and Inman 2003; Xu, Lotz, and Almeida 2004). For exam- 
ple, the idea of an African aerobics class may appeal to 
some nonblack fitness enthusiasts. However, the thought of 
attending the class as a minority member or engaging in 
unfamiliar movements may undermine otherwise favorable 
product-based attitudes (toward the class itself), causing 
behavior-based attitudes and intentions (toward enrolling in 
and attending the class) to be far less favorable. Attending 
the class with a friend who is a member of the referent eth- 
nic group may mitigate this effect. 


Mediating Processes: Familiarity 


Familiarity with the target product is one process that may 
mediate the impact of ad and product characteristics on atti- 
tudes and thus enhance the crossover of ethnic-oriented 
products. For example, familiarity gained from sampling 
Asian cuisine in restaurants is credited for increasing sales 
of Asian ingredients for use at home (Retail World 2000). 
Acceptance of ethnic desserts, such as Indian kulfi and 
Spanish churros, is greater when consumers are already 
familiar with similar offerings, such as ice cream and 
donuts (Strong 2002). 

Thus, consumers' enjoyment of ethnic-oriented prod- 
ucts likely depends on the ethnic-specific knowledge they 
have for such products. For example, whereas little ethnic- 
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specific knowledge is required to understand mainstream- 
themed products, significant ethnic-specific knowledge is 
required to decode, interpret, and understand products 
whose meanings are culture bound with a specific ethnic 
group (Brumbaugh 2002; Grier and Brumbaugh 1999). If 
relevant ethnic knowledge is more difficult to access, as 
when a consumer is not a member of the referent ethnic 
group, the resulting lack of familiarity with the product may 
lower appeal for the product. 


Studies Exploring the Framework 


In summary, our studies focus on four key constructs that 
influence whether and to what degree an ethnic-oriented 
product will cross over: product characteristics, including 
marketing-mix factors that consumers use to make infer- 
ences about the product; consumer characteristics, such as 
diversity seeking; mediating processes, such as consumer 
familiarity with product cues; and situational factors that 
make a consumption context more or less conducive for 
consuming ethnic-oriented products. We explore various 
aspects of our framework in three studies that examine the 
crossover of ethnic-oriented popular films into the larger 
mainstream market. Film represents a vivid, influential, and 
economically significant social institution (Wolf 1999) in 
which ethnic-oriented products must cross over for eco- 
nomic viability (Acapulco Black Film Festival 1997; 
Alexander 2001; Bannon 2000). In all three quasi experi- 
ments (we measure participant ethnicity but do not manipu- 
late it), we provide study participants with descriptions of 
movies that vary in their ethnic congruence with the partici- 
pant and in their level of ethnic embeddedness. In Study 1, 
we explore how diversity seeking and ethnic distinctiveness 
(consumer characteristics) moderate participants' attitudinal 
responses to ethnic-congruent and -incongruent movies 
(product characteristics). In Study 2, we investigate the role 
of familiarity (a mediating process) on the influence of the 
product's ethnic congruence on attitudinal responses. In 
Study 3, we examine behavioral intentions among high- and 
low-diversity-seeking mainstream white participants (con- 
sumer characteristics) for different levels and types of eth- 
nic embeddedness and situations (consumption context) 
associated with ethnic-oriented movies. We find support for 
our framework across all three studies, which we detail in 
the sections that follow. 


Study 1 


In Study 1, we explore the moderating impact of two con- 
sumer characteristics, ethnic distinctiveness and diversity 
seeking, on the main effect of congruence between con- 
sumer ethnicity and a product's ethnic orientation on its 
appeal. Prior research shows that people who perceive 
themselves as similar to ad spokespeople hold more favor- 
able attitudes toward the advertisement than people who 
perceive themselves as dissimilar (Whittler 1991). How- 
ever, research on target marketing shows that this holds 
only for ethnically distinctive (versus nondistinctive) people 
(Aaker, Brumbaugh, and Grier 2000). We expect to find a 
similar outcome for ethnic-oriented products (cf. Whittler 
1989). Specifically, we hypothesize the following: 


H,: Ethnic congruence between a product and a consumer 
favorably influences attitudes toward the product only for 
members of a distinctive ethnic group. 


Furthermore, we propose that people who are high in 
‘diversity seeking should be more interested in ethnic- 
incongruent products and find them more appealing than 
people who are low in this characteristic. We predict that 
diversity seeking will have a compensating effect on prod- 
uct preferences, enhancing attitudes toward a product only 
when participant ethnicity and product ethnic orientation 
are different. 


H»: People who are high in diversity seeking have more favor- 
able attitudes toward ethnically incongruent products than 
people who are low in diversity seeking. 


Method 


Stimuli development and pretesting. We created three 
movie descriptions that represented three movie genres as 
replicates for our ad stimuli (for the stimuli for all studies, 
see Appendix A). We pretested each (n = 47) for consis- 
tency with the intended genre compared with the other two 
movies. The focal manipulation was movie ethnic orienta- 
tion as conveyed through movie cues in each movie descrip- 
tion. In addition, we pretested each manipulation for consis- 
tency with the intended ethnic group compared with other 
ethnic groups on seven-point scales. We manipulated movie 
ethnic orientation in My Turn Again (action genre) through 
the music of the main protagonist; “rock” was more associ- 
ated with white ethnicity (M = 5.54) than “R&B” (M = 
3.39; F(1, 46) = 31.56, p < .01), and R&B was more associ- 
ated with black ethnicity (M = 4.65) than rock (M = 2.17; 
F(1, 46) = 39.55, p < .01). We manipulated movie ethnic 
orientation in Thin Blood (drama genre) through the name 
of the main protagonist; “Nathaniel Wellington" was more 
associated with white ethnicity (M = 5.17) than “Tyrone 
Washington" (M = 2.58; F(1, 46) = 56.18, p « .01), and 
Tyrone Washington was more associated with black ethnic- 
ity (M = 5.71) than Nathaniel Wellington (M = 3.13; 
F(1, 46) 2 65.57, p « .01). Finally, we manipulated movie 
ethnic orientation in The One (romance genre) through the 
city and ethnicity of the main protagonists; "women in 
Seattle" were more associated with white ethnicity (M = 
4.92) than “African American women in Detroit" (M = 
1.17; F(1, 46) = 189.17, p « .01), and African American 
women in Detroit were more associated with black ethnicity 
(M = 6.83) than women in Seattle (M = 3.38; F(1, 46) = 
153.79, p < .01).2 

Design, procedure, and participants. Half of the partici- 
pants saw the black version of My Turn Again, the white 
version of Thin Blood, and the black version of The One; 
the other balf saw the other versions of the movies. Thus, 
the basic design of this study was a 2 (movie ethnic orienta- 


ZWe recognize that this manipulation 1s not strictly parallel in 
that we do not specify “white” in the white manipulation (i.e., 
women in Detroit). We decided not to emphasize ethnicity for this 
condition because in the United States, white is the cultural and 
ethnic default ın contrast to ethnic minorities (Brumbaugh 2002) 
and is rarely (if ever) used to describe the mainstream (Billig 
1997). 


tion: black or white) x 2 (participant ethnicity: black or 
white) x 2 (diversity seeking: high or low) mixed-factorial 
design in which movie ethnic orientation is a within- 
subjects factor and participant ethnicity and diversity seek- 
ing are between-subjects factors. Preference for genre and 
movie replicate factors are included as covariates. 

Participants were 67 black and 62 white Americans who 
were recruited from public areas in two ethnically diverse 
cities on the West Coast. Participants ranged in age from 18 
to 60 years; 59% were female. They were given a study 
packet that contained all stimuli and measures and were 
asked to read each movie description and to complete items 
that assessed their attitude toward the movie. After complet- 
ing these measures for all three movies, participants com- 
pleted items on their preference for different movie genres, 
their preference for movies about people different from 
themselves, and their demographic characteristics, includ- 
ing ethnicity. 

Measures. We selected six movie attitude items on the 
basis of a review of prior industry marketing research that 
assessed the appeal of major movies among ethnically 
diverse Americans ("I would be interested in seeing this 
movie,” “This movie would be entertaining," “This is my 
kind of movie," “This movie would be interesting,” “The 
storyline attracts me to the movie," and “The characters 
attract me to the movie"). We assessed the six items on 
seven-point Likert scales and averaged the responses to 
yield a single dependent measure of attitude toward the 
movie (à = .95). We used the Likert scale item “I like 
movies about people different from me" to create a 
diversity-seeking dummy variable containing zero or one to 
reflect the median split on this item (for additional details 
on this item, see Appendix B).? 


Results 


Unless otherwise indicated, degrees of freedom are (1, 
386). The genre covariate was significant and positive. The 
movie covariate was significant such that The One was pre- 
ferred over Thin Blood, which was preferred over My Turn 
Again. The model was significant (F(10, 386) = 8.99, p < 
.01). Both movie ethnic orientation (F = 6.90, p « .01) and 
participant ethnicity (F = 8.13, p < .01) were significant but 
were subsumed by a significant interaction between the two 
(F(2, 386) = 13.64, p « .01), as we expected. The main 
effect of diversity seeking was significant (F — 4.32, p « 
.04), and the anticipated three-way interaction among 
movie ethnicity, participant ethnicity, and diversity seeking 
was significant (F(2, 386) = 5.76, p « .02). We tested the 
hypotheses through a priori contrasts evaluated with one- 
tailed tests at p « .05. 

H, proposed that movies for which movie ethnic orien- 
tation and participant ethnicity were congruent would be 
more appealing than movies for which ethnicity was incon- 
gruent, but only for members of distinctive ethnic groups. 


3The results were similar whether the genre and diversity pref- 
erence measures were included as continuous variables or as the 
median split dummies discussed here. We used the dummy varia- 
bles to facilitate the interpretation and description of the results. 
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The results show that distinctive black participants pre- 
ferred black-oriented movies (M = 4.44) over white- 
oriented movies (M = 3.55; F = 20.68, p < .01) and that 
nondistinctive white participants had no preference for 
white-oriented movies (M = 3.59) over black-oriented 
movies (M = 3.45, p > .20), as we hypothesized. 

H, proposed that people who are high in diversity seek- 
ing would like ethnically incongruent movies more than 
people who are low in this characteristic. This held true for 
both white and black participants. Specifically, white par- 
ticipants liked black-oriented movies more when they were 
high in diversity seeking (M = 3.77) than when they were 
low in diversity seeking (M = 3.05; F = 4.56, p < .03); 
diversity seeking had no impact for ethnic-congruent white- 
oriented movies (preference high: M = 3.75; preference 
low: M = 3.48, p > .20). Black participants liked white- 
Oriented movies more when they were high in diversity 
seeking (M = 3.91) than when they were low in diversity 
seeking (M = 3.05; F = 5.64, p < .02); diversity seeking had 
no impact for ethnic-congruent black-oriented movies 
(preference high: M = 4.46; preference low: M = 4.40, p > 
.20). 


Discussion 


We found support for both H; and H5. As we anticipated, a 
match in ethnicity between potential viewers and movie 
actors enhanced attitude toward the movie only for distinc- 
tive black participants. Diversity seeking worked as hypoth- 
esized among all participants, enhancing attitude toward the 
movie only when movie and participant ethnicity were 
incongruent. The results are consistent with advertising 
research that shows that white consumers have no prefer- 
ence for white versus black sources featured in advertise- 
ments. However, these results are inconsistent with industry 
research that shows that white consumers tend to prefer 
mainstream or white-oriented movies over black-oriented 
movies (Easton 1991). To explore further what might be 
driving this discrepancy between our attitude results and 
observed phenomena, in Study 2, we evaluate the role of 
perceived familiarity as a mediating evaluation process 
between interpretations of ethnic-specific product cues and 
a product's appeal. 


Study 2 


People differ in their ability to make sense of ethnic- 
specific information when forming attitudes about ethnic- 
oriented products. Some may interpret a product's ethnic- 
specific cues in ways that make the product seem familiar to 
them, and others may not. Similarly, viewers' enjoyment of 
ethnic-oriented movies likely depends on the ethnic-specific 
knowledge they bring to bear on interpreting such movies. 
For members of the referent ethnic group of an ethnic- 
oriented movie, ethnic-relevant information is readily avail- 
able and accessible. The more accessible and easily 
retrieved this information, the more familiar the viewer 
should be with the story depicted in the movie (Tversky and 
Kahnemann 1974) and the more the viewer should like the 
movie (Hoyer and Brown 1990; Laroche, Kim, and Zhou 
1996; Loken and Ward 1990). However, such ethnic- 
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specific knowledge may be either unavailable or more diffi- 
cult to access for mainstream or other-ethnic people, and : 
consequently, the content of ethnic-oriented movies may be 
more difficult to understand and less familiar. This lack of 
familiarity with the meaning of the movie's message or 
story line will likely result in lower appeal for such a movie. 
Therefore, we investigate the following hypothesis regard- 
ing the mediating influence of familiarity with an ethnic- 
oriented product on attitudes toward that product: 


H3: (a) Ethnic congruence between a product and a consumer 
makes the product seem more familiar to the consumer, 
and (b) familiarity with a product enhances attitudes 
toward that product 


Method 


Stimuli development and pretesting. We created a single 
movie description for a comedy titled Coming Home. 
Movie ethnic orientation was conveyed through the ethnic 
group of the main protagonists (see Appendix A). Pretesting 
(n = 33) identical to that for the first study showed that the 
cue “Trish American” was more associated with white eth- 
nicity (M = 5.36) than the cue "African American" (M = 
3.71; Fd, 32) = 8.54, p < .01) and that African American 
was more associated with black ethnicity (M = 4.13) than 
Irish American (M = 1.97; F(1, 32) = 17.81, p « .01). 


Design, procedure, and participants. This study had a 2 
(movie ethnicity: black or white) x 2 (participant ethnicity: 
black or white) fully crossed between-subjects design that 
included genre preference as a covariate. Participants were 
67 black and 49 white American adults who were recruited 
from the participant pools at two universities, one on the 
West Coast and one in the Southeast. Participants ranged in 
age from 18 to 45 years; 62% were female. Participants 
were randomly assigned to either the black- or the white- 
oriented movie condition. They first read a description of a 
movie and then responded to questions about their attitude 
toward the movie, familiarity with the movie, genre prefer- 
ence, and demographics. 

Measures. After reading the movie description, partici- 
pants completed the same six movie attitude items as those 
in Study 1 (a = .95). In addition, they answered three seven- 
point Likert items that assessed their familiarity with the 
movie theme (“The movie would involve situations familiar 
to me," “The movie would be relevant to issues in my life,” 
and “The movie would address topics I have experienced"); 
we averaged the responses to yield a single familiarity mea- 
sure ( = .86). A single item that was identical to the one in 
Study 1 assessed participants' preferences for the comedy 
movie genre. 


Results 


Unless otherwise indicated, degrees of freedom are (1, 
115). The analysis of variance (ANOVA) model was signifi- 
cant for attitude toward the movie (F(4, 115) = 2.74, p « 
03) and approached significance for familiarity 
(F(4, 115) = 1.92, p < .11). Genre preference was signifi- 
cant and positive in both models. The interaction between 
movie ethnic orientation and participant ethnicity was sig- 
nificant for familiarity (F = 6.10, p < .02). We tested the 


hypotheses at p < .05 through a priori contrasts evaluated 
with one-tailed tests. 

Black participants preferred the black-oriented movie 
(M = 3.86) over the white-oriented movie (M = 3.35; F = 
2.49, p « .06), but white participants did not prefer the 
white-oriented movie (M = 3.50) over the black-oriented 
movie (M = 3.55, p > .20). Thus, initial results of Study 2 
support H, and are consistent with those of Study 1. H3, 
posited that ethnic congruence between tbe viewer and 
movie ethnic orientation would enhance familiarity of the 
movie for the viewer. As we expected, black participants 
felt more familiar with the black-oriented movie (M = 3.83) 
than with the white-oriented movie (M = 2.96; F = 4.74, p < 
.02), and white participants felt marginally more familiar 
with the white-oriented movie (M = 3.77) than with the 
black-oriented movie (M = 3.12; F = 1.93, p « .08). Fur- 
thermore, Hay is supported by a univariate regression, 
which shows that familiarity had a favorable impact on atti- 
tude toward the movie (B = .47, p « .01). As we anticipated, 
this effect was moderated neither by participant nor movie 
ethnicity, nor by their interaction (ps » .20). 

Because ethnic congruence had a direct impact on atti- 
tudes for black participants per our reassessment of Hj, it is 
possible to evaluate whether familiarity mediates the impact 
of ethnic congruence on attitude toward the movie for black 
participants. To do so, we followed the three-step procedure 
that Baron and Kenny (1986) outline. The results for H; and 
Ha, show that ethnic congruence favorably influenced both 
attitude toward the movie and familiarity (the first step of 
their mediation test). Consistent with Hay, familiarity posi- 
tively influenced attitude toward the movie (the second 
step). If mediation occurs, the previously significant effect 
of ethnic congruence on movie attitude will be weakened 
when familiarity is included in the original ANOVA model 
(the third step). Indeed, black participants' preferences for 
the black-oriented movie over the white-oriented movie 
became nonsignificant when familiarity was included in the 
original model (F = .17, p = .68). Therefore, familiarity is a 
mediator of the impact of ethnic congruence on attitude 
toward the movie for black participants. 


Discussion 


Again, the results for black participants are consistent with 
expectations and previous research. Black participants felt 
more familiar with black-oriented movie cues and held 
more favorable attitudes toward the black-oriented movie 
than the white-oriented movie. Mediation analysis shows 
that familiarity mediated the impact of an ethnic match on 
attitude toward the movie among black participants. The 
results of the movie manipulation for white participants 
were unremarkable; familiarity was marginally affected by 
ethnic congruence.4 A movie’s ethnic orientation, when 





4Puzzled by the lack of effects on white viewers, who are cru- 
cial for crossover, we conducted another study that built on the 
first two by manipulating the presence or absence of a targeting 
cue (“These movie concepts have been designed to appeal to 
people like you as the target market”) included with a movie 
description with the intent of enhancing felt targetedness, which 
has been found to influence attitudes among whites in previous 
research (Aaker, Brumbaugh, and Grier 2000). Again, we manipu- 


manipulated by actor ethnicity, ethnicity associated with the 
city in which the movie takes place, and ethnicity associ- 
ated with the music genre related to the plot; has little 
impact on white participants’ attitudes toward the movies to 
which they were exposed in the previous studies. 

This result suggests that white consumers’ attitudes 
toward ethnic-oriented movies would equally favorably dis- 
pose them toward seeing white- and black-oriented movies. 
Again, movie industry data contradict this interpretation. 
White consumers simply do not view black-oriented movies 
with the frequency that our attitude measures would imply 
(Easton 1991; Masters 1996). We offer two explanations for 
this observation: First, explicit attitude measures may be 
inappropriate because of social desirability and other 
biases. People may be unwilling to share their true feelings 
about other-ethnic movies (Devine 1989) or may adjust 
(consciously or subconsciously) their initial reactions 
toward more socially acceptable and less-biased responses 
(Martin, Seta, and Crelia 1990; Monteith 1993). Second, 
consistency between a person's overall attitude toward a 
movie and the actual behavior of seeing that movie at a par- 
ticular theater with specific others may be particularly low 
(for a review, see Kraus 1995). Temporal distance between 
measuring attitude toward a movie and actual attendance 1s 
likely to weaken the relationship between attitude and 
behavior (Davidson and Jaccard 1979). The movie itself as 
attitude object may be unrelated to the behavior of attending 
the movie, attenuating the relationship between the two 
(Ajzen and Fishbein 1977). 

To address these issues, we present a series of behav- 
ioral intention items in our final study to evaluate how our 
experimental factors influence intended behaviors in addi- 
tion to attitude toward the movie. We also evaluate hypothe- 
ses that propose that some marketing-mix factors may 
enhance or detract from an ethnic-oriented movie's attrac- 
tion among white consumers, depending on their level of 
diversity seeking. 


Study 3 
Most research shows that movies with a high level of ethnic 
embeddedness fare poorly with mainstream audiences, 
attract smaller audiences (Carter 2001), and make lower 
revenues (Alexander 2001) than movies with mainstream 
themes. Movies that are high in ethnic embeddedness are 
frequently produced by smaller, independent filmmakers 
with fewer resources to devote to hiring big-name stars, 
gaining access to distribution channels, and promoting their 
films (Acapulco Black Film Festival 1997; Alexander 2000; 
Johnson 2003). In addition to the overall ethnic-oriented 
theme of such movies, these other marketing-mix decisions 
likely reinforce or lessen the product's perceived ethnic 





lated movie ethnic orientation as black or white through the R&B 
or rock music cue, and participants included black and white 
adults and students from university participant pools. As we 
expected, black participants had more favorable attitudes for 
ethnic-incongruent movies when the descriptions were accompa- 
nied by the targeting cue than when they were not. However, the 
targeting cue had no impact on targetedness or attitudes for 
whites’ responses to the black-oriented movie, as was the case 
with our previous studies. 
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embeddedness. Unknown actors do not have the draw that 
famous actors do (De Vany and Walls 1997), reducing pos- 
sibilities for crossover. Smaller promotional budgets used 
selectively in ethnic-consistent media likely either fail to 
reach potential mainstream viewers or, if these viewers are 
exposed to promotions for the film, imply that the movie is 
not intended for them. Limited distribution in ethnic- 
oriented venues may limit consumption among mainstream 
consumers who are unaccustomed to being a numeric 
minority. 

Therefore, in this final study, we evaluate two hypothe- 
ses about how marketing decisions influence mainstream 
consumers’ behavioral intentions toward ethnic-oriented 
movies. Product factors that enhance a movie’s ethnic 
embeddedness (e.g., actor ethnicity, actor fame, ethnic- 
oriented plot) may contribute to perceptions that a movie 
represents the referent culture well. People who are inter- 
ested in authentic representations of other cultures (e.g., 
those who are high in diversity seeking) may be more likely 
to see films that are high in ethnic embeddedness than those 
who are not. Therefore, we hypothesize the following: 


Ha: Factors that contribute to an ethnic-oriented product's eth- 
nic embeddedness enhance behavioral intent for that prod- 
uct among mainstream consumers, particularly those who 
are high in diversity seeking. 


However, the same marketing-mix factors that enhance 
the ethnic embeddedness of a movie may also limit whether 
mainstream viewers are willing to see it. Specifically, lower 
marketing budgets, especially if used in targeted media, and 
limited distribution in ethnic-oriented venues not frequented 
by mainstream customers may reduce the likelihood that 
mainstream consumers will see the film, particularly if they 
were not otherwise favorably disposed toward the ethnic 
orientation of the movie (e.g., those who are low in diver- 
sity seeking). Therefore, we hypothesize the following: 


Hs: Factors that contribute to an ethnic-oriented consumption 
context reduce behavioral intent for ethnic-oriented prod- 
ucts among mainstream consumers, particularly those 
who are low 1n diversity seeking. 


Method 


Stimuli development and pretesting. We created two 
detailed movie descriptions, Killer Mind and The Story, 
which included the plot, a review (same across conditions), 
and additional manipulated information on promotion and 
distribution of the film (see Appendix A). In the first set of 
the movie descriptions, we include names of black actors 
and systematically manipulate four two-level factors of 
interest: marketing budget (large or small), location where 
the movie was being shown (associated with black or main- 
stream audiences), ethnic embeddedness of the plot (high: 
associated with black culture; low: associated with main- 
stream culture), and actor fame (famous or unknown). We 
pretested (n = 57) the actors’ names to ensure that they were 
either recognized as or inferred to be black actors and to 
ensure that they were either famous or unknown. Indeed, 
the two famous actors used in the study were recognized as 
such (M = 6.27) compared with the two fictitious actors 
used (M = 3.14; F(1, 227) = 360.24, p « .01). In a recogni- 
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tion task, 90.4% of participants recognized the names of the 
famous actors, and 95.6% did not recognize the names of 
the unknown actors (y7(1) = 169.00, p < .01). These four 
black actors were also identified as black (M = 6.09) com- 
pared with the white (M = 1.82) and Hispanic (M = 1.52) 
actors we included as foils in the pretest (F(2, 683) = 
852.95, p « .01). 

Marketing budget was either $27 million (just over 
2003's $25 million average; Friedman et al. 2003) in the 
large condition or $1.3 million (approximately 596 of the 
large budget; Alexander 2001) in the small condition. To 
make this manipulation more salient and meaningful, the 
text of the budget manipulation also includes the implica- 
tions of a large or small marketing budget on advertising 
reach. The location of the movie was stated as either where 
black viewers would be likely to see it (black condition) or 
where a majority of viewers would be likely to see it (main- 
stream condition). To effect the plot manipulation, we 
included a subplot in the movie description that stated 
either that black ethnicity was of focal interest to the story 
line (high ethnic embeddedness) or that another facet of the 
subplot was of focal interest to the story line (low ethnic 
embeddedness). 

In the second set of the movie descriptions, we created 
one mainstream control version for each movie in which 
there was no mention of actor name, race, or fame. The 
movie budget was large, the location was mainstream, and 
the ethnic embeddedness of the plot was low. 


Design, procedure, participants, and measures. In this 
design, the movie descriptions comprising the four manipu- 
lated factors are nested under the black condition of a two- 
level actor ethnicity factor, and the mainstream control 
descriptions are nested under the mainstream condition of 
this factor. We used the SAS design procedure to generate 
an incomplete factorial design with the four between- 
subjects factors plus the movie replicate for the two movies 
that enabled us to assess all main effects and two-way inter- 
actions among design factors.» This design yielded 16 dif- 
ferent conditions nested under black actor. Therefore, the 
design of this study was an incomplete 2 (actor fame: 
famous or unknown) x 2 (budget: large or small) x 2 (loca- 
tion: black or mainstream) x 2 (ethnic embeddedness of 
plot: high or low) plus control ANOVA model. 

Participants were 97 white American MBA students 
who were recruited from a participant pool at a southeastern 
university; they participated in this study as one of several 
studies in a group testing session. Participants ranged in age 
from 22 to 33 years; 37% were female. Of the 97 partici- 
pants, 86 were randomly assigned to read one of the 16 
black-oriented movie descriptions, and 11 were randomly 
assigned to read both of the mainstream movie descriptions. 
All participants first read a description of the movie to 
which they had been assigned and then responded to the 





?Power analysis suggests that we would require more than 500 
participants to evaluate the fully crossed design. Given that we had 
no prior hypotheses regarding three- and four-way interactions, we 
elected to study a more parsimonious and practical model that 
enabled us to evaluate main effects and two-way interactions but 
that required only approximately 100 participants. 


same questions about their attitudes toward the movie (a = 
.96) used in Studies 1 and 2. Then, they completed eight 
additional seven-point items that assessed the likelihood 
that they would see the movie in a theater or rent the video 
or DVD and whether they would do so alone, with a friend, 
with their spouse or partner, or with a work colleague. 
These questions comprise two additional fully crossed 
within-subjects factors: venue (see in theater or rent on 
video or DVD) and social context (alone, with friend, with 
spouse or partner, or with work colleague). Participants in 
the mainstream condition repeated this procedure for the 
second movie. Finally, participants provided measures of 
genre preference, diversity seeking, and demographic data, 
as in the previous studies. 


Results 


We initially analyzed attitude toward the movie with the 
aforementioned ANOVA model that included all main 
effects and two-way interactions among design factors 
nested under actor ethnicity (black versus mainstream con- 
trol) as well as the main effect of and interactions with 
diversity seeking. Although the model was significant 
(F(16, 96) = 3.00, p < .01), this significance was driven only 
by the covariates; no experimental design effects were sig- 
nificant (ps > .17). Again, movie attitudes for white con- 
sumers were unaffected by our experimental manipulations. 

For behavioral intentions, we also crossed the ANOVA 
model with venue and social context factors. No four- or 
five-way interactions among design factors and diversity 
seeking, venue, and social context achieved significance, so 
we dropped them from the final analyses. We included atti- 
tude toward the movie as a covariate, and unless otherwise 


stated, degrees of freedom are (1, 775). We tested hypothe- 
ses at p « .05 through a priori contrasts evaluated with one- 
tailed tests. Relevant cell means appear in Table 1. This 
model was significant (F(117, 775) 2 5.05, p « .01), and 
actor ethnicity had a main effect such that participants were 
more likely to see the mainstream control movies (M = 
3.75) than the black-oriented movies (M = 3.22; F = 10.80, 
p « .01), consistent with industry data. 

We hypothesized two effects on intended behaviors. 
First, in H4, we predicted that characteristics of an ethnic- 
oriented movie that added to its ethnic embeddedness 
would increase the chances that white customers would see 
the movie compared with ethnic-oriented movies that were 
low in ethnic embeddedness, particularly for high diversity 
seekers. Indeed, plot and actor fame interacted (F — 5.71, 
p < .02) such that all participants were more likely to see a 
movie with a highly ethnically embedded plot and unknown 
actors (M = 3.67) than with a less ethnically embedded plot 
and/or famous actors (for the remaining three cells, M = 
3.07; F = 3.25, p « .04). Furthermore, the significant tbree- 
way interaction among plot, budget, and diversity seeking 
was significant (F = 4.95, p « .02) such that low-diversity- 
seeking participants were less willing to see movies with 
highly ethnically embedded plots and large budgets (M = 
2.48) than other movies (for the remaining three cells, M = 
3.32; F = 5.70, p < .01), consistent with H4. High-diversity- 
seeking participants were equally willing to see movies 
regardless of plot and budget conditions. 

Second, in Hs, we predicted that other marketing factors 
(particularly distribution decisions) that increased the ethnic 
orientation of the consumption context would decrease the 
chances that white consumers would see an ethnic-oriented 


TABLE 1 
Study 3: Effect of Diversity Seeking and Movie Characteristics on White Participants’ Likelihood of 
Seelng a Movie with Black Actors 





Low 

Budget Small 
Location 

Ethnic 2.48 

Mainstream 3.91 
Plot 

High embeddedness 3.60 

Low embeddedness 2.48 

Unknown 

Venue 

Rent 3.14 

See 3.20 
Social Context 

Friend 3.28 

Partner or spouse 3.88 

Self 2.28 

Work colleague 3.25 


Diversity Seeking 
High 
Large Small Large 
3.13 3.32 3.31 
3.10 3.29 3.16 
3.10 3.61 3.27 
3.13 3.02 3.23 
Actor Fame 
Famous Unknown Famous 
3.11 3.83 3.26 
3.05 2.87 3.13 
3.82 3.83 3.87 
3.41 3.21 3.72 
2.18 2.71 1.93 
2.91 3.64 3.26 
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movie, particularly among low-diversity-seeking con- 
sumers. The significant interaction between budget and 
venue (F = 3.99, p < .05) shows that participants were less 
likely to see a high-budget movie in a theater (M = 2.92; 
F = 9.51, p < .01) than to rent it (M = 3.47); there was no 
difference in whether they would rent (M = 3.26) or see in a 
theater (M = 3.22) small-budget movies (p > .20). The 
three-way interaction among location, budget, and diversity 
seeking was significant (F = 11.49, p < .01) such that low- 
diversity-seeking consumers were less likely to see a small- 
budget film at an ethnic location (M = 2.48) than at a main- 
stream location (M = 3.91; F = 15.75, p < .01) and 
compared with a high-budget film (for two cells, M = 3.11; 
F = 12.74, p < .01). 

Actor fame interacted with venue and diversity seeking 
(three-way interaction: F = 3.23, p < .07) such that low- 
diversity-seeking participants were equally willing to rent 
or see movies regardless of whether the actors were famous 
(ps > .20). High-diversity-seeking participants were more 
willing to rent a movie with unknown black actors (M = 
3.83) than they were to see a movie with black characters in 
the theater or rent a movie with famous black actors (for 
three cells, M = 3.10; F = 8.46, p < .01). 

Finally, the person with whom participants would see a 
movie affected the types of movies they were willing to see. 
In the control condition, people were equally willing to see 
a movie with any other person, whether a friend (M = 4.27), 
a partner or spouse (M = 4.23), or a work colleague (M = 
3.98), but they were less willing to see a movie alone (M = 
2.52) than with these others (for three cells, M = 4.16; F = 
44.30, p « .01). This was not true for high- and low- 
diversity-seeking participants seeing black-oriented movies 
with famous or unknown actors in different social contexts 
(three-way interaction: F(3, 775) = 2.55, p « .05). Low- 
diversity-seeking participants were more willing to see a 
movie with a friend if it had famous actors (M = 3.82) than 
if it had unknown actors (M = 3.28; F = 2.07, p < .08); this 
difference was not significant for high-diversity-seeking 
participants (p » .20). High-diversity-seeking participants 
were more willing to see a movie with a partner or spouse if 
it had famous actors (M = 3.72) than if it had unknown 
actors (M = 3.21; F = 7.08, p < .01); this difference was not 
significant for low-diversity-seeking participants (p » .20). 
High-diversity-seeking participants were more willing to 
see a movie alone if it had unknown actors (M = 2.81) than 
if it had famous actors (M = 1.93; F = 2.67, p « .05). 


Discussion 


The results provide support for H4 and H; as well as insight 
into how high- and low-diversity-seeking white consumers 
interpret different cues and marketing tactics regarding 
ethnic-oriented products. As we predicted in Hy, all partici- 
pants were more likely to see a black-oriented movie that 
had a highly ethnically embedded plot and a low promo- 
tional budget than to see other types of black-oriented 
movies. We interpret this result to suggest that something 
about the authenticity of the movie as conveyed through the 
plot and lack of commercial push was more appealing for 
white viewers than black-oriented films of the more main- 
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stream comedic or "gangsta" genres. High-diversity- 
seeking participants also seemed to interpret the use of 
unknown actors positively, particularly for home rental, and 
were more likely to see black-oriented movies with 
unknown actors alone or with a friend than were low- 
diversity-seeking participants. In summary, white partici- 
pants, especially those high in diversity seeking, were 
favorably disposed toward seeing black-oriented movies 
that contained several cues (versus only one or few) that 
reinforced their ethnic orientation. 

Social norms seemed to have a different effect on low- 
and high-diversity-seeking participants. Notably, high- 
diversity-seeking participants were less likely to go with a 
partner or spouse to see a black-oriented movie that had 
unknown actors than one with famous actors, but low- 
diversity-seeking participants were less likely to go with a 
friend to see a black-oriented movie that had unknown 
actors than one with famous actors. Examination of these 
means in Table 1 suggests that the social risk associated 
with seeing a black-oriented film with unknown actors was 
higher with more casual acquaintances than with a partner 
or spouse for low-diversity-seeking participants. In contrast, 
high-diversity-seeking participants either were not bothered 
by such risk or had friends who were similarly favorably 
disposed toward black-oriented movies with unknown 
actors. Instead, high-diversity-seeking participants were 
less likely to see that movie with a partner or spouse who, 
perhaps, had little inclination toward such films. 


General Discussion 


The findings of our studies are consistent with prior 
research with regard to black participants. The results 
across the first two studies show that black participants pre- 
ferred black movies over white movies, echoing work on 
advertising response. The diversity-seeking characteristic 
enhanced attitudes toward white movies among black par- 
ticipants, compensating for the absence of a match in eth- 
nicity. Furthermore, we demonstrated that though famili- 
arity positively enhanced attitudes for all participants, 
familiarity mediated the impact of ethnic congruence on 
attitudes only for black participants. These results enable us 
to understand more clearly why mainstream movies attract 
ethnic minority viewers: Familiarity with their dominant 
cultural themes enbances attitudes for potential ethnic 
minority viewers, and ethnic minority viewers who were 
high in diversity seeking were more favorably disposed 
toward other-ethnic mainstream movies. 

The results among white participants also demonstrated 
that the diversity-seeking characteristic moderated whites’ 
attitudes, such that white participants who were higher in 
diversity seeking had more positive attitudes toward black 
movies than did white participants who were lower in diver- 
sity seeking. Although the main effect of the ethnic orienta- 
tion of the movie did not directly influence attitudes among 
white participants, it did affect behavioral intentions: The 
presence of black actors decreased the likelihood that white 
participants would see the movie compared with main- 
stream control conditions. The results show that marketing- 
mix factors interacted with the social context of product 


consumption to influence the behavioral intentions of 
whites toward black movies. 


Implications for Practice 


The crossover of ethnic-oriented products from niche seg- 
ments into larger consumer segments may have positive 
social and economic consequences. Ethnic-oriented prod- 
ucts such as movies serve as socialization vehicles, expos- 
ing people to a variety of images, cultures, and lifestyles not 
otherwise learned through personal experience (Gerbner et 
al. 1994; Tharp 2001). However, many ethnic-oriented 
products, particularly those reflecting U.S. ethnic minori- 
ties, are targeted solely toward the referent ethnic group 
because of concerns that other ethnic groups will be unin- 
terested (Carter 2001; Johnson 2003; Masters 1996). Our 
results identify strategies that marketers of ethnic-oriented 
films and other ethnic-oriented products might use to gener- 
ate crossover appeal along with social and economic 
benefits. 

Our results show that ethnic congruence enhances 
familiarity and that familiarity enhances attitudes among all 
consumers. Changing product characteristics to enhance 
appeal among target viewers is a standard movie industry 
practice (Diamond 1989; Pampel, Fost, and O'Malley 1994) 
that may be used fruitfully to increase crossover appeal of 
ethnic-oriented products among mainstream consumers. 
However, it may be more difficult (or undesirable) to 
change the characteristics of highly ethnically embedded 
movies whose story lines are inextricably bound to ethnic 
experiences simply to elicit feelings of familiarity among 
white audiences. Alternatively, marketers might induce feel- 
ings of familiarity through promotional changes (versus 
product changes) that focus on aspects of the movie or 
product that either transcend ethnic boundaries or hold 
unique appeal among relevant subsegments. Our results 
indicate that more concerted attention toward the creation 
of strategic crossover with multiple targeted promotional 
messages placed in carefully selected media outlets may 
prove successful. 

The findings also demonstrate that there is a subset of 
white viewers who may seek out cultural diversity in some 
of the products they choose. Marketers of ethnic-oriented 
films and other ethnic-oriented products may find success 
in inducing crossover in regions where high-diversity- 
seeking white consumers are located. Geodemographic 
research shows that there are distinct neighborhoods that 
are increasing rapidly in ethnic composition while retaining 
diversity across a range of ethnic groups, including whites. 
Places such as Orange County, Calif. (Y1 and Herndon 
2001), and Queens, N.Y. (Campos-Flores 2001), have 
favorable socioeconomic characteristics and critical masses 
of ethnic-group consumers. Whites who choose to live in 
these areas may well be those who seek out and value diver- 
sity. Such geographic concentration is ideal for distribution- 
based strategies to increase crossover potential of ethnic- 
oriented products such as films. For example, by placing 
highly embedded ethnic-oriented movies in such neighbor- 
hoods, marketers may efficiently and effectively target 
whites who are predisposed toward attending them. 


The results also demonstrate that the relationship 
between the degree of the ethnic embeddedness of a product 
and whites’ behavioral intentions toward that product is not 
Straightforward. Marketers of ethnic-oriented products 
sometimes attempt to water down their offerings to make 
them more palatable for mainstream or other-ethnic audi- 
ences. However, our findings indicate that this strategy may 
be unsuccessful for two reasons: First, low-diversity- 
seeking white consumers may be uninterested in an ethnic- 
oriented product regardless of its degree of ethnic embed- 
dedness simply because its ethnic orientation alone signals 
to these consumers that the product is not for them and thus 
reduces its appeal. Second, weakening the ethnic embed- 
dedness of the product may eliminate the very characteristic 
that appealed to high-diversity-seeking white and referent 
ethnic consumers in the first place, namely, genuine repre- 
sentation of the referent ethnic group. This counterintuitive 
(and even ironic) nature of the degree of ethnic orientation 
was evident in white participants’ behavioral intention rat- 
ings. Highly ethnically embedded films tend to have 
unknown actors, lower budgets, and sparser distribution, 
whereas less ethnically embedded films (mainstream ethnic 
films) tend to be supported with well-known actors, larger 
budgets, and more widespread distribution. As the typical 
ethnic fare, the mainstream ethnic films appeared to hold 
less appeal to whites in our sample, perhaps because such 
movies are perceived as no different from other mainstream 
movies and thus evoke no special interest. Ironically, these 
are most likely the ethnic movies that whites are made 
aware of through greater promotion and distribution efforts. 
Given that, in general, highly ethnically embedded films 
evoke greater appeal, especially among high-diversity- 
seeking whites, marketers may want to create different mar- 
keting strategies depending on how deeply the product- 
related cues are embedded in the referent ethnic culture and 
how these cues resonate with both referent ethnic con- 
sumers and low- and high-diversity-seeking mainstream 
consumers. 

Our findings similarly demonstrate that the strength of 
association of a particular ethnic group with a consumption 
context may also facilitate (or hinder) the consumption of 
ethnic-oriented products. Thus, the results emphasize the 
importance of considering not only how widely a product 1s 
distributed but also where the product is distributed and the 
ethnic meanings associated with the distribution context. 
Although changing the ethnic composition at or beliefs 
about an ethnic-specific venue or retail location may be 
challenging, creative promotional strategies can be used to 
overcome contextual barriers to consumption. For example, 
special events at the consumption venue that appeal to 
mainstream or other-ethnic audiences, sales promotions that 
encourage consumption among mainstream or other-ethnic 
audiences, and communications that signal that mainstream 
and other-ethnic consumers are especially invited to partake 
of ethnic-oriented offerings might ameliorate distribution- 
specific causes of crossover failure. In general, the results 
emphasize the significance of the ethnic composition of 
consumption contexts, the role of a person’s companion in 
facilitating or inhibiting other-ethnic product consumption, 
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and the ethnic embeddedness of the product in attracting 
non-target market consumers. 


Theoretical Contributions 


Our framework identifies key constructs that are relevant to 
the crossover of ethnic-oriented products and examines the 
relationships among these factors that affect an ethnic- 
oriented product’s ability to garner interest in and gain sales 
among mainstream and other-ethnic consumer segments. 
The framework generalizes beyond our application to white 
viewers’ predilection for black-oriented movies to other 
product categories, other referent ethnic groups, and other 
nontarget customer segments. Because services possess a 
high degree of social embeddedness (i.e., services are deliv- 
ered by people and frequently produced and consumed in 
the presence of other people), they may provide a particu- 
larly rich context in which to extend this framework. For 
example, the framework would help explain whether or not 
a white mother would take her biracial daughter with 
“frizzy” hair to an African American-oriented salon located 
in an African American neighborhood. The framework also 
provides a mechanism for understanding how the social, 
cultural, and historical beliefs about other referent ethnic 
groups (e.g., Asian, Hispanic) likely influence the crossover 
of products associated with these groups into mainstream 
and other-ethnic segments. Extending the framework fur- 
ther, it similarly provides an approach for understanding 
how to market a product that has become strongly associ- 
ated with a particular consumer segment on the basis of 
characteristics other than ethnicity (e.g., age, gender, sexual 
Orientation, or region) to a broader group of consumers. 
Our results also contribute to the theoretical understand- 
ing of the processes by which non-target market consumers 
form inferences about products that are associated with spe- 
cific consumer segments. One clear outcome of this 
research regarding such processes is the inadequacy of atti- 
tudes in predicting whites’ behavioral intentions toward 
ethnic-oriented products. In general, it is accepted in con- 
sumer contexts that attitude toward a product is somewhat 
indicative of purchase intent (Kraus 1995). However, 
research on the attitude—behavior link shows that this link 
may be attenuated by the social characteristics of potential 
consumption contexts (Zalesny and Ford 1990). We believe 
that social context explains our findings similarly. Although 
whites’ attitudes were seemingly unaffected by the ethnic 
orientation of a movie, it was not because ethnicity does not 
matter to these white consumers. Rather, our attitude mea- 
sures were decontextualized and did not reflect or include 
the appreciable influence of ethnic-oriented context on 
movie consumption behavior. Moviegoing is fundamentally 
a social activity (Austin 1989), and our results show that the 
ethnic orientation of the social context in which a person 
anticipates seeing a given movie has a profound impact on 
the decision to see that movie. Further research exploring a 
range of dependent measures that tap into purchase inten- 
tion and behaviors, perhaps implicitly to avoid social desir- 
ability biases, might show greater variability among and 
insights into white consumers and ethnic-oriented products. 
In addition, our findings extend burgeoning research on 
ethnic embeddedness (Appiah 2004; Brumbaugh 2002; 
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Williams 1995) that has examined the degree to which 
advertisements contain ethnic cues. Our studies examine 
not only the ethnic embeddedness of the product itself but 
also its consumption context. Our results are consistent with 
those of Appiah (2001a), who finds that some products 
associated through advertising with black culture may be 
desired by mainstream consumers more than products asso- 
ciated with mainstream culture. The associations related to 
culture alter the nature of the products themselves, giving 
them different meaning that causes them to resonate with 
certain, possibly unexpected, segments. Ethnic-oriented 
films that are high in ethnic embeddedness and that possess 
qualities that are authentic to the ethnic-group culture seem 
to be endowed with a similar ability to resonate with high- 
diversity-seeking consumers. Thus, our work extends the 
limited research on ethnic embeddedness in the advertising 
domain into products and consumption contexts. 


Limitatlons and Directions for Further Research 


As in all research, we made some concessions that create 
limitations in this research and opportunities for future 
study. Across all three studies, we assessed consumers’ 
responses to descriptions of movies rather than to filmed 
movie trailers. Although certainly not as rich as multimedia 
clips or advertisements, such descriptions are common in 
newspapers and television guides and serve an advertising 
function in and of themselves. Research shows that people 
use such descriptions to make moviegoing decisions 
(Cooper-Martin 1991), and industry research before final 
production relies on similar stimuli (Austin 1989). 
Nonetheless, additional investigations with movie adver- 
tisements and trailers could further increase the generaliz- 
ability of our results. 

À potential problem with our operationalization of 
movie venue location in Study 3 is its implied distance from 
our white participants’ homes. Specifically, our white par- 
ticipants may have inferred that they not only would be in 
the minority at the black movie venue but also would need 
to travel farther to reach that venue compared with the 
mainstream locations. Although follow-up analyses pre- 
clude this explanation,$ we readily concede that this is a 
limitation of Study 3 and encourage future researchers to 
test these contextual effects on crossover potential. 

Our findings reveal subtle but important interactions 
among marketing-mix factors that influence whites' will- 


6]t has been noted that the black/mainstream location factor may 
be confounded with perceived distance such that participants are 
less likely to see a movie at a black location than a mainstream 
Jocation because they perceive the former to be farther away than 
the latter. Additional analyses suggest that this is not the likely 
cause. We analyzed all of our behavioral items (whether partici- 
pants would see the movie in a theater under different conditions), 
with our covariates and location as our sole factor. Multivariate 
ANOVA results show no difference in means across dependent 
measures between black and mainstream locations (p » .20). Uni- 
variate results show that only the difference of whether a person 
would see the movie with a friend at a black versus a mainstream 
location approached significance (p « .06). Therefore, we con- 
clude that participants in this study are unlikely to associate the 
ethnic orientation of the location with distance. 


ingness to see ethnic-oriented movies. However, because of 
the complexity of the design and the number of factors we 
explored in Study 3, we were unable to investigate all com- 
binations of marketing and movie characteristics that might 
increase white consumers’ likelihood of seeing ethnic- 
oriented movies in theaters. For example, simply increasing 
the marketing budget to make high-diversity-seeking con- 
sumers aware that a highly ethnically embedded movie was 
showing in their area might increase their attendance. This 
effect 1s captured by a five-way interaction that we were 
unable to evaluate. Additional research could explore these 
interactions and relationships. 

Our results indicate that diversity seeking may be a 
fruitful segmentation variable, especially given the increase 
in ethnic-oriented products and the changing ethnic compo- 
sition of the marketplace. In Studies 1 and 3, we used a 
context-specific, single-item measure that we assessed after 
presenting our movie stimuli. Both the use of a single item 
and the timing of its administration in our studies are weak- 
nesses that could be remedied in further research. Although 
we provide support for this single item (see Appendix B) 
and believe that it is an adequate indicator of diversity- 
seeking consumption behavior in this moviegoing context 
(Drolet and Morrison 2001), additional studies might use 
existing related scales (Douthitt, Eby, and Simon 1999; 
Miville et al. 1999) to evaluate the extent to which this char- 
acteristic influences consumption of ethnic-oriented prod- 
ucts. Alternatively, further research might build on our steps 
toward the development of a context-neutral measure of 
diversity seeking related to consumption activities to iden- 
tify consumers who are predisposed toward products asso- 
ciated with other ethnic groups. These initial explorations 
indicate that this characteristic may have multiple dimen- 
sions that influence distinct consumption behaviors in dif- 
ferent ways. Our single context-specific item scratches the 
surface of what may be a rich area for future exploration. 

We conjectured in the discussion of our final study that 
consumers might use product cues to assess the authenticity 
of an ethnic-oriented movie. We suggested that highly eth- 
nically embedded plots with lesser-known actors and lower 
marketing budgets could be interpreted as more representa- 
tive of the ethnic cultures portrayed and that this authentic- 
ity makes such movies more attractive to high-diversity- 
seeking consumers. Authenticity 1s indeed an inference that 
marketers attempt to create in their target markets for ethnic 
products (e.g., Watts and Orbe 2002). Further research 
should address whether consumers make such assessments 
of authenticity and whether such assessments mediate (as 
our discussions imply they might) the impact of marketing 
tactics on behavioral intentions (cf. Grayson and Martinec 
2004). 

Finally, our research examined the traditional notion of 
crossover from a niche market into the mainstream market. 
However, a key to increasing the economic viability of 
ethnic-oriented products may be consumption by members 
of other ethnic minorities, who may identify with the refer- 
ent ethnic minority’s experiences and may be predisposed 
toward products associated with the referent ethnic minority 
group. Research that includes multiple ethnic groups will 


provide additional theoretical and practical insights into 
crossover among a variety of consumer segments. 

Our framework provides a robust structure in which to 
study diverse consumer segments’ consumption of an 
equally diverse portfolio of goods and services (Hirschman 
and Holbrook 1982). In applying this framework to movies, 
we have begun to understand how mainstream white con- 
sumers perceive ethnic-oriented products. Additional stud- 
ies that apply this framework to the crossover of other prod- 
uct categories, among different consumer segments, with 
other social characteristics and individual difference factors, 
and across a variety of consumption contexts will help mar- 
keters understand, predict, and enable crossover consump- 
tion in a multicultural marketplace. 


Appendix A 
Movie Descriptions 


Study 1 

My Turn Again (action genre). In this action-packed 
film, Tony Larkins, a rock/R&B music singer declining in 
popularity, marries a shy country girl and helps her become 
a famous actress. Her fame not only eclipses his, but he 
gradually realizes that he was used and plots his revenge. 


Thin Blood (drama genre). In a modern-day romance, 
Nathaniel Wellington/Tyrone Washington, a devoted son, 
must choose between caring for his cantankerous but well- 
meaning father and moving across the country to marry the 
woman he loves. While his mother wants him to stay near 
home, his sister, who fell out of her father’s favor, argues 
that he should do what he wants. Nathaniel is torn, and his 
uncertainty, although comic at times, just might lead to 
tragedy. 

The One (romance genre). This comedy finds two single 
professional women in Seattle/African American women 
in Detroit searching for The One in all the wrong places! 
Susan is a lawyer, and Jackie is a business school professor. 
Their time-consuming jobs mean they must make their 
search as "efficient" as possible. So they decide to place a 
"highly creative" singles ad in the local paper. Their creativ- 
ity is surpassed only by that of the variety of men who 
respond to their ads. 


Study 2 


Coming Home. This light comedy finds three Irish 
American/African American brothers suddenly living 
under the same roof for the first time since childhood. Their 
different experiences have changed each of them in ways 
unimaginable to the others. Each day brings new discover- 
ies and excitement for the brothers, as they each pursue 
careers, women, and a good time on the town! 


Study 3 


The Story. The main character in the film is a career- 
focused, up-and-coming newscaster at a local morning news 
show who believes she has everything going for her. When 
studio execs at a competing show take a possible interest in 
hiring her, her boss decides to get her out on the streets to 
start covering stories herself to see if she has what it takes to 
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TABLE A1 
Factor Loadings for Diversity-Seeking Subscales 








Learning Hirlng Eating Living 

Learning about other cultures is something | enjoy. .8275 .0835 .2670 .3672 
| view travel as an opportunity to learn about different cultures. .7608 0142 2265 .0796 
When choosing where to go on vacation, | favor places with people of 

different ethnic groups. 5017 .2958 .3614 .2065 
| think too much emphasis Is placed on diversity In hiring decisions. (R) .1040 ./768 .0890 .2220 
If | were hiring, | would try to seek out candidates who are different from 

the current work force in terms of ethnicity, race, gender, and sexual 

orientation. .0458 .6673 —.1134 .3442 
| think it Is Important to create a workplace that represents the local 

population In terms of ethnicity, race, gender, and sexual orlentation. .0548 .5892 .1926 .1687 
| enjoy trying different types of food from foreign countries. .3211 .0475 .6945 .0384 
| avoid going to restaurants with foods from novel places (e.g., Ethloplan 

or Peruvian). (R) .1202 .1004 .6582 .2363 
Other than Chinese or Mexican food, | don't really seek out "ethnic" 

foods. (R) .3188 .0545 .6381 .1183 
| would welcome a member of another cultural, ethnic, or racial group 

Into my family. .1409 .2137 .0746 .6879 
It is very important to me to live in a neighborhood with people from 

different ethnlc groups. .2459 .4024 .2011 .5808 
| would consider living in an area where there lived many people who 

were not from my ethnic group .1784 .2578 .2125 .5783 


Notes: R = Item Is reverse scored. 


be a serious journalist for the station. She is none too happy, 
however, when she is assigned to the best cameraman in the 
crew—the two have never gotten along, and sparks fly the 
minute they start working together. As it turns out, it is the 
first time a Black/this newscaster and cameraman have 
been paired together, and the comedic racial-oriented/—— 
subtext between them 1s a storyline all its own. 

Even worse, unbeknownst to the driven newscaster, her 
cameraman is really an undercover agent for the FBI work- 
ing to break up a complicated money laundering ring. As 
the newscaster covers her not-so-serious “lifestyle” stories, 
the hapless cameraman risks life and limb in outrageously 
comedic chase and fight scenes. As he pays less and less 
attention to the newscaster, she finds herself increasingly 
attracted to him. The romantic tension between the two of 
them is charming, and the contrast between the camera- 
man’s boring film job and his secret FBI life is hysterical. 
The plot twists and turns make for a movie that is both 
hilarious and endearing. 


«This movie has received excellent reviews from major, credi- 
ble film critics, generally on the order of 3.5 out of 4 stars. As 
a combination romance/comedy, critics believe it will have 
wide appeal to both men and women. 

*The film will open February 1, 2004. Though distributed 
nationally, this movie will be shown primarily where the 








Black/ population is most concentrated and in the- 
aters where Black/ moviegoers are most likely to 
attend. 


eLarge budget in control condition: This movie is supported 

by a very large/small promotional budget of $27/$1.3 mil- 
lion. Ad placements in national television and print outlets 
will dominate/will focus on local cable and outdoor out- 
lets—potential viewers will be made well aware of this 
movie’s release/may not be aware of this movie before it is 
released. 
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*Omitted in control condition: Veteran actors Angela Basset 
and Will Smith turn in outstanding performances in this 
film as the newscaster and her cameraman /Previously 
unknown actors Laquisha Brown and Tyrone Jefferson 
make their successful film debuts in this movie as the news- 
caster and her cameraman. 


Killer Mind. The movie begins with a clever twist, as a 
forensic investigator examines and dissects a grisly crime 
scene. It seems that people are being killed by a serial killer, 
and clues are few and far between. The local police team, 
led by a new detective and his veteran partner, avoid saying 
the killings were done by a serial killer until it becomes 
obvious and they absolutely have to—concern about public 
outcry weighs heavily on their minds, but they want to keep 
the details secret to avoid “copy cat” killings. When the new 
detective’s boss wants to take him off of the case because 
he’s new to the department, he successfully fights for his 
right to be the first Black/ in his department to work 
a serial murder. Issues of race/officer loyalty form a subtle 
backdrop against which the story unfolds, and the detectives 
are surprised to find out who their true friends are. 

Eventually, a world-renowned psychiatrist who lectures 
on serial killers is brought in on the case. An academic 
recluse, she reluctantly joins the detectives in their search 
for the murderer. As she examines the forensic evidence 
collected in the opening scene and subsequent crime scenes, 
she realizes that the killer is her cousin. It turns out the 
killer is playing games with the psychiatrist, making his 
capture one of the scariest and most exciting scenes in the 
movie. 





*Thought to be a “sleeper” in 2004, this action/suspense movie 
has garnered uniformly outstanding reviews from all critics, 
and it 1s believed that it will be in contention for several 
major awards. Though there are some graphic scenes, critics 


believe the combination of intrigue, mental gamesmanship, 
and clever dialog will draw both men and women into its sin- 
ister plot. 

*The film will open January 30, 2004. Though distributed 
nationally, this movie will be shown primarily where the 








Black/ population is most concentrated and ın the- 
aters where Black/ moviegoers are most likely to 
attend. 


eLarge budget in control condition: This movie is supported 
by a very large/small promotional budget of $27/$1.3 mil- 
lion. Ad placements in national television and print outlets 
will dominate/will focus on local cable and outdoor out- 
lets—potential viewers will be made well aware of this 
movie's release/may not be aware of this movie before it is 
released. 


*Omitted in control condition’ Veteran actors Will Smith 
and Angela Basset turn in outstanding performances in 
this film as the new detective and his partner./Previously 
unknown actors Tyrone Jefferson and Laquisha Brown 
make their successful film debuts in this movie as the new 
detective and his partner. 


Appendix B 
Diversity-Seeking Measure 


To our knowledge, a diversity-seeking scale does not exist. 
Although the Miville-Guzman Universality Diversity Ori- 
entation (Miville et al. 1999) and Receptivity to Dissimilar 
Others (Douthitt, Eby, and Simon 1999) scales are close, we 
believe that they reflect passive tolerance or acceptance of 
diversity rather than the proactive seeking out of diversity 
that we believe drives crossover consumption behaviors. 
Therefore, we created our single-item measure, “T like see- 


ing movies about other kinds of people," to capture this 
individual difference trait in a movie-viewing context. 
Although a multi-item scale would be preferred in general, 
research suggests that a multi-item measure is neither 


, always desirable nor necessary (Drolet and Morrison 2001). 


Indeed, additional analyses suggest that our single item is 
more than adequate in its application in Studies 1 and 3 
because it 1s related to other measures of diversity seeking. 
Specifically, additional testing (n = 95) regarding the 
preliminary development of a diversity-seeking scale shows 
that this single item is correlated with a 12-item measure 
(a = .85) of diversity seeking (r = .27, p < .01). Initial data 
analyses pertaining to the development of the diversity- 
seeking scale show that this scale comprises four subfactors 
(confirmatory factor analysis: %2? = 53.23, p = .28; adjusted 
goodness-of-fit index = .87). Our single-item measure was 
significantly correlated with three of the four subscales 
listed in Table A1 (learning: r = .27, p < .01; hiring: r = .20, 
p « .06; eating: not significant; living: r= .25, p « .01), sug- 
gesting that different facets of general diversity-seeking ten- 
dencies have different influences on domain-specific con- 
sumption behaviors. In addition, this item was correlated 
with the number of diversity-related behaviors (e.g., “I have 
been to a same-sex wedding,” and “I have spent the night in 
the home of someone of another ethnic group") out of 18 in 
which participants claimed to have engaged (r = .23, p < 
.02). Therefore, we are confident that this measure serves as 
an adequate representation of the diversity-seeking orienta- 
tion that we believe moderates the impact of product ethnic 
orientation on attitudes and behavioral intentions. 





REFERENCES 


Aaker, Jennifer A., Anne M. Brumbaugh, and Sonya A. Grier 
(2000), “Non-Target Market Effects and Viewer Distinctive- 
ness: The Impact of Target Marketing on Attitudes,” Journal of 
Consumer Psychology, 9 (3), 127-40. 

Acapulco Black Film Festival (1997), “State of Black Film,” pre- 
sentation at the Acapulco Black Film Festival, moderated by 
Warrington Hudlin, Acapulco, Mexico. 

Ajzen, Icek and Martin Fishbein (1977), “Attitude-Behavior Rela- 
tions: A Theoretical Analysis and Review of Empirical 
Research,” Psychological Bulletin, 84 (5), 888-918. 

Alexander, George (2000), “Fade to Black,” Black Enterprise, 
(December), 107-115. 

(2001), “Reaching the Silver Screen," Black Enterprise, 
(December), 89-93. 

Andruss, Paula Lyon (2004), “‘Dora’ Translates Well,” Marketing 
News, (October 13), 8. ; 

Appiah, Osei (20018), “Black, White, Hispanic, and Asian Ameri- 
can Adolescents' Responses to Culturally Embedded Ads," 
Howard Journal of Communications, 12 (1), 29-48. 

(2001b), "Ethnic Identification on Adolescents’ Evalua- 

tions of Advertisements,” Journal of Advertising Research, 41 

(5), 7-22. 

(2004), “It Must Be the Cues: Racial Differences in Ado- 
lescents’ Responses to Culturally Embedded Ads,” in Diversity 
in Advertising, Jerome D. Williams, Wei-Na Lee, and Curtis P. 
Haugvedt, eds. Hillsdale, NJ: Lawrence Erlbaum Associates, 
217-36. 

Austin, Bruce A. (1989), Immediate Seating: A Look at Movie 
Audiences. Belmont, CA: Wadsworth. 











Bannon, Lisa (2000), “Is the Country Ready for a Hispanic 
‘Rocky’? Hollywood Thinks Not,” The Wall Street Journal, 
(April 6), Al. 

Baron, Reuben M. and David A. Kenny (1986), “The Moderator- 
Mediator Variable Distinction in Social Psychological 
Research: Conceptual, Strategic, and Statistical Considera- 
tions,” Journal of Personality and Social Psychology, 51 (6), 
1173-82. 

Bearden, William O. and Michael J. Etzel (1982), “Reference 
Group Influence on Product and Brand Purchase Decisions,” 
Journal of Consumer Research, 9 (3), 183-94. 

Billig, Michael (1997), “Keeping the White Queen in Play,” in Off 
White: Readings on Race, Power, and Society, Michelle Fine, 
Lois Weis, Linda C. Powell, and L. Mun Wong, eds. New York: 
Routledge, 149-66. 

Brumbaugh, Anne M. (2002), “Source and Nonsource Cues in 
Advertising and Their Effects on the Activation of Cultural and 
Subcultural Knowledge on the Route to Persuasion,” Journal of 
Consumer Research, 29 (2), 258-69. 

Campos-Flores, Arian (2001), “Changing of the Guard; New 
Groups of Strivers Are Displacing the Old Ones,” Newsweek, 
(August 6), 48. 

Carter, Kelly (2001), “Studios’ Views of Black Films May Under- 
cut Success,” USA Today, (April 11), D4. 

Clunis, Andrew (2004), “No Sell-Out: Choice Bosses Deny 
Rumours," The Voice, (October 18-24), 8. 

Cooper-Martin, Elizabeth (1991), “Consumers and Movies: Some 
Findings on Experiential Products,” in Advances in Consumer 


Consumer Responses to Ethnic-Orlented Products / 49 


Research, Vol. 18, Rebecca Holman and Michael Solomon, 
eds. Provo, UT: Association for Consumer Research, 372-78. 

Cote, Joseph A., James McCullough, and Michael Reilly (1985), 
“Effects of Unexpected Situations on Behavior-Intention Dif- 
ferences: A Garbology Analysis,” Journal of Consumer 
Research, 12 (2), 188-94. 

Davidson, A.R. and J.J. Jaccard (1979), “Variables That Moderate 
the Attitude-Behavior Relation: Results of a Longitudinal Sur- 
vey,” Journal of Personality and Social Psychology, 37 (8), 
1364-76. 

De Vany, Arthur S. and W. David Walls (1997), “The Market for 
Motion Pictures: Rank, Revenue, and Survival,’ Economic 
Inquiry, 35 (4), 783-94. 

Devine, P.G. (1989), “Stereotypes and Prejudice: Their Automatic 
and Controlled Components,” Journal of Personality and 
Social Psychology, 56 (1), 5-18. 

Diamond, Helene (1989), “Lights, Camera, Action! A Film’s Not a 
Wrap Until Test Audience Says So,” Marketing News, 23 (19), 
10-11. 

Douthitt, Shane S., Lillian T. Eby, and Shana A. Simon (1999), 
“Diversity of Life Experiences: The Development and Valida- 
tion of a Biographical Measure of Receptiveness to Dissimilar 
Others,” International Journal of Selection and Assessment, 7 
(2), 112-25 

Drolet, Aimee L. and Donald G. Morrison (2001), “Do We Really 
Need Multiple-Item Measures in Service Research,” Journal of 
Service Research, 3 (February), 196—204. 

Easton, Nina J. (1991), “Good News/Bad News of the New Black 
Cinema; The Films Are Finally Coming Out in Force, but Mar- 
keting African-American Themes to a Broad Audience Is a 
Challenge Now,” Los Angeles Times, (June 16), 5. 

Fisher, Robert J. and Linda L. Price (1992), “An Investigation into 
the Social Context of Early Adoption Behavior,” Journal of 
Consumer Research, 19 (4), 477-86. 

Forehand, Mark R. and Rohit Deshpandé (2001), “What We See 
Makes Us Who We Are: Priming Ethnic Self-Awareness and 
Advertising Response,” Journal of Marketing Research, 38 
(August), 336-48. 

Friedman, Wayne, Jon Fine, Tobi Elkin, and Ann Marie Kerwin 
(2003), “Print Feels Heat as Movie Dollars Shift,” Advertising 
Age, (September 1), 17. 

Fuertes, Jairo N., Marie L. Miville, Jonathan J. Mohr, William E. 
Sedlacek, and Denise Gretchen (2000), “Factor Structure and 
Short-Form of the Miville Guzman Universality-Diversity 
Scale," Measurement and Evaluation in Counseling and Devel- 
opment, 33 (3), 157-69. 

Garafalo, Reebee (1993), “Black Popular Music: Crossing Over or 
Going Under,” 1n Popular Music: Politics, Policies, and Institu- 
tions, Tony Bennett, Simon Frith, Lawrence Grossberg, John 
Shepherd, and Graeme Turner, eds. London: Routledge, 
231-48. 

(1994), "Culture Versus Commerce: The Marketing of 
Black Popular Music," Public Culture, 7 (1), 275-87. 

Gerbner, George, Larry Gross, Michael Morgan, and Nancy Sig- 
norielli (1994), “Growing Up with Television: The Cultivation 
Perspective," in Media Effects: Advances in Theory and 
Research, Jennings Bryant and Dolf Zillmann, eds. Hillsdale, 
NJ: Lawrence Erlbaum Associates, 17-41. 

Gibbs, James L. (1999), “Television and Diversity: A Quantum 
Leap Model,” in A Nation Divided: Diversity, Inequality, and 
Community in American Society, Phyllis Moen, Donna 
Dempster-McLain, and Henry Walker, eds. Ithaca, NY: Cornell 

-University Press, 260-74. 

Grayson, Kent and Radan Martinec (2004), “Consumer Percep- 
tions of Iconicity and Indexicality and Their Influence on 
Assessment of Authentic Market Offerings,” Journal of Con- 
sumer Research, 31 (2), 296—312. 

Green, Corliss L. (1999), “Ethnic Evaluations of Advertising: 
Interaction Effects of Strength of Ethnic Identification, Media 





50 / Journal of Marketing, April 2006 


Placement, and Degree of Racial Composition, Journal of 
Advertising, 28 (1), 49—64. 

Grier, Sonya A. and Anne M. Brumbaugh (1999), “Noticing Cul- 
tural Differences: Ad Meanings Created by Target and Non- 
Target Markets," Journal of Advertising, 28 (Spring), 79—93. 

and Rohit Deshpandé (2001), “Social Dimensions of Con- 
sumer Distinctiveness: The Influence of Social Status on 
Group Identity and Advertising Persuasion," Journal of Mar- 
keting Research, 38 (May), 216-24. 

Guess, Terri P. (2004), "Diversity at the Fancy Food Show; Black 
Exhibitors Applaud New Pavilion for Minorities,” Network 
Journal, 10 (8), 25. 

Halter, Marilyn (2000), Shopping for Identity: The Marketing of 
Ethnicity. New York: Schocken Books. 

Hirschman, Elizabeth C. and Morris B. Holbrook (1982), “Hedo- 
nic Consumption: Emerging Concepts, Methods, and Proposi- 
tions,” Journal of Marketing, 46 (July), 92-101. 

Hoyer, Wayne D. and Steven P. Brown (1990), "Effects of Brand 
Awareness on Choice for a Common, Repeat-Purchase Prod- 
uct,” Journal of Consumer Research, 18 (2), 141-48. 

Hughes, Alan (2002), "Hip Hop Economy," Black Enterprise, 
(May), 70—75. 

Johnson, Lynn D. (2003), "Independent Black Filmmakers on 
Hollywood: The Distribution of Black Film," Bright Lights 
Film Journal, (accessed July 14, 2003), [available at 
www.brightlightsfilm.com/36/distribution].html ]. 

Kahn, Barbara E. (1993), "The Influence of Positive Affect on 
Variety Seeking Among Safe, Enjoyable Products,” Journal of 
Consumer Research, 20 (2), 257—10. 

(1995), "Consumer Variety-Seeking Among Goods and 
Services," Journal of Retailing and Consumer Services, 2 (3), 
139-48. 

Kozup, John C., Elizabeth H. Creyer, and Scot Burton (2003), 
“Making Healthful Food Choices: The Influence of Health 
Claims and Nutrition Information on Consumers’ Evaluations 
of Packaged Food Products and Restaurant Menu Items,” Jour- 
nal of Marketing, 67 (April), 19-34. 

Kraus, Stephen J. (1995), “Attitudes and the Prediction of Behav- 
ior: A Meta-Analysis of the Empirical Literature,” Personality 
and Social Psychology Bulletin, 21 (1), 58-75. 

Laroche, Michel, Chankon Kim, and Lianxi Zhou (1996), *Famili- 
arity and Confidence as Determinants of Purchase Intention: 
An Empirical Test in a Multiple Brand Context,” Journal of 
Business Research, 37 (2), 115—20. 

Loken, Barbara and James Ward (1990), “Alternative Approaches 
to Understanding the Determinants of Typicality," Journal of 
Consumer Research, 17 (2), 111-16. 

Martin, Ingrit M. and David W. Stewart (2001), “The Differential 
Impact of Goal Congruency on Attitudes, Intentions, and the 
Transfer of Brand Equity,” Journal of Marketing Research, 38 
(November), 471-84. 

Martin, Leonard L., John J. Seta, and Rick A. Crelia (1990), 
“Assimilation and Contrast as a Function of People’s Willing- 
ness and Ability to Expend Effort in Forming an Impression,” 
Journal of Personality and Social Psychology, 59 (1), 27-37. 

Masters, Kim (1996), “Testing the Faith: Preachers White Shakeup 
Hollywood’s Belief That Whites Won’t Go to Black Movies,” 
Time, (December 16), 73-74. 

McGuire, William J., Claire V. McGuire, Pamela Child, and Terry 
Fujioka (1978), “Salience of Ethnicity in the Spontaneous Self- 
Concept as a Function of One’s Ethnic Distinctiveness in the 
Social Environment,” Journal of Personality and Social Psy- 
chology, 36 (5), 511-20. 

Mellgren, James (2001), “The Bold New World of Condiments,” 
Gourmet Retailer, (October 1), 132-35. 

Miville, Marie L., Charles Gelso, Raj: J. Pannu, Will Liu, Pegah 
Touradji, Pauline Holloway, and Jairo Fuertes (1999), “Appre- 
ciating Similarities and Valuing Differences: The Miville- 








Guzman Universality-Diversity Scale,” Journal of Counseling 
Psychology, 46 (3), 291-307. 

Monteith, M J. (1993), “Self-Regulation of Prejudiced Responses: 
Implications for Progress in Prejudice Reduction Efforts,” 
Journal of Personality and Social Psychology, 65 (3), 469-85. 

Pampel, Fred, Dan Fost, and Sharon O’Malley (1994), “Marketing 
the Movies,” American Demographics, 16 (3), 48-52. 

Quester, Pascale G. and Justin Smart (1998), “The Influence of 
Consumption Situation and Product Involvement over Con- 
sumers’ Use of Product Attribute,” Journal of Consumer Mar- 
keting, 15 (3), 220-38. 

Ratner, Rebecca K. and Barbara E. Kahn (2002), “The Impact of 
Private Versus Public Consumption on Variety-Seeking Behav- 
ior," Journal of Consumer Research, 29 (2), 246-57. 

Ray, Paul H. and Sherry H. Anderson (2000), The Cultural Cre- 
atives: How 50 Million People Are Changing the World. New 
York: Harmony Books. 

Retail World (2000), "Asian Food Sales Double,” (August 7—18), 16. 

Richins, Marsha L. (1994), “Valuing Things: The Public and Pri- 
vate Meanings of Possessions,” Journal of Consumer Research, 
21 (December), 504—521. 

Sheppard, Blair H., Jon Hartwick, and Paul R. Warshaw (1988), 
“The Theory of Reasoned Action: A Meta-Analysis of Past 
Research,” Journal of Consumer Research, 15 (3), 325-43. 

Stayman, Douglas M. and Rohit Deshpandé (1989), “Situational 
Ethnicity and Consumer Behavior,’ Journal of Consumer 
Research, 16 (3), 361-71. 

Strong, Andrea (2002), “Worldly Sweets,” Restaurant Business, 
(March 15), 66-71. 

Tharp, Marye C. (2001), Marketing and Consumer Identity in 
Multiethnic America. Thousand Oaks, CA: Sage Publications. 

Tversky, Amos and Daniel Kahnemann (1974), “Judgment Under 
Uncertainty: Heuristics and Biases,” Science, (September 27), 
1124-31. 

Wakefield, Kirk L. and J. Jeffrey Inman (2003), “Situational Price 
Sensitivity: The Role of Consumption Occasion, Social Con- 
text, and Income,” Journal of Retailing, 79 (4), 199—212. 

Walkup, Carolyn (2003), “Mainstream Eateries Catch Sushi Fever, 
Net Raw Fish for Menus," Nation's Restaurant News, (April 
T), 8-9. 


Watts, E.K. and M. Orbe (2002), “The Spectacular Consumption 
of ‘True’ African American Culture: ‘Whassup’ with the Bud- 
weiser Guys?" Critical Studies in Media Communication, 19 
(1), 1-20. 

Whittler, Tommy E. (1989), “Viewers’ Processing of Actor’s Race 
and Message Claims ın Advertising Stimuli,” Psychology and 
Marketing, 6 (4), 287—309. 

(1991), "The Effects of Actors’ Race in Commercial 

Advertising: Review and Extension," Journal of Advertising, 

20 (1), 54—60. 

and Joan DiMeo (1991), “Viewers’ Reactions to Racial 
Cues in Advertising Stimuli," Journal of Advertising Research, 
31 (6), 37-46. 

Williams, Jacqueline A. (1995), “The Effect of Ethnic and Ameri- 
can Identification on Consumers’ Responses to Ethnic Adver- 
tising Appeals," doctoral dissertation, Department of Market- 
ing, Florida State University. 

Williams, Jerome D., William J. Qualls, and Sonya A. Grier 
(1995), “Racially Exclusive Real Estate Advertising: Public 
Policy Implications for Fair Housing Practices,” Journal of 
Public Policy & Marketing, 14 (Fall), 225-44. 

Wolf, Michael J. (1999), The Entertainment Economy: How 
Mega-Media Forces Áre Transforming Our Lives. New York: 
Random House. 

Xu, Jing, Soyeon Shim, Sherry Lotz, and David Almeida (2004), 
"Ethnic Identity, Socialization Factors, and Culture-Specific 
Consumption Behavior, Psychology and Marketing, 21 (2), 
93-112. 

Yi, Daniel and Ray Herndon (2001), “Diversity Turns O.C. into 
Mixer; Demographics: Census Data Show Increasing Chance 
of Encountering Those of Different Race or Ethnic Back- 
ground,” Los Angeles Times, (May 29), B1. 

Zalesny, Mary D. and J. Kevin Ford (1990), “Extending the Social 
Information Processing Perspective: New Links to Attitudes, 
Behaviors, and Perceptions,” Organizational Behavior and 
Human Decision Processes, 47 (2), 205—246. 








Consumer Responses to Ethnic-Orlented Products / 51 


Chris Pullig, Carolyn J. Simmons, & Richard G. Netemeyer 


Brand Dilution: When Do New 
Brands Hurt Existing Brands? 


Potential for trademark dilution occurs when a new brand introduces itself with an identical or substantially similar 
mark to that of an existing brand. Currently, there is neither a clear standard for legal proof of dilution nor widely 
accepted measures for managerial use. Using a shared-brand-network model, the authors use three different but 
conceptually related measurement methodologies (response latency, aided recall, and simulated choice) across 
five experiments to examine “blurring” dilution effects. The results suggest that when junior brands operate In simi- 
lar categorles and position on similar attributes, there is no immediate threat of blurring dilution. However, if the 
junior brand chooses to position on dissimilar attributes in a similar category, dilution of attribute associations is 
likely. When the junior brand operates in dissimilar categories, there is a threat of blurring dilution for both category 
and attribute associations. The final two studies demonstrate that dilution of junior brands (i.e., operating in dis- 
similar categories positioned on dissimilar attributes) can suppress consideration and choice probabilities for the 
senior brand and that this effect may extend over a period of time. 


ecently, tbe United States Supreme Court unani- (2) In what situations is dilution likely to occur in response 
mously ruled in favor of a small store against a to the presence of a junior brand? 
world-famous lingerie manufacturer and retailer 

(Mosely v. V. Secret Catalogue 2003). The owners of the 

small store, which sells lingerie and other "adult novelty" Background 


items, named their store Victor's Secret. Shortly after open- . 
ins: they! were sued: Dy Viona secrets which- claimed Trademark Infringement Versus Dilution 


protection of its brand name under the Federal Trademark Trademark dilution differs from the more familiar legal 
Dilution Act of 1995. At issue before the Supreme Court doctrine of trademark infringement. “Trademark infringe- 


was the legal standard for proof in trademark dilution cases. ment” occurs when a competing party (i.e., a junior brand) 
In its decision, the Court ruled that trademark holders (.e.,  US€S an identical or substantially similar mark (e.g, brand 
the senior brand, Victoria's Secret) must show evidence of Pame, slogan, symbol) that is already being used by an 
“actual dilution” by the junior brand (i.e., Victor's Secret). existing party (Le., a seme brand) such that consumers are 
Showing a mental association between the two marks or a likely to be confused, mistaken, or deceived about the 
likelihood of dilution is insufficient. At the same time, how- source of the goods being sold. HONO infringement can 
ever, trademark holders do not need to prove actual lost extend beyond competing brands. If a “not insubstantial 

sales to show that dilution has occurred. proportion of consumers are confused because they think a 


These vague guidelines define a broad area within brand has been extended into an unrelated category, a find- 
which case law establishes what constitutes the legal stan- ing of infringement would also be likely (Jacoby 2001). 
dard of proof for trademark dilution. However, how to Thus, as a legal doctrine, the primary focus of trademark 
establish and/or measure dilution is not only a legal and infringement is the protection of consumers from mislead- 
public policy issue but also one of concern to brand man- ing information. Conversely, “trademark dilution” is 


agers who work to maintain and leverage brand equity. This defined AS “the lessening of the capacity of a famous mark 
article addresses two questions pertinent to this issue: (1) to identify and distinguish goods or services, regardless of 
How can and should trademark dilution be measured? and the presence or absence of (1) competition between the 


owner of the famous mark and other parties or (2) likeli- 


e a a a de a a E TELE hood of confusion, mistake, or deception” (Federal Trade- 
ee ee Professor of Marketing, Hankamer School of = mark Dilution Act 1995, § 4, emphasis added). Thus, the 

usiness, Baylor University (e-mail: Chris_Pulllg@Baylor.edu). Carolyn J. ; f f dilution is th l f the brand 
Simmons is Visiting Associate Professor of Marketing, Williams School of DUM eae oo duos TOC HOD edades 
Commerce, Economics, and Polltics, Washington and Lee University (e- asset, or the protection of brand equity (Feldwick 1996; 
mail: simmonsc@wiu.edu). Richard G. Netemeyer is Ralph A. Beaton Keller 1993). 


Chalr of Free Enterprise, Mcintire School of Commerce, University of Vir- Dilution may take two broad forms: (1) tarnishment and 
ginia (e-mail: rgn3p 6 virginia.edu). This research was funded in part by (2) blurring (Simonson 1993). “Tarnishment” refers to a 
research grants from the Hankamer School of Business at Baylor Univer- lowered evaluation of a senior brand due to a junior brand. 


sity, Pamplin College of Business at Virginia Polytechnic Institute and . d : ; : 
State University, and McIntire School of Commerce at the University of The damage of tarnishment is fairly obvious in that there is 


Virginia. the attachment of a negative association to the senior brand. 
For example, consider the case in which the American 
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Express slogan “Don’t leave home without it” was used to 
promote condoms by another firm (American Express Co. v. 
Vibra Approved Lab. Corp. 1989). American Express 
believed that the association of condoms with its slogan 
could create a negative association for its traveler’s check 
brand, even though there was likely no confusion that it was 
the maker of the condoms. 

“Blurring” has been described as the “gradual whittling 
away" of the identity of the mark (brand) in the mind of the 
public (Schechter [1927] 1970) and/or the lessening of the 
uniqueness of the brand in the minds of consumers (Simon- 
son 1993) due to the emergence of a junior brand. Both of 
these perspectives hint at the weakening of important brand 
‘associations (e.g., product category, specific attributes). 
This weakening of associations may affect brand choice by 
making consumers less likely to consider the brand when 
making a decision in the category and less likely to associ- 
ate it confidently with desired attributes (Alba, Hutchinson, 
and Lynch 1991). For example, the Lexis data service 
argued that the introduction of the Lexus automobile poten- 
tially added a new category association that could blur the 
meaning of the brand in consumers’ minds (Mead Data 
Central Inc. v. Toyota Motor Sales, USA Inc. 1989). 
Although both forms of dilution merit examination, blur- 
ring is of more interest because it is less obvious, is less fre- 
quently studied, and lacks consensus on how it should be 
assessed. Thus, this article focuses on blurring dilution. 


Likelihood of Dilution or Actual Dilution 


A thorny issue in trademark dilution is how to prove dilu- 
tion. Before the Supreme Court’s ruling in the Victoria’s 
Secret case, a split of interpretation had developed between 
the Second and Fourth Circuits about whether the plaintiff 
in a trademark dilution case needed to prove actual harm to 
its famous mark or a mere likelihood of harm (Lee 2004; 
McCarthy 2004). A superficial reading of the ruling sug- 
gests that the Court took the Fourth Circuit’s side in requir- 
ing proof of actual dilution. However, the Court indicated 
that proof of actual dilution does not require evidence of 
lost sales or profits. Furthermore, it added that dilution 
. could be shown through circumstantial evidence, such as a 
mental association between the junior and the senior marks, 
when the two marks are completely identical. This part of 
the ruling has created substantial confusion by suggesting 
that proof of likelihood of dilution can be adequate. 

McCarthy (2004) interprets this ruling as creating two 
types of cases: one in which the marks are identical and one 
in which they are not. In the case of identical marks, a mere 
mental association may be adequate circumstantial evi- 
dence of dilution. When the marks are not identical, proof 
of actual dilution is required. Other legal scholars (e.g., Lee 
2004) view this part of the opinion as simply giving eviden- 
tiary weight to the degree of similarity between the two 
marks. They argue that it defies logic to apply two com- 
pletely different evidentiary standards to a mark that is 
identical in every aspect compared with a mark that is only 
slightly different (e.g., varies in font style). 

Furthermore, subsequent decisions have applied the evi- 
dentiary standard differently. Some district courts have 


interpreted the opinion to mean that when competing marks 
are completely identical, mere mental association (i.e., like- 
lihood of dilution) is sufficient circumstantial evidence of 
dilution. However, these decisions find that there is dilution 
only after a finding of likelihood of confusion, that is, trade- 
mark infringement (e.g., Nike v. Variety Wholesalers Inc. 
2003). Other district courts have interpreted the Court’s rul- 
ing to mean that a mental association is only one piece of 
evidence and that additional evidence of actual dilution is 
required even when the marks are identical (e.g., Savin 
Corp v. The Savin Group 2003). 

In summary, this split in legal opinion has generated 
considerable debate about how to demonstrate and measure 
dilution. Thus, a better understanding of how dilution can 
be measured can contribute to this debate and offer insights 
for managers who are concerned about maintaining brand 
equity and defending against trademark dilution. 


Dilutlon as a Change in Brand Knowledge 


Implicit in the legal doctrine of trademark dilution is the 
notion that dilution is a change in brand knowledge (Jacoby 
2001). In the case of blurring, it is the weakening of brand 
associations, such as the brand's category, attributes, adver- 
tising tag lines, and so forth. This view is consistent with 
the suggestion in brand management and trademark dilution 
literature that a brand’s value or equity resides in con- 
sumers' brand knowledge or memory network. 

For example, Keller (1993, p. 2) defines customer-based 
brand equity as “the differential effect of brand knowledge 
on consumer response to the marketing of a brand.” Two 
prior studies also view trademark dilution as a change in 
brand knowledge. Peterson, Smith, and Zerrillo (1999) 
focus on “brand dominance” and “brand typicality," which 
refer to the probability that a brand will be recalled given its 
category as a retrieval cue and the probability that a cate- 
gory will be recalled given the brand name as a retrieval 
cue, respectively. Morrin and Jacoby’s (2000) conceptual- 
ization is broader; it considers the weakening of associa- 
tions between the brand and any of its aspects or associa- 
tions, not just its category. Finally, legal scholars and public 
policy researchers suggest that dilution by blurring involves 
the formation of a shared brand knowledge network, in 
which knowledge about the senior and junior brands may 
compete for activation when a consumer thinks about the 
senior brand (Jacoby 2001; McCarthy 2004). Jacoby (2001, 
p. 1048) notes that the harm of dilution could take several 
forms: 


At the very least, the consumer would be required to 
engage in additional cognitive effort to parse out which 
network applied to the current circumstances [senior or 
junior brand]. Generally, the more effortful the activity 
(including a mental activity), the less likely it is to be per- 
formed... “[L]aboratory studies reveal that ... trademark 
dilution can reduce the strength of pre-existing brand 
associations through the creation of additional nodes in 
consumer's brand based memory networks. Further, expo- 
sure to the second comer's [junior brand] promotional use 
of the mark can reduce the accuracy and, to some extent, 
the speed of retrieval from memory of the first-user 
[senior brand] information. 
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Our conceptualization of dilution is consistent with these 
writings. ! 


The Conceptualization and Measurement of 
Dilution 


We view blurring as a weakening of the associations 
between a brand and its distinctive aspects (e.g., category, 
attributes, benefits) that define the meaning of the brand in 
consumers’ minds. We now consider how brand associa- 
tions are used in decision making and derive measures of 
dilution based on this conceptualization. 

When consumers retrieve brand knowledge from mem- 
ory, they may (1) use desired brand aspects as retrieval cues 
for a brand name or (2) use the brand name as a retrieval 
cue to access knowledge of the brand's aspects. Thus, we 
focus on the associative strength underlying each of these 
two processes. "Brand name accessibility" refers to the 
likelihood that the brand will come to mind when a brand 
aspect is used as a retrieval cue (e.g., how likely it is that 
Big Red will come to mind when a person thinks of chew- 
ing gum or cinnamon flavor). When brand name accessibil- 
ity decreases (i.e., is diluted), it is less likely that the brand 
will be recalled and enter the consumer's consideration set 
in decision-making situations (Alba, Hutchinson, and 
Lynch 1991). "Aspect accessibility" refers to the likelihood 
that a given brand aspect will come to mind when the brand 
name is used as a retrieval cue (e.g., how likely it is that 
chewing gum or cinnamon flavor will come to mind when a 
person thinks about the Big Red brand). When aspect acces- 
sibility decreases (i.e., is diluted), even consumers who seek 
the benefits offered by the brand are less likely to consider 
and choose the brand (Alba, Hutchinson, and Lynch 1991)? 

On the basis of this conceptualization, we use three 
dilution measurement methods: (1) response latency to rec- 
ognize brand associations, (2) recall of brand associations, 
and (3) brand choice. The first two measures directly assess 
brand name and aspect accessibility. Although both have 
been advocated as useful in the assessment of dilution 





lPeterson, Smith, and Zerrillo (1999) and Morrin and Jacoby 
(2000) are the first to measure and examine moderators of trade- 
mark dilution. Their studies have many strengths, such as an 
empirical investigation of brand-knowledge-based measures of 
dilution, the examination of multiple brands, the use of recogni- 
tion and recall measures, and the use of brand names and logos as 
stimuli. Nonetheless, their studies suggest the need for further 
empirical work. Limitations include weak and inconsistent effects, 
effects that occur only in comparison to a condition in which the 
senior brand's associations are reinforced, pooling of effects so 
that it is not possible to examine differential effects for many spe- 
cific brand associations, and a limited examination of a key mod- 
erator of dilution (attribute similarity is not examined). We extend 
their pioneering work with a research design that can address all 
these issues. 

2Keller (1993) divides brand knowledge into two similar com- 
ponents: brand awareness and brand image. Our brand name 
accessibility component parallels his brand awareness component, 
but our aspect accessibility component is less inclusive than his 
brand image component, because we include only specific associ- 
ations and not brand attitudes. Our focus on specific associations 
follows from understanding blurring as a weakening of association 
between a brand and its distinctive aspects. 
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(Morrin and Jacoby 2000; Peterson, Smith, and Zerrillo 
1999: Simonson 1993), the response latency measure has 
been singled out as particularly relevant. Reduction in the 
speed at which people are able to access, recognize, and 
verify brand associations is what Jacoby (2001, p. 1049) 
refers to as “the essence of a weakening of associations" for 
dilution. In contrast, our third measure addresses the conse- 
quences of reduced brand name or aspect accessibility, 
namely, reduced probability of consideration and brand 
choice. 


The Shared Brand Network and Predicting 
Dilution 


To understand how a junior-brand strategy may affect the 
senior brand, we consider how brand information is stored 
in and retrieved from memory. Associative network theory 
(Anderson 1983) suggests that brand information is 
encoded in long-term memory as a pattern of linkages 
between concept nodes (e.g., associations between the 
brand and its aspects). Although the senior and junior 
brands are not related from the perspectives of ownership, 
production, or marketing, they are related by a similar mark 
(e.g., name, logo, defining slogan). Thus, although con- 
sumers are likely to possess cognitive networks for both the 
senior and the junior brands, there is some connection 
between these networks due to the shared/similar trade- 
mark. To the extent that there are other similarities between 
the two brands (i.e., similar aspects), these two networks 
become even more interconnected (Jacoby 2001). Recent 
memory research shows that such related information is 
often encoded in one composite memory trace (Humphreys 
et al. 2000; Shiffrin 2003). However, regardless of whether 
the junior brand shares a composite memory trace with the 
senior brand, it is linked to the senior brand in memory 
through what we call a “shared brand network.” 

What are the implications of this shared brand network 
for the brand name and aspect accessibility of the senior 
brand? Consider how consumers retrieve knowledge from 
memory. At retrieval, cues activate corresponding nodes in 
memory, and this activation spreads to related constructs. 
The speed at which activation spreads is determined by the 
strength and proximity of linkages among constructs. The 
closer constructs are encoded in memory, and the stronger 
the linkages among them, the greater is the likelihood that 
related constructs will be activated. Because the senior and 
junior brands reside in a shared brand network, thinking 
about the senior brand activates associations for the junior 
brand (Anderson, Green, and McCulloch 2000). Further- 
more, because all associated constructs compete for activa- 
tion, these junior-brand associations are likely to reduce the 
probability and speed of activation of senior-brand associa- 
tions (Burke and Srull 1988). However, this competitive 
interference is likely moderated by the degree of similarity 
between the aspects of the senior and junior brands. Indeed, 
adding similar constructs to the shared brand network (e.g., 
when the junior brand shares similar aspects with the senior 
brand) is likely to increase the probability and speed of 
activation of senior-brand associations through the spread- 
ing of activation among these closely related constructs 
(Humphreys et al. 2000). 


Therefore, we expect that when the junior brand’s 
aspects (e.g., category, attributes) are dissimilar to the 
senior brand, the senior brand’s name accessibility and 
aspect accessibility decrease; that is, the senior brand expe- 
riences blurring dilution. When the junior brand’s aspects 
are similar to those of the senior brand, the senior brand’s 
name accessibility and aspect accessibility increase; that is, 
the senior brand experiences reinforcement of its distinctive 
associations. Although somewhat similar effects have been 
found in the brand extension literature, there are two key 
differences between this stream of research and our studies: 
First, most research in extension dilution examines incon- 
sistent or failed extensions, relying on measures of global 
brand evaluations (cf. John, Loken, and Joiner 1998), not 
accessibility (cf. Morrin 1999). Second, brand extensions 
are known to be related to the parent brand. In our research, 
the senior and junior brands are known to be unrelated. 
Although dilution applies equally to situations in which 
consumers are or are not confused about the relationship 
between the junior and the senior brands (Jacoby 2001), 
dilution comes to the forefront only when there is no 
confusion. 


How Do These Effects Occur In the Absence of 
Confusion? 


It may seem implausible that the dilution and reinforcement 
effects we predict occur without confusion about the rela- 
tionship between the senior and the junior brands. Why 
would the consumer not simply use his or her knowledge 
that the brands are unrelated to selectively “ignore” junior- 
brand associations when thinking about the senior brand? 
Again, consider how knowledge is retrieved from memory. 
When a stimulus such as a brand name is encountered, its 
associations are activated according to their accessibility. 
Highly accessible associations contribute to an “automatic 
interpretation” of the construct. This automatic interpreta- 
tion incorporates any blurring or reinforcement that results 
from associations with the junior brand and its aspects in 
the shared brand network. This automatic interpretation 
may be corrected through the consideration of less accessi- 
ble associations or externally available information, such as 
the knowledge that the brands are unrelated. However, such 
correction requires cognitive effort, and as a result, under- 
correction is common (Jacoby 2001; Johar and Simmons 
2000). Thus, the accessibility of senior-brand associations 
is influenced by junior-brand associations, even when there 
is no confusion, and dilution or reinforcement is the 


consequence. 


Study 1: Effects of Product 
Category Simllarity on Response 
Latencies 


The effect of category similarity on dilution is an important 
issue because the trademark dilution doctrine extends to 
noncompetitive brands (i.e., brands in different categories) 
and to brands in direct competition with the senior brand. 
Understanding where the danger lies can help public policy 
makers and brand managers monitor and control dilution. 
We examine the moderating effects of category similarity 


on reinforcement and dilution of the senior brand while 
holding the similarity of other brand aspects constant at a 
relatively neutral level. Recall that we expect high category 
similarity to lead to reinforcement of the senior brand and 
low category similarity to lead to dilution of the senior 
brand. We focus on the case in which the junior brand’s 
name is identical to that of the senior brand. 


H;: Category similarity moderates the effect of a same-named 
Junior brand on the senior brand’s distinctive associations. 
a. When category similarity is high, the senior brand is 
reinforced (i.e., brand name accessibility and aspect 
accessibility increase). 
b. When category similarity is low, the senior brand 1s 
diluted (i.e., brand name accessibility and aspect acces- 
sibility decrease). 


Method 


Overview. Figure 1 displays our procedure and analy- 
ses. Our senior brand is Big Red, a chewing gum that is 
strongly associated with cinnamon flavor. In Step 1, we 
expose participants to a series of advertisements, including 
one for a new brand (the junior brand or an identical 
unbranded product). The new product belongs either to a 
category that is highly similar to that of the senior brand 
(bubble gum) or to one that is dissimilar to that of the senior 
brand (snack bars). In Step 2, we measure two dependent 
variables: (1) senior-brand name accessibility, or the 
response latency to recognize correctly the brand name as 
“going with” the distinctive aspect given as a cue, and (2) 
senior-brand aspect accessibility, or the response latency to 
recognize correctly a distinctive aspect as “going with” the 
brand name given as a cue. (The senior brand’s distinctive 
aspects are chewing gum and cinnamon flavor.) In Step 3, 
we determine whether exposure to the junior brand 
increases or decreases these accessibilities compared with 
exposure to an identical unbranded product. If exposure to 
the junior brand increases brand name or aspect accessibil- 
ity (1.e., if response latencies decrease), the senior brand has 
been reinforced by the junior brand. If exposure to the 
Junior brand decreases brand name or aspect accessibilities 
(i.e., if response latencies increase), the senior brand has 
been diluted by the junior brand. Thus, our category simi- 
larity manipulation enables us to determine whether cate- 
gory similarity moderates reinforcement and dilution.3 

Design and sample. Pretest participants indicated (on 
seven-point scales) that they were highly familiar with Big 
Red chewing gum (M = 6.73) and strongly associated Big 
Red with cinnamon flavor (M = 6.52). Thus, we focus on 
these two distinctive aspects of Big Red (i.e., chewing gum 
and cinnamon flavor). Pretest participants also identified 
bubble gum and snack bars as high (M = 5.26) and low 
(M = 2.05), respectively, in similarity to chewing gum; this 
difference is significant (F; 1g = 42.39, p < .01). On the 
basis of these results, our design is a 2 (category similarity: 





3In Studies 1—3, we use an identical unbranded product as a 
contro] to isolate the effect of a junior brand. In Studies 4—5, we 
use a well-known brand as a control to increase ecological validity 
(consumers rarely encounter unbranded products) and external 
validity (our pattern of results is unaffected, increasing our confi- 
dence 1n their generality). 
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high [bubble gum] and low [snack bars]) x 2 (new product: - 


junior brand [Big Red] and unbranded [Brand A]) x 2 
(order of ad presentation) between-subjects design. We hold 
attribute similarity constant at a neutral level, as we 
describe subsequently. Participants were 89 randomly 
assigned undergraduate students. 


Procedure and materials. In a computer-administered 
procedure, participants saw three advertisements: one for 
bubble gum, one for bite-size candy, and one for snack bars. 
Two of these were filler advertisements, which were 
intended to reduce hypothesis guessing. The third advertise- 
ment was for the new product, either a bubble gum (high 
category similarity) or a snack bar (low category similarity). 
Advertisements in each similarity condition were identical 
except for brand name (Big Red or Brand A), and they con- 
tained a disclaimer that stated that the product is not manu- 
factured by the makers of Big Red (Brand A) chewing gum. 
We warned participants that some of the advertisements 
might be for products that were not yet branded and that 
these products would be identified with a letter. In all con- 
ditions, the two filler advertisements were for well-known 
products in the remaining two categories for that condition 
(Skittles bite-size candy and either Bazooka bubble gum or 
Nutri-Grain snack bars). 

Each advertisement contained a brand name, a product 
category, and two claims. We told participants that the 
advertisements were depicted in this simple fashion because 
we were interested in their reactions to the informational 
content of the advertisements. In the new product advertise- 
ments, we held attribute similarity with Big Red chewing 
gum constant at a neutral level with one similar and one dis- 
similar attribute. For the bubble gum, claims were "great 
cinnamon flavor" (similar to Big Red chewing gum) and 
"blow awesome bubbles" (dissimilar). For the snack bar, the 
claims were "great cinnamon flavor" (similar) and 
"enriched with vitamins and minerals" (dissimilar). In a 
pretest, these two dissimilar attributes were not rated as dif- 
fering in dissimilarity to Big Red chewing gum (M - 1.89 
and 1.31; Fi, 1g = 3.31, p > .09). Each advertisement was 
displayed for 30 seconds.4 The new-product advertisement 
was always second in the sequence, and presentation of the 
filler advertisements was counterbalanced. 

After presentation of the advertisements, we collected 
accessibility measures using procedures that Fazio (1990) 
described. First, we presented a brand name or an aspect. 
Participants focused on this cue and pressed the space bar 
when they were ready to see the second cue (an aspect or 
brand name, depending on which was not presented first). 
The second cue was then displayed on the next screen. Par- 
ticipants pressed a key to indicate whether the second cue 
went with the first cue (yes or no). They were instructed to 
work as quickly as possible without sacrificing accuracy. 
The computer captured the response latency (in millisec- 
onds) from the time the second cue was displayed until a 
response key was pressed. Accessibility trails were pre- 





*The 30-second display was intended to encourage participants 
to take the time to read the disclaimer. The viewing situation was 
more naturalistic (display time was not controlled) in Studies 3, 4, 
and 5. The results were similar. 


ceded by a set of practice trials that were unrelated to the 
brands presented in the advertisements. Participants prac- 
ticed until they were familiar with the procedure and could 
work quickly. The practice trials also provided a buffer 
between presentation of the advertisements and collection 
of dependent variables of interest. Following the accessibil- 
ity trials, we administered manipulation checks for category 
similarity and lack of confusion between the senior and the 
junior brands. We also measured familiarity with Big Red.5 


Results 


Manipulation and data checks. Participants indicated 
(on seven-point scales) that they were highly familiar with 
Big Red chewing gum (M = 6.03) and strongly associated 
Big Red with cinnamon flavor (M = 6.38); thus, the chew- 
ing gum category and cinnamon flavor are good exemplars 
of the brand’s distinctive aspects. As for the dissimilar 
attributes used in the advertisements for new products, par- 
ticipants indicated a weak association of Big Red with awe- 
some bubbles (M = 1.92) and with vitamins and minerals 
(M = 1.19). They also identified bubble gum as high (M = 
6.63) and snack bars as low (M = 2.95) in similarity to 
chewing gum; this difference is significant (Fi. 72 = 27.42, 
p « .001). We also collected a seven-point scale measure- 
ment of liking for the Big Red brand. Liking the brand did 
not differ across conditions, indicating that our manipula- 
tions did not create tarnishment effects (F3 ¢9 = 1.07, p > 
.10). Finally, a free-recall measure of ad content showed 
that participants were not confused about the relationship 
between the senior brand and the junior brand; only 3.6% of 
participants did not report the disclaimer, and this number 
did not vary by condition (x2 = .19, p > .10). We eliminated 
respondents who did not report the disclaimer. 

Although we instructed participants to work as quickly 
as possible without sacrificing accuracy, it is possible that 
some traded off accuracy for speed. That is, effects on 
accessibilities could be reflected in the accuracy of 
responses rather than in response latencies. For example, 
participants could make errors such that dilution shows up 
as a lower probability of accurately identifying cinnamon as 
an aspect of Big Red rather than as a longer response 
latency to recognize this association. Thus, we checked our 
data for such errors. The average error rate was low (4.2%) 
and did not vary by condition (all %2 < 1.0, ps > .10). This 
indicates that participant error in responding is not prob- 
lematic. Still, we eliminated these errors from our analysis. 

Hypothesis tests. We log-transformed response latencies 


to achieve distribution normality (Fazio 1990). We used 
each respondent’s mean latency on filler trials as a covariate 





"There were 92 trials 1n three blocks, with a 30-second rest 
between blocks. Experimental trials measured the accessibilities 
that we report in Table 1. Filler trials served to (1) decrease 
hypothesis guessing, (2) create a set of trials in which correct yes 
and no responses are equally represented and thus avoid creating a 
response bias, and (3) provide an average response time for each 
participant for use as a covariate. (The covariate isolates variance 
due to individual differences; this analysis should increase the sen- 
sitivity of our tests compared with Morrin and Jacoby's [2000] use 
of similar measures.) Within each block, we randomized the order 
of trials for each participant. 
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to isolate variance due to individual differences, thus reduc- 
ing error variance and increasing the power of our statistical 
tests. Order of presentation of the filler advertisements had 
no impact on response latencies (ps > .10), and so we do not 
discuss its effects further. 

H; predicts that category similarity moderates the effect 
of the junior brand on the senior brand, such that the senior 
brand’s association with its distinctive aspects is reinforced 
when category similarity is high (H;,) and diluted when 
category similarity is low (Hj). This prediction implies an 
interaction between category similarity and new product of 
a particular form. When similarity is high, senior—brand 
name and aspect accessibility should increase (and response 
latencies should decrease) when the new product is branded 
Big Red rather than when it is unbranded. When similarity 
is low, senior-brand name and aspect accessibility should 
decrease (and response Jatencies should increase) when the 
new product is branded Big Red rather than when it is 
unbranded. As we predicted, there is a significant category 
interaction between similarity and new product for all 
senior-brand accessibility measures (ps < .05). 

Response latency means for the new-product planned 
contrasts (Big Red versus unbranded) appear in Table 1. 
Reinforcement is shown by a significant, positive contrast 
value, indicating that senior-brand associations are more 
accessible when the new product is a junior brand than 
when it is unbranded. Dilution is shown by a significant, 
negative contrast value, indicating that senior-brand associ- 
ations are less accessible when the new product is a junior 
brand than when it is unbranded. As we predicted in Hia 
when category similarity is high, senior-brand name and 
aspect accessibility are reinforced; all four entries in Table 
1, Column 1, are positive and significant (Fs > 5.00, ps < 
.05). When category similarity is low, these accessibilities 
are diluted; all entries 1n Table 1, Column 2, are negative 
and significant (Fs > 4.00, ps < .05), ın support of Hy. 


TABLE 1 
Study 1: Mean Change In Accessibility (Means) 





Category Similarity 





Column 1: Column 2: 
High Low 
(Bubble (Snack 
Gum) Bars) 
Brand Name Accessibility 
(Aspect — Brand Name) 
Chewing gum — Big Red 212 —247 
Cinnamon — Big Red 302 —276 
Aspect Accessibility 
(Brand Name — Aspect) 
Big Red chewing gum 327 —119 
Big Red — cinnamon 174 —334 


ERE: za dicio duse dai aaa ctp pd C 
Notes. Mean changes in accessibility reflect our planned contrasts: 
response latency after exposure to unbranded product ver- 
sus response latency after exposure to Identical junior brand. 
Significant, positive and negative contrasts indicate rein- 
forcement and dilution, respectively, for the senior brand. All 
contrasts are consistent with prediction and are significant at 
p < .05. a > b indicates that a is a retrieval cue for retneving b. 
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Study 2: Effects of Category and 
Attribute Similarity on Response 
Latencies 


The results of Study 1 confirm our predictions about the 
effects of category similarity. However, category similarity 
is only one of many ways that junior and senior brands may 
be similar or dissimilar. Consider again the example of Vic- 
tor’s Secret. Lingerie is a major part of its business; thus, 
many consumers might perceive category similarity 
between Victor’s Secret and Victoria’s Secret. At the same 
time, Victor’s Secret may be viewed as different from Victo- 
rias Secret on many attributes (e.g., "classiness,' store 
appearance). What is the impact of this feature or attribute 
dissimilarity, and what happens when a junior brand oper- 
ates in a dissimilar category but attempts to leverage dis- 
tinctive attributes of the senior brand? For example, Sleep 
Inn was originally slated to be called McSleep Inn to com- 
municate the "basic, convenient, inexpensive, and/or stan- 
dardized qualities" associated with McDonald's (a dissimi- 
lar category with similar attributes; "McReading" 1988, p. 
32). What is the impact of this feature or attribute simi- 
larity? Study 2 addresses these questions to capture the 
complexity of real-world situations. 


Predictions 


A variety of evidence suggests that category similarity is 
likely to take precedence over similarity in other aspects 
(e.g., attributes) in the determination of whether a Junior 
brand dilutes a senior brand. Category (or relational) simi- 
larity, versus feature overlap (Tversky 1977), is likely to be 
accessed first and to have a greater influence on perceived 
similarity (Medin, Goldstone, and Gentner 1993). Thus, the 
encoding of new information (e.g., junior-brand knowledge) 
is likely to be guided by category similarity, such that closer 
associations are formed between senior and junior brands 
when category similarity is high. When category similarity 
is low, these associations should be more distinct and less 
tightly linked (Anderson, Green, and McCulloch 2000; 
Shiffrin 2003). Consistent with this view, it has been shown 
that category similarity influences the accessibility of con- 
structs in memory more than does structural commonalities 
(attribute matching; Forbus, Gentner, and Law 1994). 
Therefore, our predictions are based on the assumption that 
category similarity takes precedence over attribute or fea- 
ture similarity in the determination of how tightly linked 
junior-brand knowledge is with senior-brand knowledge in 
the shared brand network. 

When category similarity is high and the senior and 
junior brands form a closely connected shared brand net- 
work, aspect similarity should have strong effects due to 
spreading activation. Similar aspects should reinforce 
senior—brand name and aspect accessibility, whereas dissimi- 
lar aspects should dilute brand name and aspect accessibil- 
ity. Thus, senior-brand category associations (e.g., Big Red 
— chewing gum, chewing gum — Big Red) should be rein- 
forced (because the categories are similar). However, senior- 
brand attribute associations (e.g., Big Red — cinnamon, and 
cinnamon — Big Red) should be reinforced when attribute 
similarity is high and diluted when attribute similarity is low. 


When category similarity is low, senior-brand category 
associations should be diluted because the categories are 
dissimilar. Furthermore, because the senior and junior 
brands are encoded more distantly and distinctly in the 
shared brand network, similar attributes should have less 
ability to reinforce senior-brand attribute associations. 
Similar attributes are linked by their similarity, but because 
they are encoded in distinctive locations, the speed of acti- 
vation of this link is lessened. Therefore, we expect that 
when attribute similarity is low, dilution of attribute associ- 
ations is likely. When attribute similarity is high, reinforce- 
ment of attribute association is unlikely, though spreading 
activation along the relatively weak associations among 
similar attributes is likely to lessen dilution effects. (For all 
hypothesis tests, we measure both brand name and aspect 
accessibility; however, we expect no difference across these 
measures.) 


H5: Category and attribute similarity jointly determine the 
effect of a same-named junior brand on the senior brand’s 
distinctive associations. 

a. When category and attribute similarity are both high, 
the senior brand’s category and attribute associations 
are reinforced (i.e., accessibilities increase). 

b. When category similarity is high and attribute simi- 
larity is low, (1) the senior brand's category associations 
are reinforced (1.e., accessibilities increase), and (ii) the 
senior brand’s attribute associations are diluted (i.e., 
accessibilities decrease). 

c. When category similarity is low and attribute similarity 
is high, (1) the senior brand's category associations are 
diluted (i.e., accessibilities decrease), and (11) the senior 
brand's attribute associations are not reinforced and 
may be diluted (1.e., accessibilities do not increase and 
may decrease). 

d. When category and attribute similarity are both low, the 
senior brand's category and attribute associations are 
diluted (i.e., accessibilities decrease). 


Method 


Big Red chewing gum is again our senior brand. Our design 
is a 2 (category similarity: high [bubble gumj and low 
[snack bars]) x 2 (attribute similarity: high [cinnamon fla- 
vor and breath freshening] and low [strawberry flavor and 
containing immunity-boosting Echinacea]) x 2 (new prod- 
uct: junior brand [Big Red] and unbranded [Brand A]) x 2 
(order of ad presentation) between-subjects design. Pretest 
participants indicated (on seven-point scales) that they are 
highly familiar with Big Red chewing gum (M = 6.73), 
strongly associate Big Red with fresh breath (M = 4.63) and 
cinnamon flavor (M = 6.52), and believe that it is unlikely 
that Big Red chewing gum is strawberry flavored (M = 
1.41) or contains echinacea (M - 1.27). These likelihood 
ratings do not differ significantly from one another (F4, 24 = 
2.34, p > .10) and are both significantly lower than the like- 
lihood that Big Red chewing gum freshens breath (M = 
4.66) or is cinnamon flavored (M = 6.11; ps < .01). Partici- 
pants were 235 undergraduate business students randomly 
assigned to conditions. Procedures and materials were oth- 
erwise identical to those of Study 1. 


Results 


Manipulation and data checks. As we expected, partici- 
pants indicated (on seven-point scales) that they were 
highly familiar with Big Red chewing gum (M = 6.17) and 
strongly associated Big Red with fresh breath (M = 5.32) 
and cinnamon flavor (M = 6.21). Participants identified 
bubble gum and snack bars as high (M = 6.34) and low 
(M = 2.34), respectively, in similarity to chewing gum; this 
difference is significant (Fi, 212 = 11.66, p < .05). Partici- 
pants also indicated that strawberry flavor (M = 1.73) and 
echinacea (M = 1.18) were equally weak in their association 
with the Big Red brand (F;, 211 = .57, p > .10). A free-recall 
measure of ad content showed that participants were not 
confused about the relationship between the senior brand 
and the junior brand; only 1.3% of participants did not 
report the disclaimer, and this number did not vary by con- 
dition (x2 = 6.19, p > .10). Again, we collected a seven- 
point-scale measure of brand liking for the Big Red brand. 
Liking did not differ across conditions, indicating that our 
manipulations did not create tarnishment effects (F7, 188 = 
1.56, p » .10). The average error rate in the accessibility tri- 
als was low (2.3%) and did not vary by condition (x5 = 
9.02, p > .10). As in Study 1, we eliminated participants 
who failed to recall the disclaimer and those who made 
errors. 


Hypothesis tests. Again, we log-transformed response 
latencies to achieve distribution normality (Fazio 1990), and 
we used participants’ mean response latency on filler trials 
as a covariate to isolate variance due to individual differ- 
ences. Order of presentation of the filler advertisements had 
no impact on response latencies (ps > .10); thus, we do not 
discuss its effects further. Planned contrast values for the 
nontransformed response latencies appear in Table 2. Rein- 
forcement (dilution) effects are indicated as significant and 
positive (negative) contrast values. All contrasts that are 
consistent with prediction appear in bold. 

The results are as we predicted for 22 of the 24 con- 
trasts. As H2, predicts, when category and attribute simi- 
larity are both high, the senior brand is uniformly rein- 
forced; all six entries in Table 2, Column 1, are positive and 
significant (Fs > 6.00, ps < .05). Ha, predicts that when 
category similarity is high and attribute similarity is low, 
category associations are reinforced and attribute associa- 
tions are diluted. The results for category associations are 
mixed. As the first entry in Table 2, Column 2, shows, brand 
name accessibility for the category is reinforced, as pre- 
dicted (F; 44 = 12.91, p < .01), but aspect accessibility for 
the category is not (Column 2, fourth entry; Fy 4; = 2.07, 
p > .10). The results for attribute associations are all 
diluted; as we predicted, the second, third, fifth, and sixth 
entries in Table 2, Column 2, are negative and significant 
(Fs > 9.00, ps < .01). 

Ho) predicts that when category similarity is low and 
attribute similarity is high, category associations are 
diluted. The results are mixed here as well. Brand name 
accessibility for the category is diluted (Table 2, Column 3, 
first entry; Fy 44 = 12.91, p < .01), but aspect accessibility 
for the category is unaffected (Table 2, Column 3, fourth 
entry; F1, 42 = 2.07, p > .10). H2, predicts that when cate- 
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TABLE 2 
Study 2: Mean Change in Accessibility (Means) 


High Category Simllarity 


Low Category Similarity 


(Bubble Gum) (Snack Bars) 
Column 1: Column 2: Column 3: Column 4: 
High Attribute Low Attribute High Attribute Low Attribute 
Similarity Similarity Similarity Similarity 
(Cinnamon, (Strawberry, (Cinnamon, (Strawberry, 
Fresh Breath) Echinacea) Fresh Breath) Echinacea) 
Brand Name Accessibility (Aspect — Brand Name) 
Chewing gum — Big Red 195* 243* —86" —234* 
Cinnamon — Big Red 152* —346* —144* —342* 
Fresh breath — Blg Red 145* —66* 95 —255* 
Aspect Accessibility (Brand Name — Aspect) 
Big Red chewing gum 182* —41 —132 —402" 
Big Red — cinnamon 51* —-327* —116* —316* 
Big Red — fresh breath 349* —250* 63 —340* 





“Contrast Is significant at p « .05. 


Notes: Mean changes in accessibility reflect our planned contrasts: response latency after exposure to unbranded product versus response 
latency after exposure to junior brand. Significant, positive and negatlve contrasts indicate reinforcement and dilution, respectively, for 
the senior brand. Bolded contrasts are consistent with prediction. a — b indicates that a is a retrieval cue for retrieving b. 


gory similarity is low and attribute similarity is high, 
attribute associations are not reinforced and may be diluted. 
Consistent with this prediction, two of the four planned 
contrasts demonstrate dilution (Table 2, Column 3, second 
and fifth entries; Fs > 4.00, ps « .05), and two exhibit null 
effects (Table 2, Column 3, third and sixth entries; Fs « 
2.20, ps > .05). Finally, as H24 predicts, when category and 
attribute similarity are both low, the senior brand is uni- 
formly diluted, as evidenced by the six negative and signifi- 
cant values in Table 2, Column 4 (Fs » 6.00, ps « .05). 


Summary and Discussion of Studles 1 and 2 


Dilution effects. Study 1 manipulated product category 
similarity and held attribute similarity constant at a moder- 
ate level. As we predicted, the accessibility of distinctive 
senior-brand associations was reinforced with high category 
similarity and was diluted with low category similarity. 
Study 2 manipulated both product category and attribute 
similarity. As predicted, in general, accessibility of the 
senior brand's category associations was reinforced when 
category similarity was high and was diluted when category 
similarity was low (though this effect did not hold for two 
aspect accessibility measures). As predicted, the accessibil- 
ity of the senior brand's attribute associations depended on 
both category and attribute similarity. When category simi- 
Jarity was high, the senior brand's attribute associations 
were reinforced by attribute similarity and were diluted by 
attribute dissimilarity. When category similarity was low, 
the senior brand's attribute associations were not reinforced 
by attribute similarity; indeed, they were often diluted. 

From a theoretical standpoint, our results suggest that 
category similarity determines the extent to which senior 
and junior brands are connected in the shared brand net- 
work. Further work examining the structure of the shared 
brand network can help better understand when junior 


60 / Journal of Marketing, April 2006 


brands pose the greatest threats and why. From managerial 
and public policy standpoints, our results offer insights into 
the understanding of present dilution and the future risk of 
dilution. Present dilution is likely when there are important 
dissimilarities between the senior and the junior brands, 
particularly at the category level. This insight is important 
because brands in different categories intuitively seem less 
of a risk, perhaps because they seem less likely to confuse 
consumers with respect to their (lack of) relationship with 
the senior brand. If confusion is indeed less likely, the dilu- 
tion doctrine is an important source of trademark protection 
in this case because unlike infringement, it extends to situa- 
tions in which there is no consumer confusion. 

In contrast, the future risk of dilution is high when there 
are important similarities between the senior and the junior 
brands, particularly at the category level. Present dilution is 
unlikely—rather, the senior brand is likely to be rein- 
forced—but the close association between the junior and 
the senior brands in the shared brand network means that 
future actions by the junior brand could create harmful 
associations that readily affect the senior brand (Swann and 
Davis 1994). In this situation, brand managers may find that 
proving infringement through evidence of confusion is 
more feasible. If a case for infringement cannot be made, 
careful monitoring for emerging evidence of dilution will 
be particularly important. It is noteworthy that similar 
junior brands (and risk of future dilution) may be common. 
As the shared brand network conceptualization predicts, in 
terms of the creation of desired associations, junior brands 
benefit the most when they are similar to the senior brand 
(Pullig, Simmons, and Netemeyer 2005). 

Finally, it appears that the greatest challenges in detect- 
ing and proving present dilution occur when the junior 
brand is in a dissimilar category or possesses dissimilar 
attributes, but not both. In these cases, dilution is not uni- 


form across associations, and some associations (e.g., the 
category association when category similarity it high) may 
be reinforced. Thus, dilution is more difficult to demon- 
strate. For the brand manager, careful measurement and an 
appreciation of which associations are likely rather than 
unlikely to be diluted are particularly important. 


‘Response latency measure. The response latency mea- 
sure is a good candidate for meeting the broad guidelines 
for evidence of dilution outlined by the Court. It goes 
beyond the idea of "mere association" between brands 
(rejected by the Court as inadequate) and shows an impact 
on how consumers process the senior brand, though it falls 
short of showing actual effects on sales (which the Court 
has deemed to be unnecessary). However, although 
response latencies provide a theoretically viable way to 
measure dilution, the procedures are cumbersome. Study 3 
examines how well a simpler paper-and-pencil aided-recall 
task can replicate the effects of Study 2. Such a measure 
might provide a more tractable measurement methodology 
for the field. 


Study 3: Effects of Attribute and 
Category Similarity on Aided Recall 


We expect aided-recall measures to be less sensitive to dif- 
ferences in the accessibility of brand knowledge than 
response latency recognition measures. Aided recall 
requires the respondent to retrieve or recollect an associa- 
tion, whereas successful performance of a recognition task 
may be based solely on familiarity (Long and Prat 2002). 
Familiarity involves an assessment of the similarity 
between a target and the related knowledge in memory, not 
the explicit retrieval of the target from memory. Further- 
more, familiarity may vary from weak to strong, whereas 
the retrieval required by recall is largely a threshold-based 
process. Therefore, although we expect that, in general, the 
aided-recall measure of dilution will replicate the effects we 
observed in Study 2, we also expect that it will show fewer 
of these effects. 


Method 


We use a paper-and-pencil aided-recall method to assess 
reinforcement and dilution across levels of category and 
attribute similarity. Our design is identical to that of Study 
2, with one exception. In a within-subjects manipulation, 
we vary the order of recall: brand name recall first versus 
aspect recall first. We included this order manipulation for 
two reasons: First, because successful recall in a paper-and- 
pencil task is more cognitively and physically demanding, 
we were concerned about fatigue effects in reported recall. 
Second, because a recall task does not measure strength of 
association, just whether that strength exceeds the threshold 
necessary for recall, we were particularly concerned about 
differential transfer effects (e.g., giving participants Big 
Red as a retrieval cue for brand aspects increases the proba- 
bility that they will subsequently recall Big Red when they 
are given an aspect as a retrieval cue). 

Participants were 448 undergraduate business students 
randomly assigned to conditions. We asked them to evaluate 


at their own pace the same advertisements as those used in 
Study 2. Then, to provide closure to these processing 
instructions and to measure recall of the disclaimer, we 
asked participants to list all the aspects they remembered 
from the advertisements. Then, they participated in an 
unrelated study, which took approximately 15 minutes and 
included a series of arithmetic calculations on product 
search time and price expectations; simple arithmetic tasks 
are an effective way to clear short-term memory (Spears 
1978). Finally, we administered the recall task. 

We measured aspect accessibility by asking participants 
to recall all that came to mind when they thought of each 
brand from a list of brands, including the brand of interest 
(i.e., Big Red chewing gum). We measured brand name 
accessibility by asking them to recall what came to mind 
when they thought of a list of aspects, including the aspects 
of interest (i.e., chewing gum and cinnamon flavor). The 
order of aspect accessibility and of brand name accessibility 
measures was counterbalanced. Successful retrieval of the 
Big Red brand name in response to a distinctive aspect 
(e.g., chewing gum) or successful retrieval of a distinctive 
aspect (e.g., chewing gum) in response to the Big Red brand 
name is coded as a 1, and unsuccessful retrieval is coded as 
a 0. Two independent coders, who were unaware of the pur- 
pose of the research, coded the recall responses. Intercoder 
agreement was 94%. All differences were resolved through 
discussion. Following the recall task, participants com- 
pleted manipulation and confounding check measures. 


Results 


Manipulation and data checks. Participants indicated (on 
seven-point scales) that they were highly familiar with Big 
Red chewing gum (M = 5.87) and strongly associated Big 
Red with cinnamon flavor (M = 6.35) and fresh breath (M = 
4.91). They also identified bubble gum and snack bars as 
high (M = 6.07) and low (M = 2.84), respectively, in simi- 
larity to chewing gum; this difference 1s significant 
(Fi. 480 = 43.67, p < .01). Participants also rated strawberry 
flavor (M = 2.17) and echinacea (M = 2.22) as equally weak 
in their association with the Big Red brand (F; 4&6 = 1.14, 
p > .10). A free-recall measure of ad content showed that 
participants were not confused about the relationship 
between the senior brand and the junior brand; only 2.1% of 
participants did not report the disclaimer, and this number 
did not vary by condition (x^; = 4.20, p > .10). We elimi- 
nated participants who failed to recall the disclaimer. Order 
of presentation of the filler advertisements had no impact on 
proportions of successful recall (ps > .10); thus, we do not 
discus its effects further. Recall that task order resulted in 
significant differential transfer for all recall measures (ps < 
.05); therefore, the following analyses are based on the first 
recall task (brand name or aspect) for each respondent. 


Hypothesis tests. As in Study 2, we test H, regarding the 
effects of category and attribute similarity on dilution and 
reinforcement of the senior brand. Our dependent measure 
is the successful recall of brand aspects, a nominal variable; 
thus, we use a logistic regression model. As in Study 2, our 
planned contrasts compare participants who were exposed 
to the junior brand with those who were exposed to an iden- 
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tical unbranded product. Contrast values and significance 3, Column 4, supports this prediction for each of the aspect 
tests appear in Table 3. Contrasts that are consistent with accessibility measures (fourth, fifth, and sixth entries; Wald 
the predictions in H5 appear in bold. In Study 2, using statistics > 4.00, p < .05) and for brand name accessibility _ 
response latency measures, we found that 22 of the 24 con- when the category is used as a retrieval cue (chewing 
trasts were consistent with prediction. In Study 3, using an gum — Big Red; Wald statistic = 4.07, p < .05). 
aided-recall measure, we found that though all contrasts 

were in the predicted direction, only 16 of the 24 contrasts Discussion 

were consistent with prediction when statistical significance 


, i As we expected, recall measures of dilution were less sensi- 
is considered. 


i ; — tive to differences in accessibility than the response latency _ 
Hoa pe dicts that when ERISEOIY and attribute similarity recognition measure. However, all contrasts in Table 3 are 
are both high, the senior brand is reinforced. As we show 1n in the predicted direction, and 67% meet our statistical cri- 


Table 3, Column 1, four of the six accessibility measures teria (recall that some null effects are predicted); this recall 


are consistent with this prediction (Wald statistics > 3.41, measure is highly correlated with our response latency mea- 
p < .05). Ho, predicts that when category similarity is high sure from Study 2 (r = .75, p < .001). Thus, the data are 
and attribute similarity is low, the senior brand's category 4 i 


associations are reinforced. This prediction holds for only 
one of the two measures; when category is used as a 
retrieval cue, brand name accessibility is reinforced (chew- 
ing gum — Big Red; Wald statistic = 3.98, p < .05). H 
also predicts that when category similarity is high and 
attribute similarity is low, attribute associations are diluted. 
This prediction holds for only one of the four measures. 
When the brand name is used as a retrieval cue, aspect 


accessibility for cinnamon flavor is diluted (Big Red — cin- Study 4: Effects of a Junior Brand 


namon flavor; Wald statistic = 4.07, p « .05). : : . 
Ho; predicts that when category similarity is low and on — span Choice 
attribute similarity is high, category associations are ropaDiiities 


diluted. This prediction is supported (Column 3, first and From a practical perspective, dilution is of interest because 


encouraging about the viability of a recall measure. How- 
ever, these results emphasize the need to consider multiple 
measures in the assessment of dilution. For example, it may 
make sense to trade off sensitivity for convenience and use 
a recall measure for general monitoring purposes but 
response latencies for a. more in-depth understanding or 
when proof of dilution is crucial. 





fourth entries; Wald statistics > 3.71, p < .05). Ha, pre- it can affect sales in the marketplace. Thus, a valid measure 
dicts that attribute associations are not reinforced and may of dilution captures changes in brand knowledge that lead to 
be diluted in this situation. Consistent with prediction, reduced consideration and choice of the senior brand. Theo- 
aspect accessibility for cinnamon flavor is diluted (Big retically, the reduced brand name and aspect accessibility 
Red — cinnamon flavor; Wald statistic = 6.74, p < .01), and we observed using response latency and recall measures 
all other attribute associations in this condition are unaf- have such effects. We now examine whether situations that 
fected. Finally, H4 predicts that when both category and — . lead to dilution, as indicated by response latency and recall 
attribute similarity are low, the senior brand is diluted. Table measures, reduce the probability of consideration and 
TABLE 3 
Study 3: Change in Proportion Recalling Brand Name/Aspects 
High Category Similarity Low Category Similarity 
(Bubble Gum) (Snack Bars) 
Column 1: Column 2: Column 3: Column 4: 
High Attribute Low Attribute High Attribute Low Attribute 
Similarity Similarity Similarity Similarity 


(Cinnamon, (Strawberry, (Cinnamon, (Strawberry, 
Fresh Breath) Echinacea) Fresh Breath) Echinacea) 


Brand Name Accessibility (Aspect —^ Brand Name) 


Chewing gum — Blg Red .30* 29" —.21* —.23" 

Cinnamon — Big Red 12 .01 —.06 —.10 

Fresh breath — Big Red .05 —.02 —.04 ~.05 
Aspect Accessibility (Brand Name — Aspect) 

Big Red — chewing gum .23* .19 —.20* —.23* 

Big Red — cinnamon .19* —27* `~ =—31* —.36" 

Big Red fresh breath .22* —.01 —.03 —.12* 


*Contrast is significant at p < .05. j 

Notes. Contrast values reflect the proportion successfully recalling brand name/aspects after exposure to junior brand versus the proportion 
successfully recalling brand name/aspects after exposure to control brand. Posrtive (negative) values Indicate reinforcement (dilution) 
effects. Bolded contrasts are consistent with prediction. a — b Indicates that a is a retrieval cue for retrieving b. 
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choice. We focus on the situation that has most consistently 
shown evidence of dilution through accessibility measures, 
that is, when both attribute and category similarity are low. 
= H}: When category similarity and attribute similarity are low, 
given distinctive aspects as a choice criterion, the proba- 
bility of the senior brand being included in a consideration 

set decreases. l 
H4: When category similarity and attribute similarity are low, 
given distinctive aspects as a choice criterion, the proba- 

bility of the senior brand being chosen decreases. 


Method ` 


The design is a 3 (senior brands: Big Red, Gap, and Trix) x 
2 (condition: diluting versus control) factorial design in 
which we manipulate the senior brand between subjects and 
the condition within subjects. Participants were 219 under- 
graduate students who participated in one diluting condition 
and two control conditions. As in our previous studies, we 
achieved manipulations through ad exposures using a 
computer-administrated procedure. Advertisements in the 
diluting condition were for a junior brand in a dissimilar 
category and with dissimilar attributes (e.g., when Big Red 
` was the senior brand, the junior brand was Big Red straw- 
berry snack bars). Advertisements in the control conditions 
were identical except that we identified the brand as a well- 
known brand in the category (e.g., when Big Red was the 
senior brand, the advertisement in the control condition was 
for Nutri-Grain strawberry snack bars). After participants 
were exposed to the advertisements, we asked them to 
choose a brand in each senior-brand category, given distinc- 
tive aspects of the senior brand as a choice criterion. We 
examine how consideration and choice of the senior brand 
are affected by exposure to a junior brand (versus exposure 
to an identical product that does not use the senior brand's 
name). 

Previous results showed that students are highly familiar 
with Big Red chewing gum, that cinnamon flavor is 
strongly associated with Big Red, and that the snack bar 
category and strawberry flavor are viewed as dissimilar to 
Big Red's distinctive aspects. Therefore, one of our junior 
brands is Big Red strawberry snack bars, and our choice cri- 


terion is long-lasting cinnamon flavor. Furthermore, pretest 
participants indicated (on seven-point scales) a high level of 
familiarity with Gap clothing (M = 6.82) and Trix cereal 
(M = 6.22); that cotton and khakis are strongly associated 
with Gap (M = 6.10 and 6.52, respectively); and that color- 
ful, fun, fruity, and for kids are strongly associated with 
Trix (M - 6.40, 5.20, 6.06, and 6.56, respectively). Pretest 
participants further indicated that the bed sheets category is 
dissimilar to clothing (M = 2.04) and that the cake mix cate- 
gory is dissimilar to cereal (M = 2.21). Finally, pretest par- 
ticipants indicated that stain resistant is an unlikely attribute 
of Gap clothing (M = 3.13) and that low fat is an unlikely 
attribute of Trix cereal (M = 2.13). Therefore, our two other 
junior brands are Trix low-fat cake mix and Gap stain- 
resistant sheets. Our clothing choice criterion is "a good 
source for cotton khakis," and our cereal choice criterion is 
"a colorful, fun, fruity cereal for kids." 

After viewing the advertisements, participants engaged 
in practice trials to learn the computer-administered choice 
task and then took part in the main experiment. The choice 
task was a simulated shopping experience. The brands were 
represented in color as brand logos or packages of equal 
size. Participants were asked to choose a set of up to four 
brands that they would consider, given the choice criterion. 
Then, this set of brands was displayed on the screen, and 
participants were asked to choose the brand option that best 
satisfied the choice criterion. In each step of the simulated 
choice process, the brands were displayed on the screen in 
random order. This was done for each of the three senior- 
brand categories. Following the choice task, participants 
were asked to recall ad content and to respond to manipula- 
tion and confounding check measures. As in the prior stud- 
ies, we deleted any respondent who failed to report the dis- 
claimer (1.2%); this did not vary by condition (%5 = 1.52, 
p > .10). 


Results and Discussion 


The results appear in the top half of Table 4. When exposure 
to the junior brand resulted in significantly reduced proba- 
bility of consideration or choice, significant differences in 
proportions appear in bold. In support of H3, the likelihood 








TABLE 4 
Studles 4 and 5: Proportion of Respondents Considering and Choosing Senior Brand 
Consideration Cholce 
Exposed to Junior Brand? Exposed to Junior Brand? 
Senlor Brand ' No Yes No Yes 
Study 4: No Delay 
Big Red 85 52 .60 .15 
Trix .72 55 31 .08 
Gap .68 49 25 09 
Study 5: Five-Day Delay 
Big Red .60 .48 .43 22 
Trix .69 52 33 31 
Gap 45 -29 .38 .17 


ED O ———————— — á— ————— É—————————————— 
Notes: Bolded pairs of proportions within the “Considered” and “Choice” columns are significantly different at p < 05 and are consistent with 


prediction. 
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of the senior brand being included in a consideration set 
was significantly decreased with the introduction of a dis- 
similar junior brand for two of the three brands (y^ = 17.56, 
p < .01, for Big Red; and X2 = 4.55, p < .05, for Gap). In 
support of Hy, the probability of choosing the senior brand 
was significantly reduced in all three categories (y^ = 
19.87, p < .01, for Big Red; x7 = 4.18, p < .05, for Trix; and 
x4 = 3.89, p < .05, for Gap). Thus, conditions that resulted 
in measured dilution when we used accessibility measures 
also resulted in decreased consideration and choice of the 
senior brand, as we predicted. Furthermore, these results 
have generality across brands. 


Study 5: Consideration and Choice 
Effects with Delay 


In each of the four preceding studies, we examined dilution 
effects shortly after exposure to the junior brand. In Study 
5, we examined whether effects on consideration and choice 
probabilities occur when there is a substantial delay 
between exposure to the junior brand and the choice involv- 
ing the senior brand. The design and procedures are identi- 
cal to those of Study 4, except that the procedure is admin- 
istered with paper and pencil, and there is a five-day delay 
between exposure to the diluting advertisements and the 
choice task. Participants were 49 randomly assigned under- 
graduate students. 

The results appear in the bottom half of Table 4. Signifi- 
cant differences appear in bold. Consideration and choice 
effects are apparent even with a five-day delay between 
exposure to the junior brand and the choice task. Exposure 
to the junior brand significantly decreased consideration 
probability for two of the three brands (y^ = 4.93, p < .05, 
for Trix; and %4 = 3.63, p < .05, for Gap) and also decreased 
choice probability for two of the three brands (y^ = 5.39, 
p < .05, for Big Red; and 47 = 4.84, p < .05, for Gap). 
Together, Studies 4 and 5 provide strong evidence of the 
power of a junior brand to decrease the consideration and 
choice of a senior brand. 


General Discussion 


We focused on two questions of interest to brand managers 
and public policy makers: First, how can and should trade- 
mark dilution be measured? Second, when is it likely to 
occur? We draw on our results to address these questions 
and identify important research issues. 


How Can and Should Trademark Dilution Be 
Measured? 


We defined blurring dilution as a weakening of the associa- 
tions between a brand and its distinctive aspects (e.g., cate- 
gory, attributes, benefits) that define the meaning of the 
brand in consumers' minds and, at a more concrete level, as 
a reduction in brand name and aspect accessibility. We then 
operationalized dilution as response latency to recognize 
distinctive aspects (Studies 1 and 2) and as aided recall of 
distinctive aspects (Study 3). In Studies 4 and 5, we did not 
directly measure dilution but rather its predicted effects on 
consideration and choice. 
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Construct validity. This work provides strong evidence 
of the construct validity of our measures. First, our mea- 
sures have content, or face, validity. Definitions of blurring 
center on the loss of brand meaning coupled with the prob- 
ability of reduced economic value, particularly through lost 
sales (Federal Trademark Dilution Act 1995; Schechter 
[1927] 1970; Simonson 1993). Our accessibility measures 
capture losses in brand meaning that are theoretically linked 
to reduced brand choice. Less accessible brand names are 
less likely to be recalled and less likely to enter a con- 
sumer's consideration set when they are encountered in the 
shopping environment. Less accessible aspects are less con- 
fidently associated with the brand in choice situations 
(Alba, Hutchinson, and Lynch 1991). 

Second, we provide evidence of convergent and dis- 
criminant validity. Our response latency recognition mea- 
sure is highly correlated with our aided-recall measure, but 
it is uncorrelated with brand liking, ensuring that we are 
measuring blurring dilution rather than tarnishment. 

Finally, our measures have nomological validity (Cron- 
bach and Meehl 1955); that is, the measured construct 
behaves as it is predicted to behave. As we predicted, cate- 
gory and attribute similarity moderate measured dilution. 
Furthermore, the low-similarity situations that lead to mea- 
sured dilution also lead to decreased consideration and 
choice. 


Viability as legal evidence. Our measures also fall 
within the guidelines set by the Court; they go beyond mere 
association but do not extend to actual lost sales, and thus 
they are potentially useful for brands seeking protection 
from dilution (Morrin and Jacoby 2000). Response laten- 
cies are likely the most sensitive measure of dilution, but 
accessibilities can also be measured in ways that are more 
amenable to field survey methods (e.g., aided recall; Peter- 
son, Smith, and Zerrillo 1999). Furthermore, as more is 
learned about individual differences in dilution effects—for 
example, people with higher brand familiarity and category 
knowledge (Morrin and Jacoby 2000) may exhibit smaller 
effects—this knowledge may be taken into account when 
selecting an appropriate dilution measure. 


Blurring versus tarnishment. Although we isolated blur- 
ring dilution in these experimental studies, blurring and tar- 
nishment may occur together. Junior-brand associations 
may be both unattractive and dissimilar to those of the 
senior brand. Furthermore, severe blurring may itself lower 
brand evaluations. Therefore, as a practical matter, firms 
may wish to assess multiple types of negative junior-brand 
effects routinely: blurring of distinctive aspects, accretion 
of unattractive associations, and lowered brand evaluations. 
The kinds of accessibility measures we examined herein 
should prove particularly useful for assessing the extent to 
which negative junior-brand aspects have become associ- 
ated with the senior brand (Morrin and Jacoby 2000). 


When Is Dilution Likely? 


Our results suggest that blurring dilution is least likely in 
copycat strategies, that is, when a junior brand emerges in a 
similar category to the senior brand and with other similar 
aspects (e.g., high attribute similarity). This finding contra- 


dicts what is often brand management’s gut reaction: that 
highly similar junior brands are the most serious threats. 
However, we agree that similar junior brands create an 
exposure for the senior brand. By virtue of its strong inter- 
connections with the senior brand in the shared brand net- 
work, the junior brand has the power to dilute a senior 
brand through any future actions that create dissimilar asso- 
ciations (Pullig, Simmons, and Netemeyer 2005; Swann 
and Davis 1994). In contrast, our data show that junior 
brands in similar categories but with dissimilar associations 
can dilute senior-brand knowledge. 

Case law implicitly recognizes the danger from junior 
brands in similar categories (Vapnyar 2003). However, the 
legal requirement for evidence of dilution means that the 
senior brand may be unable to protect itself preemptively 
from the risk of future dilution by a highly similar junior 
brand. Fortunately, measurable dilution is likely to be found 
in all other situations we examined. Our results provide 
some information about where to expect dilution (e.g., not 
for category associations in similar categories) and where 
effect sizes are likely to be smallest (e.g., dissimilar cate- 
gory, similar attributes) to guide the measurement process. 


Further Research 


Three issues deserve special attention: First, how similar 
must trademarks be to create dilution? The Ninth Circuit 
recently ruled that Orbit bicycles were not diluted by the 
junior brand OrbiTrek exercise bicycles, because the marks 
were not sufficiently similar (Thane Int’l Inc. v. Trek Bicy- 
cle Corp. 2002). Brand managers should be concerned with 
this line of reasoning. Judgments about what constitutes 
sufficient similarity should be guided by an in-depth under- 
standing of the dimensions of similarity (e.g., visual, con- 
ceptual, lexical, phonological) that link a junior brand with 
a senior brand. Such knowledge could also alert brand man- 
agers to junior brands that offer greater or lesser dilution 
exposure to the senior brand. 

Second, are some types of trademarks more susceptible 
to dilution than others? For example, a recent trend is to 
create “fanciful” or “coined” brand names (e.g., Verizon, 
Accenture), which do not have suggestive associations with 
the product they mark. An advantage of fanciful marks is 
that their associations are (at least initially) under the con- 
trol of the firm, but are they more or less susceptible to dilu- 
tion than suggestive marks, such as Drano or Sonicare, 
which convey product associations? Recent work on mem- 
ory suggests that the representation of these two types of 
marks differs such that we would expect fanciful names to 
be more susceptible to dilution of brand name accessibility 
than suggestive names (for a similar idea in a somewhat dif- 
ferent domain, see Young 2001, cited in Schacter 2001). 
Furthermore, the language-based associations of suggestive 
names may reduce overall susceptibility to dilution. If these 
predictions prove correct, they have three implications for 
defending against dilution: First, a brand’s susceptibility 


may be reduced by designing trademarks that facilitate 
communication of the brand’s distinctive aspects. Second, 
for fanciful marks, resources may be allocated more effi- 
ciently by focusing on increasing brand name accessibility, 
which is more often diluted. Third, it is possible to monitor 
for dilution more effectively by using multiple measures. 
Brand name accessibility dilution may occur even when 
there is no aspect accessibility dilution. 

Third, to predict dilution effects correctly, brand man- 
agers and public policy makers alike require an understand- 
ing of the structure of the shared brand network. On the 
basis of recent memory research, we assumed that junior 
brands are encoded proximate to and well integrated with 
senior-brand knowledge when category similarity is high 
and more distinctly and distantly from senior-brand knowl- 
edge when category similarity is low. In general, the acces- 
sibility effects we observed were consistent with this model. 
However, we did not directly access memory structure. Fur- 
thermore, there are many variables that seem likely to alter 
the degree of interconnectedness of the shared brand net- 
work. For example, attribute dissimilarity can manifest in 
ways that may differ in their capacity to create the close 
linkages that lead to dilution. Some dissimilar attributes are 
different points along the same dimension (i.e., cinnamon 
and strawberry flavors), and others reflect completely dif- 
ferent dimensions (i.e., cinnamon flavor and vitamin 
enriched). It might be expected that the former creates a 
closer linkage and, thus, greater dilution. 


Back to Victor’s/Victoria’s Secret 


Our results suggest that Victoria’s Secret may not be much 
harmed in the short run by Victor’s Secret, especially if 
most consumers who are aware of Victor’s Secret know that 
it sells only “adult” lingerie. This category association is 
similar to that of Victoria's Secret (i.e., sexually alluring lin- 
gerie), though there may be a clear distinction on the “taste- 
fulness” or “product offering" dimensions. Thus, Victoria's 
Secret may wish to focus on tarnishment or dilution of 
attribute-level aspects in its attempt to demonstrate harm. 

Perhaps a greater concern is the risk of future dilution 
from this junior brand. Whereas a single “adult” shop 
named Victor's Secret would seem insignificant in the face 
of Victoria's Secret's marketing power—simply a joke to be 
appreciated—the risk is real. One of the values of building 
equity around an arbitrary mark such as Victoria's Secret 
(i.e., one that consists of common words that do not suggest 
or describe the product) is that these marks are the easiest to 
protect legally from infringement (Launer 1996). If others 
can use the mark or a parody version of it, the value of 
building an arbitrary brand is lessened. Although current 
trademark dilution law does not offer legal remedies for the 
risk of future dilution, senior brands with nondiluting junior 
brands would be well advised to monitor the activities of 
these unintended associates. 
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Xueming Luo, Rebecca J. Slotegraaf, & Xing Pan 


Cross-Functional "Coopetition": The 
Simultaneous Role of Cooperation 
and Competition Within Firms 


Extant marketing literature tends to view cross-functional relationships as primarily cooperative or competitive in 
nature, but not both. In contrast, this research focuses on cross-functional "coopetition" (i.e., the joint occurrence of 
cooperation and competition across functional areas within a firm). Using responses from midlevel managers and 
top executives, the authors find that cross-functional coopetition enhances a firm's customer and financial perfor- 
mance. The authors further show that this influence is mediated by market learning, indicating that performance 
returns to cross-functional coopetition occurs through an underlying learning mechanism. 


argue that knowledge transfer across functional from its Saturn division to its other divisions proved more 

boundaries is critical for numerous outcomes, difficult than expected (Kerwin and Woodruff 1992). Barilla 
including new product success (e.g., Griffin and Hauser SpA’s own sales and marketing areas impeded knowledge 
1992), organizational learning (Huber 1991), and overall transfer to its operations area when it attempted to install a 
firm performance (e.g., Gray and Meister 2004). In the mar- just-in-time distribution system (Hammond 1994). 
keting literature, Menon and Varadarajan (1992) argue that Within academic research across marketing and man- 
a firm's market knowledge must be transferred or dissemi- agement, scholars also recognize that knowledge transfer is 
nated across departments before knowledge can play a crit- valuable but often difficult to achieve (e.g., Kogut and Zan- 
ical role as a strategic asset.! Research also indicates that a der 1992; Szulanski 1996) and have subsequently studied 
firm's competitive advantage lies in its ability to transfer barriers to knowledge transfer. Although research points to 


Se in marketing and strategic management sharing attempts in the transfer of manufacturing insights 


market knowledge across departments (e.g., Maltz and several factors that can 1mpede knowledge transfer across 
Kohli 1996). functional boundaries within an organization, a critical fac- 

However, the transfer of knowledge across functional tor is the degree of competition across functions. In particu- 
boundaries can be rather difficult and complicated. Even the lar, research shows that an arduous relationship between the 
most organized efforts to share knowledge are often source and the recipient (Szulanski 1996), or interfunctional 


impeded by employees’ tendencies to guard and selectively rivalry (Maltz and Kohli 1996), can impede internal knowl- 
share information (Gilmour 2003). Consider the number of edge transfer. In numerous situations, functions must com- 
companies that have experienced difficulties with internal pete for a firm’s scarce resources and thus are often reluc- 
knowledge transfer. For example, Hewlett-Packard’s early tant not only to share information to prevent a competing 
failure in the laptop market with a 23-pound product can be function from gaining knowledge but also to receive infor- 


attributed to its lack of knowledge flow between its market- mation for fear that it will heighten the value of a compet- 
ing and engineering departments (Fisher, Maltz, and ing function’s knowledge (Gupta and Govindarajan 2000). 
Jaworski 1997). General Motors found that its knowledge- Thus, the question is whether competing departments can 


effectively cooperate with one another to enhance organiza- 

M tional learning and performance. More generally, how 

IWe refer to "functions" and “departments” interchangeably should firms strategically manage cross-functional competi- 
throughout this article. tion and cooperation to achieve competitive advantage? 

The extant marketing literature has not addressed this 
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competition aspect, focusing on interdepartmental rivalry 
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(e.g., Homburg, Workman, and Krohmer 1999; Levitt 
1969). However, although functional areas compete with 
one another in the pursuit of divergent individual goals and 
strategic priorities (e.g., Houston et al. 2001), they must 
also cooperate so that they work toward the firm’s common 
interests (Narver and Slater 1990). Consequently, an organi- 
zation's functional areas are often forced to compete and 
cooperate simultaneously with one another. Thus, it is criti- 
cal to understand how these seemingly conflicting relation- 
ships interplay and affect firm performance. 

In an effort to understand these effects better, we focus 
on cross-functional “coopetition,” which we define as the 
joint occurrence of cooperation and competition across 
functional areas within a firm. À coopetition view of the 
interactions between marketing and other functional areas 
not only emphasizes the need to address complex inter- 
departmental structures but also echoes the importance of 
coordinating diverse and competitive knowledge and skills 
across the cross-functional boundaries within a firm. We 
argue that cross-functional coopetition improves a firm's 
customer and financial performance. 

We also investigate whether market learning plays a 
mediating role in the relationship between cross-functional 
coopetition and firm performance. This mediating role 
would suggest that there is an underlying learning mecha- 
nism by which cross-functional coopetition contributes to 
superior performance. Market learning is grounded in the 
knowledge-based view of the firm (Grant 1996) and is rec- 
ognized as an important source for a firm's competitive 
advantage (e.g., Kogut and Zander 1992; Nonaka 1994; 
Sinkula 1994). Although marketing scholars have focused 
on the value of knowledge across a broad array of topics, 
including the utilization of market knowledge (e.g., Desh- 
pandé and Zaltman 1982; Menon and Varadarajan 1992; 
Moorman, Zaltman, and Deshpandé 1992) and the 
exchange of knowledge across functional units (Fisher, 
Maltz, and Jaworski 1997; Griffin and Hauser 1992; Maltz 
and Kohli 1996), how intrafirm cooperation among compet- 
ing functions influences a firm's learning about key market- 
ing activities has yet to be explored. Following prior studies 
(e.g., Menon et al. 1999; Moorman 1995; Srivastava, Sher- 
vani, and Fahey 1998), we refer to market learning as an 
organization's knowledge stores of key marketing activities, 
such as developing new products, building brand image 
among customers, and establishing channel partnerships 
and networks in the market. 

Therefore, in this study, we empirically test (1) whether 
the joint occurrence of cross-functional competition and 
cross-functional cooperation (intensity and ability) 
improves a firm's customer and financial performance and 
(2) whether a firm's market learning mediates these perfor- 
mance returns. Our results show that cross-functional 
coopetition has an important effect on performance out- 
comes through enhanced market learning, paving the way 
for new insight into how cross-functional interactions can 
affect a firm's competitive advantage. Our study also offers 
managers evidence that cooperation and competition should 
both be strategically stimulated across functions to promote 
intrafirm knowledge transfer and to enhance the firm's cus- 
tomer and financial performance. 
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Cross-Functional Coopetition 


Theoretical Nature of Cross-Functional 
Coopetition 

Coopetition, or the joint occurrence of cooperative and 
competitive behaviors, can exist at multiple levels, includ- 
ing firms, strategic business units (SBUs), departments, and 
task groups (e.g., Brandenburger and Nalebuff 1996; 
Hamel, Doz, and Prahalad 1989; Tsai 2002). One theoreti- 
cal foundation of coopetition can be drawn from research 
on social structure in the sociology literature. In particular, 
the social embeddedness framework purports that relations 
are always present and that the social structure of these rela- 
tions influences subsequent behaviors (Granovetter 1985; 
Uzzi 1997, 1999). In consideration of the types of social 
relations, weak ties (Granovetter 1973) are characterized by 
sporadic interactions, yet they can offer high returns by 
linking people or firms to diverse pools of information 
(Burt 1992). In contrast, embedded ties are characterized by 
frequent and stronger interactions such that information is 
perceived as more trustworthy (Granovetter 1985) and 
cooperation is high (Gulati 1998). In considering both types 
of relations, research shows that the greatest value is recog- 
nized when there is a complementary mix of both forms 
(Uzzi 1999). Thus, we expect that firms!enhance their per- 
formance when they exhibit cooperative social ties that are 
nested in a broader competitive framework. 

Research in the marketing literature supports and 
extends this notion. For example, Vargo and Lusch (2004) 
point to the importance of collaboration not only among 
interorganizational partners but also among intraorganiza- 
tional functions for a firm's long-term viability. At the intra- 
organizational level, some marketing scholars have also rec- 
ognized that interdepartmental interaction may be a 
double-edged sword, involving both collaboration and 
rivalry (e.g., Ruekert and Walker 1987). We explicitly 
address this interdepartmental interaction by examining the 
effects of cross-functional coopetition, or the joint occur- 
rence of cooperation and competition, across functional 
areas within a firm. 

In cross-functional coopetition, interdepartmental inter- 
actions and the transfer of knowledge across functional 
areas can be both competitive and cooperative in nature. 
The competitive nature often occurs because knowledge can 
generate private gains for individual departments to outper- 
form their counterparts. Indeed, cross-functional competi- 
tion may result from direct comparisons among functional 
units (Levitt 1969; Maltz and Kohli 1996), interdepartmen- 
tal struggles to obtain limited tangible resources (e.g., orga- 
nizational capital, personnel) and intangible resources (e.g., 
top executives' mental time, attention) (Frankwick et al. 
1994), and divergent goals and strategic priorities (Houston 
et al. 2001; Ruekert and Walker 1987). At the same time, 
knowledge transfer across functions within a firm may be 
cooperative in nature because departments need to collabo- 
rate with one another in the transfer of market knowledge 
for the common interests of the firm. Informal cooperative 
interaction among functional departments is particularly 
important for obtaining more frequent and valuable infor- 
mation (Granovetter 1985) and for achieving greater knowl- 


edge transfer across departments (Ruekert and Walker 
1987). For example, “hall talk” and storytelling are infor- 
mal knowledge transfer processes that occur among depart- 
ments as a means to bring people and departments together 
and to facilitate attainment of the organization’s goals 
(Kohli and Jaworski 1990). Thus, the degree to which a 
firm’s departments cooperate in conjunction with various 
levels of competition in the firm’s social structure jointly 
defines the firm’s level of cross-functional coopetition. 

To clarify the concept of cross-functional coopetition 
further, we briefly discuss how it differs from two related 
constructs: First, cross-functional coopetition is similar to 
interorganizational alliances in that both involve coopera- 
tion between competing segments (i.e., functions, firms), 
yet important differences exist. An obvious difference is the 
Jevel of analysis; the former is focused at the intraorganiza- 
tional level, and the latter is focused at the interorganiza- 
tional level. As a result, for example, the competitive ele- 
ment of coopetition within a firm is likely to be different 
from the competitive element between firms, such that 
rivalry and conflict are likely to be less extreme within a 
firm than between firms. In addition, underlying character- 
istics of interorganizational alliances, such as anticipated 
future interaction (Heide and Miner 1992) and relationship 
age (Anderson and Weitz 1989), are not likely to play a role 
in cross-functional coopetition. Second, cross-functional 
coopetition is also similar to market orientation in that both 
emphasize cooperation across functions or departments. In 
particular, a process view of market orientation suggests 
that knowledge about customers is disseminated across 
departments (e.g., Kohli and Jaworski 1990). However, 
coopetition and market orientation differ in an important 
way. Whereas market orientation focuses on promoting 
functional coordination and reducing interdepartmental 
conflict and friction (e.g., Narver and Slater 1990), cross- 
functional coopetition recognizes that interdepartmental 
conflict is not always unfavorable and can even produce 
specific benefits (Anderson and Narus 1990; Lado, Boyd, 
and Hanlon 1997). 


The Effect of Cross-Functlonal Coopetition on 
Performance 


We expect the joint occurrence of cross-functional coopera- 
tion and competition to enhance firm performance. The syn- 
ergy of coopetition has been theorized to produce various 
benefits, including learning, cost savings, resource sharing, 
and innovation (Lado, Boyd, and Hanlon 1997). At an intra- 
organizational level, cooperation among competing units 
may be evident by absorbed and frequent interactions to 
uncover competing units’ know-how (Tsai 2002). There- 
fore, we focus on the ability and intensity, respectively, of 
the nature of interdepartmental cooperation and examine 
the performance effects of cross-functional coopetition in 
more detail by investigating (1) the joint occurrence of 
cross-functional cooperative ability and competition and (2) 
the joint occurrence of cross-functional cooperative inten- 
sity and competition. 


The joint occurrence of cross-functional cooperative 
ability and competition. Cross-functional cooperative abil- 


ity refers to skills in recognizing the value of new market 
knowledge as well as assimilating, transforming, and 
deploying market knowledge across lateral cross-functional 
interactions, reflecting an absorptive capacity for coopera- 
tive knowledge transfer (Cohen and Levinthal 1990; Zahra 
and George 2002). The ability for cross-functional coopera- 
tion extends beyond the acquisition of market knowledge, 
because functional units that do not have the capacity for 
assimilation and transformation may not be able to internal- 
ize the available knowledge or to deploy it effectively. 

Regarding the joint impact of cross-functional coopera- 
tive ability and competition on firm performance, it is possi- 
ble that a negative effect may occur. In particular, Rind- 
fleisch and Moorman (2003) show that a firm’s customer 
orientation may deteriorate if the firm is engaged in a coop- 
erative alliance with competitors; one reason for this 
decline is a low level of trust among alliance members. It is 
also noted that interfunctional rivalry decreases the level of 
trust in information (e.g., Maltz and Kohli 1996), which 
suggests that cross-functional competition coupled with 
cooperative ability could decrease a firm’s customer-based 
performance. 

However, we expect that the joint occurrence of cross- 
functional cooperative ability and competition has a posi- 
tive effect on a firm’s customer performance and financial 
performance. A combination of cross-functional coopera- 
tive ability and competition may nurture productive interac- 
tions (e.g., Brandenburger and Nalebuff 1996; Lado, Boyd, 
and Hanlon 1997; Uzzi 1999) that can facilitate internal 
efficiencies and sharing of best practice for successful 
deployment and exploitation of knowledge. In particular, 
cooperation among competitors can foster greater knowl- 
edge seeking and result in syncretic rents (Lado, Boyd, and 
Hanlon 1997). High cross-functional cooperative ability 
emphasizes the nature of gaining, absorbing, and sharing 
customer and market knowledge, whereas high cross- 
functional competition may provide departments with a 
strong incentive to share this knowledge to understand one 
another's positions better. Indeed, Tsai (2002) empirically 
shows that productive interactions can be nurtured when 
there is high competition for resources across a firm's SBUs 
because they are more likely to share information and 
exploit valuable knowledge stores. These productive inter- 
actions can also affect firm performance. For example, Von 
Hippel (1987) shows that cooperation among competing 
firms can improve each firm's profits. 

As such, the joint effect of high cooperative ability and 
high competition may generate better problem solving in 
satisfying customer needs and higher performance (Hamel, 
Doz, and Prahalad 1989; Lado, Boyd, and Hanlon 1997; 
Tsai 2002). Therefore, we predict that at the functional level 
within the firm, the joint occurrence of cooperative ability 
and competition gives rise to better customer and financial 
performance. 


H;: The joint occurrence of cross-functional cooperative abil- 
ity and competition has a positive effect on a firm's (a) 
customer performance and (b) financial performance. 


The joint effect of cross-functional cooperative intensity 
and competition. Cross-functional cooperative intensity 
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refers to the degree to which lateral, interdepartmental 
interactions are frequent and close within an organization. 
Prior research shows that frequent cooperative interaction 
enhances the level of communication and knowledge shar- 
ing (Ghoshal, Korine, and Szulanski 1994; Tsai 2002). Fre- 
quent interactions across functional areas may also serve as 
a mechanism that helps functions both recognize where 
potentially useful market knowledge could serve and route 
it to the destinations across functional boundaries for 
improved outcomes (Huber 1991). 

We expect that the joint occurrence of cross-functional 
cooperative intensity and competition has a positive influ- 
ence on a firm’s customer performance and financial perfor- 
mance. More specifically, frequent interactions can create 
more opportunities to share knowledge and ideas (Tsai 
2002), and sharing information across competing functions 
can offer timely integration of knowledge within a firm 
(e.g., Uzzi 1997), which in turn stimulates superior perfor- 
mance. Intensive cooperation across competing departments 
may not only promote the development and conversion of 
tacit knowledge into a shared understanding of customer 
needs (Nonaka 1994; Rindfleisch and Moorman 2001) but 
also generate more collaborative and effective strategic 
decision making within the organization (Ghoshal, Korine, 
and Szulanski 1994). Indeed, a competitive interdepartmen- 
tal structure coupled with frequent interactions may enable 
a firm to capitalize on the communication channels and ties 
embedded across its functional areas, thus leading to greater 
market and financial returns. 

Furthermore, the joint occurrence of cross-functional 
cooperative intensity and competition may reduce knowl- 
edge overlap and create synergies. In particular, although 
the intraorganizational functions are likely to share similar 
resource constraints and market situations imposed by the 
firm, knowledge across departments is less likely to be 
redundant, given the traditional emphases across functions 
(Moorman and Rust 1999). With less redundancy, frequent 
interaction among competing departments provides access 
to novel and complementary market intelligence across 
functional knowledge silos (Granovetter 1973). This access 
to nonredundant information fosters better problem solving 
and decision making (e.g., Cummings 2004) and is essential 
for the creation of customer and financial value (e.g., Kohli 
and Jaworski 1990). Therefore, we predict that the joint 
occurrence of cross-functional cooperative intensity and 
competition enhances customer and financial performance. 


H5: The joint occurrence of cross-functional cooperative 
intensity and competition has a positive effect on a firm's 
(a) customer performance and (b) financial performance. 


The Mediating Role of Market Learning 


We also argue that a firm with departments that simultane- 
ously compete and cooperate may enhance its market learn- 
ing, which in turn will generate higher performance. Market 
learning broadly refers to a firm's expertise in and knowl- 
edge stores of key marketing activities, such as developing 
new products, building brand image, responding to cus- 
tomers’ needs, and establishing channel relationships 
(Menon et al. 1999; Moorman 1995; Srivastava, Shervani, 
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and Fahey 1998). On the basis of the knowledge-based view 
of the firm, which posits that competitive advantage results 
from different knowledge stores and market expertise (e.g., 
Grant 1996; Nonaka 1994), we examine whether market 
learning is a route from which cross-functional coopetition 
enhances a firm’s customer and financial performance. 

The coexistence of cross-functional cooperation and 
competition is likely to create a synergy that improves a 
firm’s market learning. First, the joint occurrence of cross- 
functional cooperative ability and competition offers the 
opportunity for customer and market knowledge to cross a 
firm’s functional boundaries in an absorbed and effective 
way. With the facilitating effect of competition, brought 
about by the interest in knowing more about competitors’ 
positions (Hamel, Doz, and Prahalad 1989; Tsai 2002), the 
cooperative ability to recognize, assimilate, and apply spe- 
cialized market knowledge across competing departments is 
likely to enhance the ease and effectiveness with which 
learning may occur (Cohen and Levinthal 1989; Grant 
1996; Szulanski 1996). 

Second, the joint occurrence of cross-functional cooper- 
ative intensity and competition is also expected to improve 
a firm’s market learning by enabling departments to 
enhance their knowledge stores in proactive and innovative 
ways. Intensive and frequent interactions can improve the 
transfer of complex knowledge (Hansen 1999) and adapta- 
tion to complex situations (Uzzi 1997). Furthermore, as we 
previously indicated, cross-functional competition can 
instil] the motivation to understand other functions’ posi- 
tions. Thus, when intensive interactions occur across com- 
peting functions, this joint occurrence of cross-functional 
cooperative intensity and competition enables the compet- 
ing departments to attain new and complementary knowl- 
edge that reduces the chance for interdepartmental misun- 
derstanding of complex knowledge (Griffin and Hauser 
1996) and offers the opportunity to diffuse novel ideas and 
promote greater learning (e.g., Ghoshal, Korine, and Szu- 
lanski 1994). 

Therefore, we expect that the joint occurrence of cross- 
functional cooperative ability and competition and the joint 
occurrence of cross-functional cooperative intensity and 
competition enable a firm to generate stronger market 
learning. 

H4: A firm's market learning is influenced by (a) the joint 
occurrence of its cross-functional cooperative ability and 
competition and (b) the joint occurrence of its cross- 
functional cooperative intensity and competition. 


Extant literature in marketing shows a large and accu- 
mulating amount of evidence regarding the influence of 
organizational learning and market intelligence on firm per- 
formance (e.g., Kohli and Jaworski 1990; Maltz and Kohli 
1996; Moorman, Zaltman, and Deshpandé 1992; Rind- 
fleisch and Moorman 2001; Zhou, Yim, and Tse 2005). In 
addition, fundamental tenets of the knowledge-based view 
(Grant 1996; Kogut and Zander 1992) indicate that superior 
knowledge stores and organizational learning, as strategic 
assets, lead to improved financial and market performance. 
As previously argued, we believe that cross-functional 
coopetition (1) helps a firm build the necessary knowledge 


stores for superior market learning, in the form of the joint 
occurrence of cross-functional cooperative intensity and 
competition, and (2) enables the firm to exploit these 
knowledge stores for superior market learning, in the form 
of the joint occurrence of cross-functional cooperative abil- 
ity and competition. In turn, superior market learning leads 
to improved firm financial and market performance (e.g., 
Dodgson 1993; Kogut and Zander 1992). 

Therefore, we purport that the performance advantages 
of cross-functional coopetition are achieved through an 
underlying learning mechanism. 


H4: Market learning mediates the influence of the joint occur- 
rence of cross-functional cooperative ability and competi- 
tion on a firm's (a) customer performance and (b) finan- 
cial performance. 

H5: Market learning mediates the influence of the joint occur- 
rence of cross-functional cooperative intensity and com- 
petition on a firm's (a) customer performance and (b) 
financial performance. 


Methodology 


Sample 
The sample consists of firms that operate in high- 
technology sectors such as biotechnology, software devel- 
opment, information technology, and electronics within 
China. As the second largest economy in the world, China 
represents a critical market to consider (Boisot and Child 
1996; Walters and Samiee 2003). Using the official China 
Basic Statistical Units Yearbook as the sampling frame, we 
randomly selected 500 firms headquartered in metropolitan 
areas with nationwide operations in China.2 

The data were collected from two key informants within 
the organization to minimize potential common method 
bias. In particular, common method bias concerns arise 
when both independent and dependent variables are mea- 
sured by the same key informant (e.g., Van Bruggen, Lil- 
lien, and Kacker 2002). To specify the two key informants 
from each of the 500 firms, the firms were telephoned and 
requested to select one midlevel manager randomly (e.g., 
sales, marketing, research-and-development [R&D] depart- 
ment manager) and one top executive (e.g., chief executive 
officer, general manager). The key informants were care- 
fully chosen to ensure that they had the knowledge and 
background to complete the questionnaire in a thoughtful 
manner. The midlevel managers and senior executives were 
interviewed separately (Atuahene-Gima and Li 2002). In 
generating these responses for each company, top execu- 
tives answered survey questions about organizational per- 
formance, market learning, and firm demographic informa- 
tion, whereas midlevel department managers answered 
questions about cross-functional cooperative forces, com- 
petitive forces, and some items of firm performance as a 
check of data validity. 


2These 500 firms were equally selected from five major cities, 
including Shanghai 1n the East, Chengdu in the West, Beijing in 
the North, Guangzhou in the South, and Changsha in Central 
China. We selected this equal distribution to minimize bias of 
regional market heterogeneity. 


A total of 329 firms (658 managers) met the screening 
criteria (i.e., the knowledge and background to respond and 
two key informants from each firm) and initially agreed to 
participate. After matching key informants and deleting 
missing data, we ended up with usable information from 
163 firms (326 informants). As a result, we obtained a 
response rate of 32.696 of the original sample (163/500) and 
49.596 of the qualified informants (163/329). To test for 
nonresponse bias, we compared the demographic data of 70 
firms that agreed to participate but did not complete the 
interviews with those of the completed and useful 
responses. We found no significant difference in firm size, 
industry, location, and ownership. 

The results testing the performance items that were des- 
ignated to check informant response validity (sales growth, 
customer satisfaction, and return on investment [ROI]) 
exhibit high consistency between top executives and 
midlevel managers (Guttmans: .83, .88, and .87, respec- 
tively). To ensure that the senior executives were appropri- 
ate and reliable, we examined the extent of their involve- 
ment in making strategic decisions in the firm (Li and 
Atuahene-Gima 2001); the results show high strategic 
involvement (M - 6.81 on a seven-point scale). Among the 
midlevel managers, several functional areas were covered, 
including sales (38%), marketing (33%), R&D (21%), and 
others (8%). Note that all functional areas may generate 
market knowledge to enhance customer value (Kohli and 
Jaworski 1990; Narver and Slater 1990) and that the process 
of market intelligence dissemination across functional 
boundaries can be evaluated by midlevel managers from 
marketing and other functional areas (e.g., Fisher, Maltz, 
and Jaworski 1997; Maltz and Kohli 1996). In general, 6896 
of informants had more than five years' experience in the 
firm, and 9196 had at least two years' experience with the 
firm. Thus, the key informants were reasonably knowledge- 
able about the cross-functional dynamics within the firm 
(e.g., Johnson, Sohi, and Grewal 2004). Among the firms in 
our sample, 72.596 were small or medium in size, with 500 
or fewer employees. The largest industry segment was elec- 
tronics (36.3%), followed by information technology 
(28.1%) and biotechnology (18.7%). 


Measures 


We assessed all constructs as multi-item measures. In addi- 
tion, we scored each item on a seven-point scale, ranging 
from “strongly disagree" (1) to “strongly agree” (7). The 
questionnaire was prepared in English, translated into Chi- 
nese by independent translators, and then back translated 
into English to ensure accuracy and follow appropriate 
guidelines (Brislin 1970). The questionnaire was then 
pretested with 17 Chinese managers in a pilot study. 
Reponses from the pretests ensured the relevance and 
equivalence of the final measures. We describe all measures 
in Table 1 and detail them in the Appendix. We discuss the 
measurement results in the "Results" section. 

We measured cross-functional cooperative intensity 
with six items that we constructed to assess the degree to 
which the lateral cross-functional interactions are frequent 
and close. We adapted this measure in part from similar 
research in marketing that focuses on relational embedded- 
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TABLE 1 


Main Constructs, Sources, and Definitions 





Construct 


Cross-functional 
cooperative intensity 


Cross-functlonal 


cooperative ability 


Cross-functional 
competition 


Market learning 


Firm performance 


Construct Sources 


Antia and Frazier 
(2001); Rindflelsch and 
Moorman (2001) 


Cohen and Levinthal 
(1990); Szulanski 
(1996); Zahra and 

George (2002) 


Levitt (1969); Houston 
et al. (2001); Maltz and 
Kohli (1996); Ruekert 
and Walker (1987) 


Menon et al. (1999); 
Moorman (1995); 
Srivastava, Shervani, 
and Fahey (1998) 


Moorman and Rust 
(1999) 


Construct Definition 


The extent of the frequency and closeness of the 
lateral social interactions among functional areas 
within the firm. 


The ability to assimilate and deploy market 
knowledge in lateral interactions among functional 
areas. 


The degree to which departments compete both 

for limited tangible and intangible resources and 

for strategic importance, power, and department 
charter. 


The firm's expertise in and knowledge stores of 
key marketing activities (i.e., developing new 
product development, building brand image, 

sensemaking customers' current and potential 

needs, and others). 


The perceptions of strategic market performance 
and financlal performance (l.e., market share 
growth, customer loyalty, new product 
development, sales growth, customer satisfactlon, 


Key Informants Jn 
the Dyad 


Midlevel managers 


Midlevel managers 


Midlevel managers 


Top executives 


Top executives 


salllng costs, customer retention, customer 


ness at other organizational levels (e.g., Antia and Frazier 
2001; Rindfleisch and Moorman 2001). Our cross- 
functional cooperative ability measure includes six items 
that examine the core underlying ability to evaluate, assimi- 
late, and exploit market knowledge transferred from other 
departments during cross-functional interactions. In line 
with the process perspective of absorptive capacity (Cohen 
and Levinthal 1990; Lane, Salk, and Lyles 2001), this mea- 
sure extended the three-item scale that Szulanski (1996) ini- 
tially suggested and validated. 

The measure of cross-functional competition included 
11 items that assessed the extent to which functional areas 
compete with one another for tangible and intangible 
resources, strategic importance, and charter (Ruekert and 
Walker 1987) and the extent to which departments struggle 
with one another because of divergent departmental goals 
and strategic priorities (Houston et al. 2001; Levitt 1969; 
Maltz and Kohli 1996). We dropped one item to enhance 
scale validity (see the Appendix), which resulted in a ten- 
item measure. 

We developed the ten-item measure of market learning 
to capture the firm’s expertise in and knowledge stores of 
key marketing activities, adapting our measure from Menon 
and colleagues’ (1999) list of key marketing activities. With 
these ten items, we model market learning as a higher-order 
construct of three subdimensions. This approach is similar 
to extant literature that uses a second-order factor to exam- 
ine cooperative competency (Sivadas and Dwyer 2002). 
Our three subdimensions include market learning in prod- 
uct/service (i.e., developing a new product/service, promot- 
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lifetime value, and ROI). 


ing and selling a product/service, and refining and reposi- 
tioning an existing product/service), customer (1.e., building 
a brand image among customers, developing extensive cus- 
tomer service capabilities, and making sense of customers’ 
current and potential needs), and channel (i.e., pricing 
below competitors, establishing marketing and distribution 
networks in the market, collaborating with business part- 
ners, and knowing about business environment). The results 
show good overall fit: y? = 36.85, d.f. = 32, p > .05; com- 
parative fit index (CFI) = .981, goodness-of-fit index 
(GFI) = .957, and root mean square error of approximation 
(RMSEA) = .031; as we expected, the three first-order 
Jatent constructs loaded significantly on market learning 
(product/services: A = .753, p < .01; customer: A = .816, p < 
.01; channel: À = .610, p < .01). 

In terms of performance, we examined two broad met- 
rics of firm performance. To assess the firm’s strategic mar- 
ket and financial outcomes, such as market growth and ROI, 
we used a four-item measure of financial performance that 
we adapted, in part, from the work of Moorman and Rust 
(1999). In addition, to assess the firm's customer-based per- 
formance, including its customer satisfaction and customer 
retention, we used a four-item measure of customer perfor- 
mance that we also adapted, in part, from the work of 
Moorman and Rust (1999). 

To control for any extraneous effects due to firm or 
industry factors, we also included several covariates. In par- 
ticular, we included firm size and firm ownership to control 
for economies of scale and resource capacity. We measured 
firm size using the number of employees, which is consis- 


tent with extant research (e.g., Lee and Grewal 2004; Luo 
2001), and we measured firm ownership using a dummy 
variable (1 = firms with foreign ownership, 0 = otherwise 
[ie., state- or private-owned firms]) due to strategic and 
operations differences between foreign and local firms in 
China (e.g., Peng and Luo 2000). In addition, we included 
two industry variables to control for market effects that 
have been reported to influence a firm’s strategic choices 
and performance outcomes (e.g., Atuahene-Gima and Li 
2002; Li and Calantone 1998; Zhou, Yim, and Tse 2005). 
Specifically, we included industry hostility, which we mea- 
sured using the five-item scale that Jaworski and Kohli 
(1993) developed, and market volatility, which we mea- 
sured using a four-item scale that is well-accepted in mar- 
keting literature (e.g., Li and Calantone 1998; Slater and 
Narver 1994; Zhou, Yim, and Tse 2005). 


Analysis and Results 


Measure Validation 

Following the work of Anderson and Gerbing (1988), we 
test the validity of the measures using confirmatory factor 
analysis. Overall model statistics indicate that the chi- 
square for the measurement model with all constructs is 
1427.18 (p = .00), with an acceptable ratio of chi-square to 
degrees of freedom of 1.67; the CFI, GFI, adjusted 
goodness-of-fit index (AGFT), and RMSEA are .96, .94, 
.93, and .05, respectively. 

Convergent validity for each of the measures, as indi- 
cated by Cronbach’s alpha, all exceeded the .7 benchmark 
(Nunnally 1978). As Table 2 shows, the minimum reliability 
across the measures is .90. Furthermore, at .71 or higher, 
the average variance extracted (AVE) for each construct 
exceeds the .5 benchmark (Fornell and Larcker 1981). 

Discriminant validity of the measures was supported 
with two different approaches. First, we conducted a series 
of chi-square difference tests by examining pairs of con- 
structs using two models (one constrained model and one 
unconstrained model) for each pair of measured constructs 
(Anderson and Gerbing 1988). The results suggest that 
unconstrained models fit the data better than constrained 
models, indicating discriminant validity. Second, we com- 
pared the AVE of the individual constructs with the squared 
correlation between construct pairs (Fornell and Larcker 
1981). As is evident in Table 2, the largest correlation is that 
between customer performance and financial performance 


(r = .58), which, when squared (r? = .336), is less than the 
AVE for either customer performance or financial perfor- 
mance (AVE = .75 and .72, respectively). In all cases, we 
find that the AVEs exceed the squared correlations, which 
again confirms discriminant validity. 


The Effects of Cross-Functional Coopetition 


Using regression analysis, we capture the effects of cross- 
functional coopetition with the interaction between the 
cooperation and competition constructs. We mean-centered 
these variables before creating the interaction to reduce any 
collinearity between the main and the interaction effects 
(Aiken and West 1996). Following Aiken and West's (1996) 
and Cohen and Cohen's (1983) recommendations, we 
entered the variables into the model in distinct steps: We 
entered the covariates, then the main effects, and finally the 
interaction effects. To examine the incremental effect of 
including the interaction between cross-functional coopera- 
tion and competition, we examined the change in R-square 
(AR2), which indicates the value of including additional 
variables in a model (Cohen and Cohen 1983) and the value 
of interactions in particular (Aiken and West 1996). The 
results for the effects of cross-functional coopetition on 
firm performance appear in Table 3. 

The results show that the interaction between (i.e., the 
joint occurrence of) cross-functional cooperative ability and 
competition has a positive and significant effect on cus- 
tomer performance (b = .19, p « .01) and on financial per- 
formance (b = .26, p < .01), in support of H, (see Table 3). 
In addition, the results show that the joint occurrence of 
cross-functional cooperative intensity and competition has a 
positive and significant effect on customer performance 
(b = .23, p < .01) and on financial performance (b = .16, p < 
.05), in support of H (see also Table 3). Moreover, entry of 
the joint occurrence of cross-functional competition and 
cooperative ability and of the joint occurrence of cross- 
functional competition and cooperative intensity in the third 
step explained a significant level of additional variance 
beyond the main effects. Specifically, these two types of 
cross-functional coopetition explained 8% more variance 
for customer performance (Fo 159) = 3.96, p < .01) and 
10% more variance for financial performance (Fo. 159) = 
4.43, p « .01). 

For our examination of whether there are any differen- 
tial effects for the two types of organizational performance, 
model comparisons show that the joint occurrence of cross- 


TABLE 2 

Descriptive Statistics of Key Constructs 
Varlables M SD AVE Ci CA CM LEARN CUPF  FIPF 
Cross-functional cooperative intensity (CI) 5.28 1.23 .74 .91 
Cross-functional cooperative abllity (CA) 5.89 1.14 73 43 9] 
Cross-functional competition (CM) 419 1.31 71 32" 3e" .90 
Market learning (LEARN) 5.10 1.18 72 .18* .37** | —12 .90 
Customer performance (CUPF) 5.41 1.05 5 .36"* | .24"  -—417* .39** .95 
Financial performance (FIPF) 5.08 1.36 .72 .27** 35" | —10 7 5B" .90 
*p « 05. 
**p « .01. 


Notes The coefficient alpha for each measure is on the diagonal, and the intercorrelations are on the off-dlagonal 
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TABLE 3 


The Effect of Cross-Functional Coopetition on Customer and Financial Performance 





Customer Performance 


Independent Varlables 


Coopetition Effects 
Joint occurrence of cross-functional cooperative 
ability and competition (CA x CM) 
Joint occurrence of cross-functional cooperative 
Intensity and competition (Cl x CM) 


Main Effects 
Cross-functional cooperative ability (CA) 
Cross-functional cooperative Intensity (Cl) 
Cross-functlonal competition (CM) 


Covariates 
Firm size 
Firm ownershlp 
Industry hostility 


Market volatility = 


Overall Model 


F value 8. 


Adjusted R? 
AR? 
Incremental F 


*p < .05. 
**p « .01 


functional cooperative ability and competition has a 
stronger effect on financial performance than on customer 
performance (¥2 gig = 11.28, d.f.ag = 1, p < .01) and that the 
joint occurrence of cross-functional cooperative intensity 
and competition has a stronger effect on customer perfor- 
mance than on financial performance (X2ag = 3.71, d.£.ag = 
1, p < .10). It is possible that frequent interaction among 
competing departments helps build customer retention and 
loyalty, whereas improvements in a firm’s financial perfor- 
mance require more absorptive and assimilating coopera- 
tion among competing departments. 


The Medlating Role of Market Learning 


To examine the extent to which market learning mediates 
the effects of cross-functional coopetition, we relied on the 
three-step mediated regression approach that Baron and 
Kenny (1986) recommend. More specifically, to establish 
mediation of market learning, coopetition must affect mar- 
ket learning and performance, and marketing learning must 
affect performance. As we previously indicated, the results 
show that cross-functional coopetition affects performance 
(see Models 3 and 6 in Table 3). Therefore, to establish 
mediation, cross-functional coopetition must affect market 
learning, and market learning must affect performance. The 
results show that the joint occurrence of cross-functional 
cooperative ability and competition has a positive effect on 
market learning (b = .22, p < .01) and that the joint occur- 
rence of cross-functional cooperative intensity and competi- 
tion has a positive effect on market learning (b = .17, p « 
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Model 1 


Model 2 Model 3 


Financlal Performance 


Model4  Model5 Model 6 


.19** 26°" 
eo .16* 
art 29° .36"* .39** 
.32"* EC odd SÉ .23** 
—16* — 13 — 11 —.10 
.04 .04 .16* 7" .16* 
.16* .14 .09 .08 .08 
ag .24** 22°" 21** 21 
—.11 —.10 —.12 —.16* —.16 
10.65™ 8.87 7.21™ 11.77^ . 10.04** 
43 51 21 45 55 
18 .08 24 .10 
9.52** 3.96** 13.47** 4.43** 


.05), as Model 7 shows (see Table 4). These results support 
H5. For our examination of the effect of market learning on 
performance, the results indicate that market learning has a 
positive and significant effect on a firm's customer perfor- 
mance (b = .32, p « .01) and financial performance (b = .28, 
p«.01)? 

Furthermore, as Table 4 shows, inclusion of market 
Jearning in the model reduces the strength of the effect of 
cross-functional coopetition on performance. In particular, 
the joint effect of cross-functional cooperative ability and 
competition on customer performance and the joint effects 
of cross-functional cooperative intensity and competition on 
both customer performance and financial performance are 
no longer significant (see Models 8 and 9 in Table 4). Thus, 
market learning fully mediates the joint effect of cross- 
functional cooperative ability and competition on customer 
performance, fully mediates the joint effect of cross- 
functional cooperative intensity and competition on cus- 
tomer performance, and fully mediates the joint effect of 
cross-functional cooperative intensity and competition on 





3A]though common variance between market learning and firm 
performance may influence this effect, we examined whether the 
relationship between market learning and firm performance would 
differ when firm performance was reported by midlevel managers 
rather than by senior executives. The results indicate that there 
remains a strong relationship between market learning (reported 
by senior executives) and firm performance (reported by midlevel 
managers): r= 31, p < .01. 


TABLE 4 
The Mediating Effect of Market Learning 


a oS hr 





financial performance. Moreover, although the interaction 
effect of cross-functional cooperative ability and competi- 
tion on financial performance remains significant, its effect 
diminishes (see Table 4), illustrating partial mediation. 
These results offer strong support for H4 and H; and the 
argument that cross-functional coopetition realizes perfor- 
mance advantages through market learning. 


Model Comparisons 


To examine the extent to which our results are robust across 
different models, we used structural equation modeling 
(SEM) to compare alternative models. In particular, we fit 
the data with three models (Morgan and Hunt 1994; Zhou, 
Yim, and Tse 2005). In the first model (SEM 1), all variables 
directly influence the two types of firm performance, and 
we do not include any mediation effect of market learning. 
The second model (SEM2) involves full mediation of mar- 
ket learning (i.e., the joint occurrence of cross-functional 
cooperative ability and competition and the joint occurrence 
of cross-functional cooperative intensity and competition 
are not modeled to affect performance). The third model 
(SEM3) allows partial mediation of market learning. All 
three models include the aforementioned covariates. 

The results indicate that fit indexes for SEMI are 
X2 = 1749.93, d.f. = 852, p = .00; CFI = .90, GFI = .88, 
AGFI = .87, and RMSEA = .06; fit indexes for SEM2 are 
X2 = 1602.98, d.f. = 855, p = .00; CFI = .92, GFI = .91, 
AGFI = .89, and RMSEA = .06; and fit indexes for SEM3 
are X? = 1578.54, d.f. = 851, p = .00; CFI = .94, GFI = .92, 


Model 7: Model 8: Model 9: 
Market Customer Financial 

‘Independent Variables Learning Performance Performance 
Mediating Effect 

Market learning .J2** .28** 
Coopetition Effects 

Joint occurrence of cross-functional cooperative 

abllity and competition (CA x CM) .22** 11 .20™ 
Jolnt occurrence of cross-functional cooperative 
intensity and competition (CI x CM) mY id .13 .12 

Main Effects 

Cross-functional cooperatlve ability (CA) .397** .20** .26** 

Cross-functional cooperative intensity (Cl) .20** .24** .18** 

Cross-functional competition (CM) —.12 —.09 —.09 
Covariates 

Firm size .05 .09* .16* 

Firm ownership .10 .10 .06 

Industry hostility .24** .18** .20* 

Market volatility —.10 —.11* —.17 
Overall Model 

F value 14.87** 10.92** 9.62** 

R2 53 .60 .58 
“p< 05. 
**p < .01. 


AGFI = 90, and RMSEA = .05. Comparison across these 
three alternative SEM models illustrates that SEM3, with 
partial mediation effects, fits the data best, fitting the data 
better than SEMI with no meditation effects (X24 = 
171.39, d.f.ag = 1, p < .01) and better than SEM2 with full 
mediation effects (X745 = 24.44, dfigie = 4, p < .01). In 
other words, these findings show that market learning par- 
tially mediates the influence of cross-functional coopetition 
effects on firm performance, consistent with the regression 
results in Table 4. Furthermore, the structural paths in 
SEM3 appear to be consistent with the beta coefficients 
from the regression results. Specifically, SEM3 results show 
support for the market learning-customer performance (b = 
.35, p « .01) and market learning-financial performance 
(b = .29, p < .01) linkages, comparable to the counterpart 
beta coefficients of .32 (p « .05) and .28 (p « .05) regression 
results in Table 4.5 Overall, the results for SEM confirm our 
conceptual framework, demonstrating the performance ben- 
efits of cross-functional coopetition and the role of market 
learning as a mediator of these benefits. 


^We assessed SEM path coefficients significance through a 
bootstrapping procedure with 1000 resamples. 

SAlthough we found a significant path from cross-functional 
coopetive intensity to customer performance in SEM3 (b = .18, 
p < .05), which was not significant in the regression analysis, spe- 
cific differences between regression results and SEM results may 
be due to different estimation approaches (ordinary least squares 
in the regression approach and maximum likelihood in the SEM 
approach) and to the accounting for measurement error in SEM. 
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Discussion 


The recently emerging research stream on coopetition has 
been largely limited to the interfirm level (e.g., Rindfleisch 
and Moorman 2003; Zeng and Chen 2003). Only a few 
studies have examined coopetition at the intraorganizational 
level, focusing on SBU-level knowledge sharing in general 
(Isai 2002) or within a multinational organization (e.g., 
Ghoshal, Korine, and Szulanski 1994). Coopetition at the 
cross-functional level has largely been ignored in both the 
marketing and the management literature. Notably, Rind- 
fleisch and Moorman (2001) point out that the marketing 
literature has devoted surprisingly little systematic research 
on interfirm coopetition, let alone intrafirm coopetition. 

Our focus on the simultaneous role of cooperation and 
competition in cross-functional relationships advances the 
recent emphasis on market knowledge transfer across func- 
tional boundaries for superior learning and performance 
(Maltz and Kohli 1996, 2000). We argue that because cross- 
functional cooperation involves the ability for and the inten- 
sity of knowledge transfer and because competition can 
facilitate knowledge transfer given an underlying 1ncentive 
to understand competing functions' positions, the joint 
occurrence of cross-functional cooperation and competition 
can create synergies that translate into competitive advan- 
tage. Overall, our results indicate that cross-functional 
coopetition has an important impact on a firm's customer- 
based and financial-based performance and that market 
learning mediates these performance returns. Thus, 
although a firm's functional areas may experience competi- 
tion for resources and strategic emphasis, cooperative 
forces are necessary to shape not only its market learning 
but also its customer and financial performance. Conse- 
quently, our findings regarding the synergy of simultaneous 
cooperation and competition at the functional level provide 
some empirical evidence of the “positive-sum” or syncretic 
rents of interfirm coopetition that Lado, Boyd, and Hanlon 
(1997) theorize. 

Importantly, our results also show that cross-functional 
coopetition can affect firm performance in different ways. 
For example, the joint occurrence of cross-functional coop- 
erative intensity and competition has a stronger positive 
effect on customer-based performance than on financial- 
based performance, potentially because frequent interac- 
tions among competing departments may not be an efficient 
approach (Lado, Boyd, and Hanlon 1997); yet the commu- 
nication offers additional information that can benefit con- 
sumers (Fisher, Maltz, and Jaworski 1997; Griffin and 
Hauser 1996). Our introduction of two important concepts 
of cross-functional cooperation—namely, cooperative 
intensity and cooperative ability—also offers important 
insight for intrafirm knowledge transfer and performance 
outcomes. A combination of the two cooperation concepts 
in cross-functional coopetition may help solve the “fail-to- 
use” problem of sticky yet valuable market knowledge 
(Maltz and Kohli 1996), such that the joint occurrence of 
cross-functional cooperative ability and competition offers 
the ability to recognize and assimilate valuable yet tacit 
market knowledge and the joint occurrence of cross- 
functional cooperative intensity and competition provides 
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the channels for knowledge transfer across functional 
boundaries within the firm. 

Although prior work has suggested a link between 
coopetition and performance (e.g., Brandenburger and 
Nalebuff 1996; Hamel, Doz, and Prahalad 1989; Lado, 
Boyd, and Hanlon 1997), few studies have explored the rea- 
sons for this link. Drawing from the knowledge-based view 
of the firm, we probe for an underlying learning mechanism 
and point out that coopetition at the intraorganizational 
level influences firm performance through market learning. 
Specifically, our results indicate that market learning plays 
a major mediating role in the extent to which cross- 
functional coopetition influences firm performance. Thus, 
the value of simultaneous cooperation and competition 
within cross-functional interactions lies in how they influ- 
ence a firm's market learning, which in turn affects the 
firm’s performance. This finding highlights a critical impli- 
cation for marketing scholars because it uncovers an impor- 
tant role of marketing knowledge and intraorganizational 
learning in exploiting cooperative ability and intensity 
among competing departments for better firm performance. 


Managerial Implications 


Our research also offers insight to managers regarding the 
importance of simultaneously managing cooperation and 
competition in cross-functional interactions. Although exec- 
utives may recognize that market knowledge transfer across 
functional areas is valuable, they must also understand that 
it is a complicated process due, in part, to the underlying 
nature of cooperation and competition in cross-functional 
relationships. 

Notably, our results regarding the strong, beneficial 
influence of cross-functional coopetition on firm perfor- 
mance indicate that fostering cooperation but quenching 
competition may limit a firm's full performance potential. 
In other words, our research directs managers to understand 
that competition and conflicts among marketing, R&D, 
finance, and other departments are not always harmful. 
Instead, fostering the joint occurrence of cross-functional 
cooperation and competition can direct conflicts toward 
constructive interactions and actually promote a firm’s 
overall market learning and performance. For example, 
Royal Dutch Shell had two functions that were focused on 
bettering their own individual departments and competing 
for business within the organization. When these competing 
functions came together to share knowledge, the organiza- 
tion reaped higher performance (Burress and Wallace 
2003). This example, coupled with our research findings, 
informs managers that a firm should attempt to coopera- 
tively bridge the competitive nature of its functional areas 
and leverage the joint occurrence of competition and coop- 
eration for improved firm customer and financial 
performance. 

Indeed, the overwhelming push for lean budgets and 
downsizing in today’s business practice is likely to generate 
high levels of competition among functional areas. This 
competition can often be the result of how resources are 
apportioned across departments. For example, firms fre- 
quently cut marketing budgets to save costs, often because 


marketing expenditures are not easily tied to financial out- 
comes (Rust et al. 2004). As a result, with inherent and 
often unavoidable competitive pressures between marketing 
and other functions (i.e., fighting for more budget from the 
shrinking sum within a firm), executives should consider 
specific approaches that can instill cooperative ability and 
intensity across the competing departments, such as use of 
cross-functional teams to focus on organizational goals 
rather than purely functional goals (Griffin and Hauser 
1996; Maltz and Kohli 2000), implementation of explicit 
employee incentives that focus on the firm’s long-term 
objectives (Hauser, Simester, and Wernerfelt 1994), illustra- 
tion of marketing expenditure effects on shareholder value 
to increase the credibility of the marketing function within 
the firm (Rust et al. 2004), and structural changes to offer 
more opportunities for informal interactions among depart- 
ments within the firm. 

Our findings also point managers to the valuable medi- 
ating role of market learning in the process of converting 
cross-functional coopetition into higher performance. 
Specifically, our results indicate that engaging in more 
intensive and absorbed collaborations across competing 
functional boundaries can enhance a firm’s overall perfor- 
mance through a learning route. Thus, managers should 
nurture the frequency of interactions among competing 
departments and the absorptive capacity for cooperation 
among competing functions to promote intraorganizational 
learning, which ultimately advances firm performance. 

Finally, managers who want to enhance organizational 
learning and are interested in more readily transferring 
knowledge within the firm should understand that the cul- 
prit of this problem may result not simply from low inten- 
sity of cross-functional cooperation but also from a lack of 
assimilation and deployment of market knowledge. Overall, 
both cooperative intensity and ability may help functional 
areas recognize where market knowledge resides within the 
organization, assimilate such knowledge, and then deploy 
this market knowledge for improved learning and subse- 
quent superior firm performance. 


Further Research and Current Limitations 


First, further research on cross-functional coopetition could 
examine differences across functional areas regarding the 
cooperative ability to recognize and assimilate market 
knowledge. In particular, our research unveiled that depart- 
ments are not the same in their abilities to assimilate and 
use market knowledge that is available within the firm. 
Thus, further research should pragmatically recognize the 
difference among functional areas regarding their abilities 
to absorb market knowledge and explore antecedents of this 
difference. 

Second, further research could track how learning medi- 
ates the relationship between cross-functional coopetition 
and organizational performance over time. Our research 
indicated that market learning plays a mediating role in 
transferring cross-functional coopetition into customer and 
financial performance. However, it would be interesting to 
examine how cross-functional level learning may translate 
over time to build stronger cross-functional cooperative 


intensity and ability. Overall, the notion of cross-functional 
coopetition may provide future marketing research a novel 
way of thinking about the complexities of market knowl- 
edge transfer across functional boundaries within the firm. 


Appendix 
Operationalization of Measures 


Cross-Functional Cooperative Intensity (6-ltem 
Measure) 

New scale based on the work of Antia and Frazier (2001), 
Narver and Slater (1990), and Rindfleisch and Moorman 
(2001): reliability = .91 and AVE = .74 (reported by 
midlevel managers). 


1. Departments here share communications frequently in our 
business. 

2. All departments frequently discuss common problems in 
our business. 

3. Marketing personnel share close ties with people in other 
departments. 

4 Our relationship with other departments is mutually gratify- 
ing and highly cohesive. 

5. We expect that our strong interdepartmental social relation- 
ship will exist far into the future. 

6. There is little informal interaction among people from dif- 
ferent departments. (Reverse coding) 


Cross-Functional Cooperative Ability (6-Item 
Measure) 


New scale based in part on the work of Szulanski (1996) 
and Cohen and Levinthal (1990): reliability — .91 and 
AVE = .73 (reported by midlevel managers). 

During the interdepartmental interactions, our depart- 
ments here have strong abilities to 


1. Identify new and useful market knowledge transferred from 
other departments. 

2. Understand new and useful market knowledge transferred 
from other departments. 

3. Value new and useful market knowledge transferred from 
other departments. 

4. Assimilate new and useful market knowledge transferred 
from other departments. 

5. Apply new and useful market knowledge transferred from 
other departments. 

6 Exploit new and useful market knowledge transferred from 
other departments. 


Cross-Functional Competition (11-ltem Measure) 


New scale based on the work of Houston and colleagues 
(2001), Maltz and Kohli (1996), and Ruekert and Walker 
(1987): reliability = .90 and AVE = .71 (reported by 
midlevel managers). 


1. We regularly compete for the limited resources across 
departments 


2. When members of several departments talk about distribu- 
tion of resources (i.e., capital, personnel) across depart- 
ments, tensions frequently run high 
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3. Functional areas regularly compete with each other for 
more mental attention and time from top executives. 

4. To get more resources for our department, other depart- 
ments oftentimes have to make sacrifices. 

5. Individual departments here try to obtain more time and 
attention from senior managers even at the costs of other 
functions. 

6. Each department 1s constantly compared and benchmarked 
with other departments to improve efficiency in the 
organization. 

7. Most departments here try to gain more strategic impor- 

tance and power inside the firm. 

. The objectives pursued by the marketing department are 
incompatible with those of other departments (e.g., manu- 
facturing, IT [information technology], operations). (Item 
dropped to enhance scale validity) 

9. Protecting one’s departmental turf is considered to be a 
way of life in this business. 
10. Individual departments here tend to outperform others for 
a better department charter. 

11. People from different departments feel that the goals of 
their respective departments are in harmony with each 
other. (Reverse coding) 


oO 


Market Learning (10-Item Measure) 


New scale based on the work of Menon and colleagues 
(1999) and Srivastava, Shervani, and Fahey (1998): reliabil- 
ity = .90 and AVE = .72 (reported by top executives). 

Regarding market knowledge, compared to other com- 
panies in your industry, my company has a strong position 
in terms of the expertise in 


1. Developing new product/service. 
2. Promoting and selling product/service. 


3. Refining and repositioning existing product/service. 
4. Building brand image among customers. 

5. Developing extensive customer service capabilities. 
6. Sensemaking customers' current and potential needs. 
7. Pricing below competitors. 


8. Establishing marketing and distribution networks in the 
market. 


9. Collaborating with business partners. 
10. Knowing about business environment (tax, labor, institu- 
tional regulations, etc.). 


Organizational Performance 


Customer Performance (4-Item Measure) 
Adapted in part from the work of Moorman and Rust 
(1999) and Narver and Slater (1990): reliability = .95 and 
AVE = .75 (reported by top executives). 


1. Customer loyalty. 

2. Customer satisfaction. 

3. Customer lifetime value. 
4. Customer retention. 


Financial Performance (4-Item Measure) 
Adapted in part from the work of Moorman and Rust 
(1999) and Narver and Slater (1990): reliability = .90 and 
AVE = .72 (reported by top executives). 

1. Market share growth. 

2. Sales growth. 

3. Reducing selling costs. 

4. ROI. 
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Jennifer J. Argo, Darren W. Dahl, & Andrea C. Morales 


Consumer Contamination: How 
Consumers React to Products 
Touched by Others 


Although consumers like to touch products while shopping, the authors propose a theory of consumer contamina- 
tlon, positing that consumers evaluate products previously touched by other shoppers less favorably. The authors 
test the theory by manipulating cues that increase the salience that consumer contact has occurred. Furthermore, 
the authors investigate the role of disgust as the underlying mechanism of the theory. 


highlights the difficulty that cosmetic retailers have 
in managing consumers’ touching and interacting 
with their product offerings. Matthew Waitesmith, vice 
president of MAC cosmetics, is quoted as saying “we never 
double-dip” (p. 20), in reference to stringent policies the 
company uses in an effort to alleviate consumer concern 
over trial and sampling of its retail products. Similarly, in 
the best-selling book, Why We Buy, Paco Underhill (2000) 
discusses the challenges that retailers face in effectively 
managing the impact of consumers’ touching display prod- 
ucts. He notes that consumers often open the packaging of 
products to touch and feel them but usually choose to buy 
products that have been or appear to be untouched by 
others. Indeed, it is common for consumers to try on an arti- 
cle of clothing or flip through a magazine but, after decid- 
ing to make a purchase, select a “fresh” one from the back 
of the display. It is clear that there is a strong consumer 
aversion to touched products, and it is difficult to manage 
consumer touching behavior. However, little research has 
investigated the reason for this phenomenon or has provided 
managers with insight into how to address its implications. 
Recently, marketing researchers have found that infor- 
mation obtained through physical touch plays a central role 
in the evaluation of consumer products (Mooy and Robben 
2002; Peck and Childers 2003). The importance of physical 
touch in consumption has been cited as one of the critical 
factors that has limited the adoption of online shopping, an 
environment that is void of tactile information (Alba et al. 
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1997; Citrin et al. 2003; McCabe and Nowlis 2003). 
Although prior research suggests that touch has a positive 
impact on consumer information search and product evalua- 
tions, the goal of the current research is to show that con- 
sumer contact with products may actually be a double- 
edged sword for marketers. Specifically, we develop a 
theory of consumer contamination, which we define as con- 
tamination from consumer touching. In the product context 
we investigated herein, this behavior results in negative 
implications for a touched product. Importantly, in our 
research, we test the impact of consumer contamination by 
examining cases in which products are unharmed objec- 
tively by contact with other consumers, but nevertheless the 
contact is still viewed as a contamination event. This differs 
from actual physical contamination in which a product 
becomes soiled or dirty as a result of consumer contact. 

Using the law of contagion (Frazer [1890] 1959; Mauss 
[1902] 1972; Tylor [1871] 1974) as a theoretical frame- 
work, we propose that when consumers become cognizant 
that another consumer has previously touched a product, 
their evaluations of and purchase intentions for the product 
decrease. We test this prediction in a retail shopping envi- 
ronment by manipulating factors that increase the salience 
that consumer contact has occurred. We refer to such factors 
as “contamination cues.” In particular, our research investi- 
gates the effects of three contamination cues—proximity to 
contact, time elapsed since contact, and the number of con- 
tact sources—on consumers’ product evaluations and pur- 
chase intentions. In addition to establishing the impact of 
consumer contamination in a retail context, we also provide 
insight into the underlying process through which consumer 
contamination operates by investigating the role of disgust 
as the driving mechanism. Our research reveals that the 
increased salience of contact through a variety of contami- 
nation cues negatively affects consumer evaluations of 
touched products. 


Conceptual Background 


Law of Contagion 


In the late 1800s and early 1900s, anthropologists devel- 
oped a conceptual framework for the laws of sympathetic 
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magic (Frazer [1890] 1959; Mauss [1902] 1972; Tylor 
[1871] 1974). These laws summarize a pattern of beliefs 
and practices that exist over several traditional cultures to 
describe how the world works. One of the central laws of 
sympathetic magic is the law of contagion. According to 
this law, when a source (person/object) and a recipient 
(another person/object) come into direct or indirect contact, 
the source influences the recipient (Rozin and Nemeroff 
1990). This influence arises because when the two touch, 
the source transfers its essence (i.e., some or all of its essen- 
tial properties) to the recipient (Nemeroff and Rozin 1994). 
Notably, the essence does not need to be visible to the 
naked eye and can be composed of invisible entities (Rozin 
and Nemeroff 1990). According to the law, the source's 
essence remains a part of the recipient even after contact 
has been broken (Rozin and Nemeroff 1990). 

Previous research has indicated that the law of conta- 
gion can play a significant role in an interpersonal context. 
For example, in anthropological research on the Hua tribe 
of Papua New Guinea, Meigs (1984) finds that when a Hua 
woman marries and moves to her husband's community, no 
one in the community will consume the food she produces 
and/or serves. This refusal arises because the villagers 
believe that the woman's essence has been transferred to the 
food; if they accept the food, they will become polluted 
with her properties (Meigs 1984). 

In a different context, correlational research has shown 
that people can have both positive and negative responses to 
objects that have been touched by others. Although people 
are reluctant and uncomfortable coming into physical con- 
tact with an object that has been used by a disliked person 
(e.g., a laundered sweater that belonged to Adolf Hitler), 
this research shows that they sometimes evaluate objects 
used by people they like more favorably (e.g., a hamburger 
with a bite taken out of it by their lover; Nemeroff 1995; 
Rozin, Markwith, and McCauley 1994; Rozin, Millman, 
and Nemeroff 1986; Rozin et al. 1989). However, the pro- 
cesses that are believed to underlie such negative and posi- 
tive effects from contact are not identical. Negative con- 
tamination (i.e., the value of an object is decreased through 
contact) has been shown to be the result primarily of a 
residue model that results from physical contact. Con- 
versely, positive contagion (i.e., the value of an object is 
increased through contact) has been found to be the result 
strictly of a symbolic interaction model that is based on 
interpersonal/moral factors (Nemeroff and Rozin 1994). 
Thus, previous research shows that negative contamination 
effects are driven primarily by a transfer of essence, as in 
the Hua tribe example, and that positive contamination 
effects arise primarily because people want to be associated 
with an item that belongs to someone about whom they 
have strong, positive feelings (Nemeroff and Rozin 1994). 
Although both positive and negative contamination are pos- 
sible in the retail context under investigation in this article, 
we focus specifically on negative contamination effects. 


Negative Contamination in a Retall Context 


In O'Reilly and colleagues' (1984) study, 7696 of survey 
respondents indicated that they would not buy used under- 
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clothing, and 2096 refused to purchase used overcoats. The 
justification for the low desirability of the used products 
was related to a fear of contamination. Because the original 
owners of the used products remained anonymous to con- 
sumers, these feelings could not be based on associations 
but instead suggest a belief that the clothing was contami- 
nated simply because it had been handled and owned by 
someone else previously. Indeed, there are many cases in 
which contact with consumers actually causes products to 
become less desirable to other consumers as a result of 
physical contamination from incidental touching or usage 
(e.g., floor-room models, demonstration products). In the 
current research, we argue that consumers generalize this 
notion for even brief and incidental contact that does not 
involve formal usage, ownership, or actual physical 
contamination. 

This generalized contamination effect has negative 
implications for products that have been handled by con- 
sumers. Much like second-hand items or demonstration 
products that suffer devaluation (Kotler and Mantrala 
1985), contamination that arises from incidental consumer 
contact should also lead consumers to devalue touched 
products. This devaluation should be manifested by lower 
evaluations of and purchase intentions for touched products 
than for untouched products. 

Note also that consumer contact with a product does not 
need to be formally observed to produce contamination 
effects (Rozin, Markwith, and McCauley 1994; Rozin et al. 
1989). Rarely does a person actually see who has put 
clothes on consignment or who has previously test-driven 
an automobile. In these cases, consumers infer that contact 
has occurred. Thus, simply a cue in the retail environment 
that signals to a consumer that someone else has previously 
touched the product is necessary for consumer contamina- 
tion to occur. Given this, we expect that perceived contact 
between consumers and products is sufficient, and observed 
physical contact is not essential for the elicitation of con- 
sumer contamination effects. More formally, 


P,: Perceived contact between a consumer and a product 
results in other consumers having less favorable evalua- 
tions of the touched product. 


Implicit to our proposition is the presence of environ- 
mental signals that provide cues to the consumer that a 
product has been touched. These signals, or contamination 
cues, can be relayed through a variety of sources, including, 
but not limited to, other consumers, sales staff, merchandis- 
ing placement, and product characteristics. These cues 
increase the salience of the contamination and thus increase 
the likelihood of consumer contamination. Angyal (1941) 
provides support for this idea by noting that the greater the 
salience or intimacy of a contamination event, the greater is 
the impact of contamination effects. Therefore, the more 
clearly a contamination cue signals that contact between a 
consumer and a product has taken place, the greater is the 
effect of contamination on other consumers' evaluations of 
that product. Moreover, the greater the intimacy of contact, 
the greater is the likelihood of contamination occurring. 
This implies that contamination effects differ depending on 
the type of cue present and the type of product, because the 


degree of intimacy changes by product category. In the cur- 
rent research, however, we focus only on the effects of dif- 
ferent contamination cues. 

Although there are likely to be several possible con- 
tamination cues present in the retail environment, we focus 
our empirical investigation on three specific cues: proximity 
to contact, time elapsed since contact, and number of con- 
tact sources. In the next section, we discuss and develop 
specific hypotheses that are related to these contamination 
cues. Furthermore, drawing on previous work on contami- 
nation (Rozin and Fallon 1987; Rozin, Millman, and 
Nemeroff 1986), we also forward a prediction about the 
role of disgust as a central mechanism in the incidence of 
consumer contamination in a retail context. 


Hypotheses 


Proximity to Contact as a Contamination Cue 


The first contamination cue we examine is proximity to 
contact, which we define as the physical location of the 
product relative to where it would be touched by other con- 
sumers in a retail setting. To understand how proximity to 
contact influences evaluations and purchase intentions of 
touched products, we draw from social impact theory (SIT; 
Latané 1981), which proposes that the presence or action of 
a real, implied, or imagined social presence affects a target. 
This impact results from three forces: immediacy (i.e., 
proximity), number (ie., how many), and strength (i.e., 
importance; Latané 1981). One of the theory’s principles 
predicts that the impact of each of the forces increases as a 
power function. Thus, in terms of proximity, the greatest 
impact arises from close proximity as opposed to being far- 
ther away (Argo, Dahl, and Manchanda 2005; Latané and 
Wolf 1981). Extending this principle to consumer contami- 
nation, we predict that contamination salience will be 
greater the closer a product is situated to a location of con- 
tact. We expect that this will happen because as the product 
moves closer in proximity to a contact location, consumers 
will perceive that there is an increase in the probability that 
the product has been touched. This expectation is consistent 
with observations from retail settings in which consumers 
often select products, such as magazines, that are located at 
the bottom or back of a display (i.e., where there is a lower 
probability that the product has been touched because it is 
located farther out of a consumer's reach). Thus, we expect 
that as contamination salience increases through proximity 
to contact, there will be an increase in the impact of con- 
sumer contamination on product evaluations and purchase 
intentions. On the basis of this discussion, we expect that 
this impact will be negative in valence. 


Hı: Consumer product evaluations and purchase intentions 
will be lower if a product is physically proximate (versus 
not physically proximate) to the location where another 
consumer is believed to have had contact with it. 


Time Elapsed Since Contact as a Contamination 
Cue 


The second contamination cue we consider is the amount of 
time that has elapsed since consumer contact has occurred. 


Unlike proximity to contact, it is unclear how time influ- 
ences product evaluations and purchase intentions. The law 
of contagion suggests that when contact has been made 
between a source and a recipient, the source continues to 
influence the recipient indefinitely— "once in contact, 
always in contact" (Frazer [1890] 1959; Rozin, Markwith, 
and McCauley 1994). In other words, after an object has 
been contaminated, it should remain that way, even after the 
two items are no longer touching (Frazer [1890] 1959). Pre- 
liminary evidence for this idea has been found in two corre- 
lational studies. In the first study, Rozin and colleagues 
(1989) find that attempts to cleanse a contaminated item by 
washing it failed to eliminate the effects of contamination 
completely. In the second study, Hejmadi, Rozin, and Siegal 
(2004) demonstrate that children believed that juice contin- 
ued to be contaminated even after a contaminant had been 
removed and tbe juice was purified. These findings provide 
some suggestive evidence that consumers will believe that a 
product is contaminated by physical contact with another 
consumer, regardless of the amount of time that has elapsed 
since it was last touched. Thus, the impact of consumer 
contamination on product evaluations and purchase inten- 
tions should be consistent across time. 

However, recent research on construal-level theory sug- 
gests that consumers' responses to a contamination event 
vary at different points in time (Liberman and Trope 1998; 
Trope and Liberman 2003). Trope and Liberman (2003) 
examine how temporal distance systematically alters the 
way people mentally construe future events. They posit that 
when people consider distant-future events, they form high- 
level construals (i.e., abstract representations). In contrast, 
when people consider near-future events, they form low- 
level construals that involve more concrete and vivid 
details. Although Trope and Lieberman focus exclusively 
on the impact of temporal distance for future events, they 
suggest that construal theory principles should apply to past 
events as well. Therefore, when people consider distant-past 
events, similar to the distant-future events, they will form 
higher-level, less concrete construals about the event than 
when they consider near-past events. Given this, it would be 
expected that the salience of consumer contamination 
would be mitigated as time passes. In other words, the 
shorter the time elapsed since consumer contact, the 
stronger is the consumer contamination effects realized. We 
offer two competing hypotheses: 


H.,: Consumer product evaluations and purchase intentions 
will be lower 1f a product is believed to have had contact 
with another consumer, regardless of the time elapsed 
since contact. 


H»: Consumer product evaluations and purchase intentions 
will be lower if a product is believed to have had contact 
with another consumer; this effect will be mitigated as 
time elapses since contact. 


Number of Contact Sources as a Contamination 
Cue 


The third contamination cue we examine is the number of 
contact sources. Specifically, we consider how consumer 
contamination effects are influenced by the number of con- 
sumers believed to have had physical contact with a prod- 
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uct. As we mentioned previously, SIT predicts that the 
impact of the number of people who constitute a real, imag- 
ined, or implied social presence will increase as a power 
function. Therefore, we expect that an increase in the num- 
ber of sources that a consumer believes have previously 
touched a product will increase the contamination effects. 
Although SIT predicts that contamination effects should 
be stronger (i.e., lower product evaluations) when more 
consumers are believed to have come in contact with a 
product, multiple contact sources might actually increase 
product evaluations by signaling that the product is popular. 
Thus, rather than lower the value of the product, contact by 
multiple consumers may heighten evaluations and purchase 
intentions of a touched product by emphasizing its popular- 
ity. We hypothesize the following competing predictions: 
H3a: Consumer product evaluations and purchase intentions 


will be lower if a product 1s believed to have had contact 
with many (versus one) consumers. 


Has: Consumer product evaluations and purchase intentions 
will be higher if a product 1s believed to have had contact 
with many (versus one) consumers. 


Feelings of Disgust as the Underlying Mechanism 


The previous discussion argues that as contamination 
becomes more salient to consumers, they are more likely to 
experience consumer contamination effects. What is 
unclear, however, is the underlying mechanism that drives 
consumers to respond unfavorably to products that have 
been touched by other consumers. Previous research has 
suggested that the law of contagion operates in the Western 
world in the domain of disgust (Rozin, Millman, and 
' Nemeroff 1986), which is defined as revulsion in response 
to an offensive object (Rozin and Fallon 1987). Feelings of 
disgust have been shown to underlie contamination effects 
in various contexts, including food rejection, disease, and 
moral taint (Rozin, Haidt, and McCauley 1993; Rozin, 
Markwith, and Nemeroff 1992). Furthermore, in general, 
disgust manifested in rejection and/or lower evaluations of a 
contaminated object (Rozin and Fallon 1987; Rozin, Haidt, 
and McCauley 1993; Rozin and Nemeroff 1990; Rozin, 
Millman, and Nemeroff 1986). 

Given these findings, we expect that consumers who 
perceive that physical contact has occurred between another 
consumer and a product will experience feelings of disgust 
toward the contaminated product. In turn, these feelings of 
disgust will translate into negative consumer evaluations 
and purchase intentions. Thus, disgust 1s the primary dri- 
ving force that motivates a change in consumer evaluations 
for products that are touched by others. 


H4: Disgust mediates the influence of consumer contamina- 
tion on product evaluations and purchase intentions. 


We test these hypotheses in three field-study experi- 
ments. In the first study, we investigate the role of the prox- 
imity to contact (Hj) in facilitating contamination effects. 
Study 2 broadens the investigation by adding a manipula- 
tion of time elapsed since contact and provides an explicit 
test of the competing hypotheses put forth in H}. Finally, 
Study 3 examines how the number of contact sources (H3) 
influences consumer contamination. In each experiment, we 
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measure feelings of disgust as an underlying motivator (H4) 
in producing the contamination effects that consumers 
experience. 


Study 1 


Overview 

The purpose of this study is to establish that consumer con- 
tamination operates in a retail setting, negatively influenc- 
ing evaluations of products that have been touched by other 
consumers. Furthermore, we empirically test Hj and H4. 
Study 1 consists of a one-way between-subjects experi- 
mental design with proximity to contact (close versus 
medium versus far) as the experimental factor. We ran- 
domly assigned 70 undergraduate students from a large 
North American university to one of the three conditions; 
they received course credit for their participation. 


Method 


Procedure. Participants were tested individually at the 
university student center. When they arrived, they were 
informed that the objective of the study was to gather mar- 
keting research information for the university bookstore. 
Participants were told to visit the store, look around to get 
an impression of the store (three to five minutes), and then 
complete an assigned task. To identify the task they needed 
to complete while in the store, they were instructed to select 
an envelope that would identify a task for them. They were 
told that the tasks ranged from making a purchase to trying 
or using a product. Unbeknownst to the participants, each 
envelope indicated that they would be required to perform 
the same task: to try on a specific T-shirt located in the 
store. The envelope included a picture of the target T-shirt 
and instructed participants to find a sales associate in the 
clothing section who would help locate the proper article of 
clothing. After selecting the envelope, participants pro- 
ceeded to the bookstore. At this time, the experimenter text- 
messaged a trained confederate located in the store to indi- 
cate that the participant was en route and to provide a 
description of him or her. The confederate assumed the role 
of a sales associate at the bookstore and was responsible for 
implementing the proximity-to-contact manipulation. One 
male and one female confederate alternated the role, and 
both were blind to the experimental hypotheses.! 

In all conditions, after participants located the sales 
associate in the store and showed him or her the photograph 
of a one-of-a-kind T-shirt, the sales associate said, “We only 
have one left in stock." In the close condition, the sales 
associate finished the sentence with “and someone is trying 
it on." The sales associate then took the participant over to 
the dressing room to wait, and a couple of seconds later, a 
second confederate, who assumed the role of another shop- 
per (same gender as the participant), exited the dressing 
room, leaving the T-shirt in the room. In the medium condi- 
tion, the sales associate completed the sentence with “and it 
is over here on the return rack" and then took the participant 





lThe sales associate was also responsible for monitoring the 
cleanliness of the T-shirt and replaced it with a new one after every 
second participant or if it became soiled. 


to a return rack located near the dressing room. In the far 
condition, the sales associate finished the sentence with 
“and it is just over here on the rack” and took the participant 
to a regular display rack located a few feet away from the 
return rack. Note that at no time in any of the conditions did 
either confederate actually touch the target T-shirt. Physical 
contact with the target T-shirt was implied only by the 
proximity-to-contact manipulation. 

After finishing the task and returning to the experi- 
menter, participants completed a questionnaire. Participants 
were then debriefed and were thanked for their 
participation. 

Dependent variables. The questionnaire contained both 
the dependent variables of interest and questions consistent 
with the cover story. We measured overall product evalua- 
tions using five seven-point scales. The items included 
"bad/good," “negative/positive,’ “undesirable/desirable,” 
"unfavorable/favorable," and “dislike/like.” We averaged the 
responses to form an evaluation index (à = .95). We mea- 
sured purchase intentions using a seven-point scale that 
asked participants to indicate how likely they would be to 
buy the product (1 = “very unlikely to buy," 7 = “very likely 
to buy”). 

' We also asked participants to indicate how they felt dur- 
ing the shopping experience using a battery of both positive 
and negative affect terms (1 = “not at all,’ 7 = "very"). 
Embedded in this list were four items intended to measure 
feelings of disgust (Haidt, McCauley, and Rozin 1994; 
Smith and Ellsworth 1985) and other negative emotional 
responses that could be elicited by the shopping experience 
and could potentially account for changes in product 
evaluations. We created a disgust index by averaging the 
following four items: disgusted, revolted, unclean, and 
gross (à = .76). To rule out the possibility that negative 
emotions other than disgust accounted for the results, we 
created another index for other negative emotions by aver- 
aging the following five items: frustrated, bad, annoyed, 
angry, and mad (@ = .85). 

To determine whether participants' reactions toward the 
product were a direct result of specific physical contamina- 
tion or a more general contamination effect, we asked them, 
“How dirty was the product you used/tried?" We averaged 
two seven-point scales (“dirty, unsanitary” and “not at all/ 
very”) to create a dirty index (r = .70, p < .01). After com- 
pleting these measures, participants completed a manipula- 
tion check for proximity to contact (location recall measure) 
by answering an open-ended question that asked where they 
found their target product., Participants also indicated their 
gender, age, and major and completed an open-ended suspi- 
cion probe. Responses to the demographic measures 
(including match/mismatch on gender relative to the con- 
federate) were not significant in this study or any of the 
subsequent studies; thus, we do not discuss this issue 
further. 


Results 


Preliminary analyses. Of the 70 participants who took 
part in the study, 67 successfully completed the assigned 
task. The sample used in the subsequent analysis consisted 


of 67 participants (males = 30, females = 37); cell sizes 
ranged from 21 to 24. An analysis of the manipulation 
check for proximity indicated that all participants were cog- 
nizant of where the product was located. An examination of 
responses to the suspicion probe revealed that no partici- 
pants were aware of the experimental hypotheses. 


Test of hypotheses. H, predicted that product evalua- 
tions and purchase intentions would decrease as the product 
became closer to where contact with another consumer took 
place (1.e., contamination salience increased). An analysis 
of variance (ANOVA) using proximity to contact as the 
independent variable and the evaluation index as the depen- 
dent variable produced a significant main effect (F(2, 64) = 
22.95, p < .001). Post hoc tests indicated that reducing 
proximity to contact results in a linear decrease in product 
evaluations. Participants evaluated the T-shirt significantly 
more positively when it was located on the regular rack 
(Mg, = 5.20) than when it was on the return rack 
(Medium = 3.96, t(64) = 3.24, p < .01) or in the dressing 
room (Molose = 2.47, t(64) = 6.77, p < .001), and they evalu- 
ated it significantly more positively when it was located on 
the return rack than when it was in the dressing room 
(t(64) = 3.50, p < .001). 

To determine whether proximity to contact also signifi- 
cantly influenced participants’ purchase intentions, we con- 
ducted a second ANOVA. Again, we found a significant 
effect (F(2, 64) = 9.04, p « .001). Post hoc comparisons 
showed that participants’ purchase intentions were lower as 
the proximity to the contact location became closer (Mo. = 
2.46; M nedum = 3.64, t(64) = 2.20, p < .05; close versus far: 
Mg, = 4.76, t(64) = 4.25, p < .001; medium versus far: t(64) 
= 2.03, p < .05). Thus, we found support for H}. 

To determine whether feelings of disgust mediated the 
impact of proximity to contact on product evaluations and 
purchase intentions (H4), we conducted a mediation analy- 
sis. Following the work of Baron and Kenny (1986), we 
found that proximity to contact significantly predicted feel- 
ings of disgust (F(2, 64) = 28.02, p < .001; Mo, = 4.11, 
M medium = 2-35, Mg, = 1.27). Next, we estimated a path 
analysis that included the direct path for the effect of prox- 
imity to contact on product evaluations and an indirect path 
mediated by disgust. The results produced a significant 
indirect path, indicating that disgust fully mediated the 
effect of the independent variable on product evaluations 
(see Figure 1, Panel A). We also conducted a path analysis 
to establish whether disgust significantly mediated purchase 
intentions. À second mediation analysis indicated that dis- 
gust fully mediated the effect of proximity to contact on 
purchase intentions (see Figure 1, Panel B). We conducted 
an additional path analysis to determine whether product 
evaluations mediated the impact of proximity to contact on 
purchase intentions. As we show in Figure 1, Panel C, prod- 
uct evaluations fully mediated the relationship. 


Additional analyses. 'To rule out the possibility that con- 
tamination made participants experience other negative 
emotions and that these negative emotions caused the lower 
evaluations of the contaminated products, we conducted an 
analysis using the negative emotion index. The ANOVA 
indicated that proximity to contact failed to significantly 
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FIGURE 1 
Study 1: Path Analysis 
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influence other negative emotions (F(2, 64) = 1.71, p > .10), 
providing evidence that feelings of disgust are specific in 
the facilitation of consumer contamination effects. A more 
general, negative reaction does not appear to drive the lower 
evaluations and purchase intentions for a product that has 
been touched by other consumers. 

We also conducted an analysis using the dirty index as a 
dependent variable (F(2, 64) = 1.89, p > .10). Null effects 
for this measure indicate that consumers do not perceive 
actual physical contamination from the consumer touching 
that is inherent in our manipulation but rather experience a 
more general contamination effect that directly affects their 
evaluations and purchase intentions for the product. An 
analysis in subsequent studies (i.e., Studies 2 and 3) also 
shows null effects for both the negative emotion and dirty 
indexes; thus, we do not discuss these measures further. 


Discussion of Study 1 


Study 1 provides the first empirical evidence in support of 
the existence of consumer contamination. The results show 
that when consumers receive a signal that another consumer 
has touched a product, they lower both their evaluations and 
their purchase intentions for that product. Notably, partici- 
pants never actually observed other people touching the 
product; instead, they inferred contact through the contami- 
nation cue of proximity to contact. By changing where the 
product was located in the store, we also manipulated the 
salience of contamination. In particular, when the product 
was located closer to the perceived point of contact, partici- 
pants’ evaluations of and purchase intentions for the product 
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were lower. This decrease in evaluations and intentions 
appears to be driven by feelings of disgust that arise when 
another shopper is believed to have contaminated a product 
through contact. 

Although the results of Study 1 are consistent with a 
theory of consumer contamination, it is possible that the 
lower product evaluations and purchase intentions in the 
dressing room and return rack (compared with the regular 
rack) conditions were not the outcome of contamination but 
rather the result of consumers thinking that the product they 
were asked to try on had been previously rejected. In other 
words, the location of the product (i.e., either in the dress- 
ing room or on the return rack) may have signaled to par- 
ticipants that the product was deficient because someone 
else had previously tried it on but decided not to purchase it. 
To test this alternative explanation, we implemented an 
additional experimental condition (rejection condition). 

In the rejection condition, as in the far condition, the 
sales associate indicated that the product was located on a 
regular rack in the store. However, when the sales associate 
and the participant arrived at the rack, they found that it was 
empty. After the sales associate vocalized that it appeared as 
though someone else already had the T-shirt, another shop- 
per (i.e., a confederate), who was standing nearby holding 
the target T-shirt and a couple other items by the hangers, 
looked over and said, “I was going to try it on, but I don’t 
want it after all" Without ever making contact with the 
shirt, he or she then handed the participant the one-of-a- 
kind T-shirt on the hanger. Independent sample t-tests com- 
pared the rejection condition and the far condition, and the 
results indicated that the two conditions did not differ sig- 
nificantly for either product evaluations (t(38) = .74, p > 
20, Myejection = 4.87) or purchase intentions (t(38) = .35, 
p > .20, Micjecuon = 4.58). The absence of statistical signifi- 
cance suggests that negative product evaluations and pur- 
chase intentions in the dressing room and return rack condi- 
tions are consistent with consumer contamination and are 
not explained by another consumer rejecting the product. 

Another possible explanation for the results of Study 1 
that cannot be addressed with the current findings is that the 
location of the product not only manipulated proximity to 
contact but also may have changed perceptions about the 
amount of time elapsed since contact. Specifically, when 
the product was located in the dressing room or on the 
return rack, consumers may have thought that the product 
was more recently contaminated than when it was located 
on the regular rack. In turn, knowledge about the time 
elapsed since contact may have subsequently affected prod- 
uct evaluations and purchase intentions. We tease apart 
these two competing explanations in Study 2 by adding an 
explicit manipulation of time elapsed since contact from 
another consumer. 


Study 2 


Overview 

We conducted Study 2 to investigate the role of time 
elapsed since contact in the realization of consumer con- 
tamination. This study empirically tests the competing 
hypotheses H2, and Hz. Furthermore, in this study, to clar- 


ify the role of proximity to contact and time elapsed since 
contact in producing contamination effects, we manipulate 
both contamination cues separately. Participants in the 
study included 95 students from a large North American 
university. We randomly assigned them to a condition in a 2 
(proximity to contact: medium versus far) x 2 (time elapsed 
since contact: short versus long) + 1 (control: no touch) 
between-subjects experimental design. The 48 male and 47 
female participants received $10 for compensation for their 
participation. Cell sizes ranged from 18 to 20. 


Method 


Procedure. We used the same procedure as that in Study 
1, but with two exceptions: First, we used only two levels of 
proximity to contact, medium and far. We manipulated both 
the medium and the far conditions as in Study 1. Second, to 
manipulate time elapsed since contact, the sales associate 
indicated the amount of time that had passed since the T- 
shirt was last touched by another consumer. In the short 
condition, the sales associate told the participant that the T- 
shirt had just been tried on, whereas in the long condition, 
the sales associate indicated that the T-shirt had not been 
tried on for a few days. We included an additional group in 
the design to serve as a control group. Participants in this 
condition found the target product located on the regular 
rack (similar to the far proximity-to-contact condition) but 
were not provided with any contact information. The 
manipulation check for time elapsed since contact used two 
nine-point scales that asked, “How long do you think the 
product was in the same exact location where you found 
it?” (1 = “very short period of time,” 9 = “very long period 
of time”) and “How much time do you think the product 
was in the same exact location where you found it?” (1 = 
“minutes,” 9 = *months"). We averaged these two items and 
combined them to form a time index (r = .73, p < .001). 
Dependent measures followed those in Study 1 (evaluation 
index a = .96, disgust index o = .97, location recall mea- 
sure, and purchase intentions). 


Results 


Preliminary analyses. All participants were cognizant of 
the product’s location. A 2 x 2 ANOVA with the time index 
as the dependent variable produced only a significant main 
effect for time elapsed since contact (F(1, 71) = 13.25, p = 
001; Miong = 5.26, Mshor = 3.63)? Thus, the manipulation 
checks were successful. We also examined responses to the 
suspicion probe, which revealed that participants were 
unaware of the experimental hypotheses. 


Test of hypotheses. Recall that H, put forth competing 
predictions about the role of time elapsed since contact for 
consumer contamination. H», proposed that the passing of 
time would not influence contamination, whereas Hap pro- 
posed that the passage of time would mitigate contamina- 


ZWe conducted a one-way ANOVA with the control group and 
the amount of time conditions for the time delay index (F(2, 92) = 
15.95, p < .001; Meontro) = 6.14; control versus long: t(92) = 2.24, 
p < 05; control versus short: t(92) = 5.39, p < .001). 


tion effects. A 2 (proximity to contact: medium versus 
far) x 2 (time elapsed since contact: short versus long) 
ANOVA with the evaluation index as the dependent varia- 
ble produced a main effect for proximity to contact 
(F(1, 71) = 8.09, p < .01; M nedum = 3.58, Mg, = 4.47) and 
a main effect for time elapsed since contact (F(1, 71) = 
12.78, p = .001; Miong = 4.60, Minot = 3.51). However, 
these main effects were qualified by a significant interaction 
between the two factors (F(1, 71) = 5.43, p « .05; 


Mmedum and shot = 2-64, Medium and long 7 4.51, 
Mear and short ^ 4.28, Mgr and long = 4.67; see Figure 2. Panel 


FIGURE 2 
Study 2: Influence of Proximity to Contact and 
Time Elapsed Since Contact 


A: Product Evaluations Index Scores 


Product Evaluations 





Control Medium Far 


Proximity to Contact 





B: Purchase Intentions Score 


Purchase Intentlons 





Contro! Medium Far 


Proximity to Contact 





Consumer Contamination / 87 - 


A). Planned contrasts indicated that participants’ evalua- 
tions of the product were significantly lower when the prod- 
uct had just been touched and was located on the return rack 
than when it was on the regular rack (t(71) = 3.68, p < 
.001). Furthermore, evaluations were significantly lower 
when the product was located on the return rack and had 
just been touched than when it was touched days before 
(t(71) = 4.10, p « .001). No other comparisons were signifi- 
cant (ts < 1). These findings support Hy, and suggest that 
the impact of consumer contamination disperses over time. 
It appears that for consumer contamination, “once in con- 
tact" does not mean "always in contact." 

A second ANOVA indicated that purchase intentions 
were also significantly influenced by time elapsed since 
contact (F(1, 71) = 12.25, p = .001; Miong = 3.89, Mohort = 
2.53) and marginally influenced by proximity to contact 
(F(1, 71) = 3.14, p < .10, Mnedum = 2-83, Mg, = 3.54). 
However, again, these main effects were qualified by a sig- 
nificant interaction between proximity to contact and time 
elapsed since contact (F(1,71) = 4.11, p « .05; 
Mmedium and shot = 1.72, Mmedium and long = 3.94, 
Mear and short ^ 3:255 Mr and long = 3.84; sco Figure 2. Panel 
B). Planned contrasts produced the same pattern as that for 
product evaluations. Specifically, participants were less 
willing to purchase the product when it had recently been 
touched and was located on the return rack than when the 
product was on the regular rack (t(71) = 2.70, p « .01). Fur- 
thermore, participants were less willing to purchase the 
product when it was on the return rack and had just been 
touched than when it had been days since the product had 
been touched (t(71) = 3.83, p < .001). No other contrasts 
were significant. Again, this result provides support for the 
notion that consumer contamination is not permanent but 
dissipates over time (Hyp). 

A two-way ANOVA with the disgust index as the 
dependent variable produced two main effects (proximity to 
contact: F(1, 71) = 14.48, p = .001; M medum = 2.69, Mear = 
1.47; time elapsed since contact: F(1, 71) = 4.23, p « .05; 
Miong = 1.74, Mago, = 2.36) and a significant interaction 
(F(1, 71) = 5.82, p < .05; Mmedum and short = 3-40, 
M medum and long = 1-97, Miar and short = 1.41, Moar and long = 
1.53). To establish whether disgust mediated the impact of 
contamination cues on product evaluations, we conducted a 
mediation analysis. We estimated a path analysis that 
included direct paths for proximity to contact, time elapsed 
since contact, and their interaction on product evaluations 
and an indirect path that was mediated by disgust. The 
results produced a significant indirect path, indicating that 
disgust fully mediated the effects of the independent varia- 
bles on product evaluations (see Figure 3, Panel A). We also 
conducted a path analysis for purchase intentions. Similar 
to product evaluations, the results revealed a significant 
indirect path, suggesting that disgust also fully mediated the 
impact of the contamination cues on intentions (Figure 3, 
Panel B). Finally, we conducted a path analysis to establish 
whether product evaluations mediated the impact of con- 
tamination cues on purchase intentions; as we indicate in 
Figure 3, Panel C, full mediation was present. 

Control group. We compared the control group with the 
time-elapsed-since-contact conditions in a series of one- 
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way ANOVAs, which produced significant results (product 
evaluations: F(2, 92) = 11.87, p < .001; Moontro) = 5.33; pur- 
chase intentions: F(2, 92) = 12.55, p < .001; Mosa) = 4.80; 
disgust: F(2, 92) = 4.51, p < .05; Montroi = 1.25). Post hoc 


FIGURE 3 
Study 2: Path Analysis 
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tests indicated that though the control group differed signif- 
icantly from the short condition (product evaluations: 
t(92) = 4.63, p < .001; purchase intentions: t(92) = 4.75, p < 
.001; disgust: t(92) = 2.90, p < .01), it did not differ signifi- 
cantly from the long condition (product evaluations: t(92) = 
1.86, p > .05; purchase intentions: t(92) = 1.89, p > .05; dis- 
gust: t(92) — 1.29, p » .10). This provides further support 
for the findings that over time, the impact of contamination 
dissipates. 


Discussion of Study 2 


The findings of Study 2 provide additional support for the 
existence of consumer contamination in a retail setting. 
Consistent with Study 1, we found that as the salience of 
contamination increased with proximity to contact, con- 
sumer evaluations of and purchase intentions for a contami- 
nated product decreased. However, the introduction of a 
second contamination cue, time elapsed since contact, mod- 
erated this finding. In particular, the results for proximity to 
contact only held when participants believed that other con- 
sumers had recently contaminated the product. We address 
the concerns identified in Study 1 by separately manipulat- 
ing time elapsed since contact and defining its independent 
influence on consumer contamination. Following construal- 
level theory (Liberman and Trope 1998; Trope and Liber- 
man 2003), when the time elapsed since contact increased, 
the impact of proximity to contact was mitigated. We add to 
the law of contagion by identifying a contamination context 
in which the effects of contamination appear to dissipate 
over time. Indeed, contamination effects do not appear to be 
permanent in an interpersonal, retail context. 


Study 3 


Overview 

Studies 1 and 2 demonstrate how two contamination cues, 
proximity to contact and time elapsed since contact, influ- 
ence consumers’ evaluations of products that have been in 
contact with other consumers. The primary goal of Study 3 
is to investigate the influence of a third contamination cue: 
the number of contact sources. Study 3 also extends the 
impact of consumer contamination to another dependent 
measure: willingness to pay (WTP) for the product. We ran- 
domly assigned 104 undergraduate participants to a condi- 
tion in a 2 (proximity to contact: medium versus far) x 2 
(number of contact sources: one versus many) + 1 (control: 
no touch) between-subjects experimental design. Partici- 
pants received $10 as compensation for their participation. 


Method 


Procedure. We used the same procedure as that in Study 
2, but with two exceptions: First, to manipulate the number 
of contact sources, in the one-contact-source condition, the 
sales associate informed participants that one other person 
had tried on the shirt, whereas in the many-contact-sources 
condition, the sales associate indicated that a lot of other 
people had tried on the shirt. We assessed the effectiveness 
of this manipulation by asking participants to indicate on an 
eight-point scale approximately how many other people had 
tried on/used the exact same product as them (1 = “no one,” 


8 = "more than seven"). Second, we introduced a new 
dependent measure in the study; we asked participants what 
they would be willing to pay for the product (i.e., dollar 
amount). Other measures followed those used in Study 1 
(evaluation index & = .97, disgust index o = .86, and loca- 
tion recall measure). 


Results 


Preliminary analyses. Of the 104 participants who took 
part in the study, 3 were unable to complete the assigned 
task successfully. Thus, the sample used in the analyses 
reported here consisted of 101 participants (31 males and 
70 females); cell sizes ranged from 18 to 22. 

Analysis of the location recall question showed that all 
participants were cognizant of the location of the product. A 
2 x 2 ANOVA with the rating of how many people had tried 
on the product as the dependent variable produced a signifi- 
cant main effect for number of contact sources (F(1, 77) = 
11.51, p < .001; Mone = 3.60, Mmany = 5.18).? An examina- 
tion of the suspicion probe indicated that no participants 
were aware of the experimental hypotheses. 


Test of hypotheses. H4 put forth competing predictions 
about the influence of the number of contact sources in pro- 
ducing consumer contamination. H4, predicted that con- 
sumer product evaluations and purchase intentions would 
be lower if a product was thought to have had contact with 
many (versus one) consumers, whereas Ha, predicted that 
product evaluations and purchase intentions would be 
higher in this instance. À 2 (proximity to contact: medium 
versus far) x 2 (number of contact sources: one versus 
many) ANOVA with the evaluation index as the dependent 
variable produced a main effect for number of contact 
sources (F(1, 77) = 4.45, p « .05) and a main effect for 
proximity to contact (F(1, 77) = 5.08, p « .05). The interac- 
tion effect between the two independent variables was not 
significant (F < 1). Consistent with H3,, participants evalu- 
ated the product more positively when they believed that 
only one other person had touched the product before them 
(Mone = 3.86) than when they believed that several other 
people had touched the product before them (Mg, = 
3.19). Furthermore, consistent with H}, participants evalu- 
ated the product more positively when the product was 
located on a regular rack in the store (Mg, — 3.89) than 
when it was on the return rack (Mmedium = 3.19). 

We conducted a similar analysis using WTP as the 
dependent variable. The results for WTP also produced two 
main effects (proximity to contact: F(1, 77) = 4.38, p « .05; 
number of contact sources: F(1, 77) — 4.73, p « .05). Par- 
ticipants were willing to pay more to purchase the product 
when it was located on the regular rack in the store (Mg, = 
$20.00) than when it was on the return rack (Migequm = 
$11.85), and they were willing to pay more when only one 


3We conducted a one-way ANOVA with the control group and 
the number of contact source conditions for the number index 
(F(2, 100) = 15.38, p < .001; Moontrol = 2.15; control versus one: 
t(98) = 2.64, p = .01; control versus many: t(98) = 5 39, p « .001). 
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person had touched it (Mone = $16.18) than when multiple 
people had touched it (My, = $11.72). Again, there was 
no significant interaction effect (p > .10). 

A two-way ANOVA with the disgust index as the 
dependent variable produced only two main effects (prox- 
imity to contact: F(1, 77) = 4.24, p < .05; Mmedium = 3.13, 
Mg; = 2.55; number of contact sources: F(1, 77) = 5.66, p < 
05; Mone = 2.52, Mmany = 3.19). To determine whether feel- 
ings of disgust led to changes in product evaluations, we 
conducted a mediation analysis. To test for mediation, we 
estimated a path analysis that included direct paths for the 
effects of proximity to contact and number of contact 
sources on product evaluations and indirect paths mediated 
by disgust. The results produced a significant indirect path, 
indicating that disgust fully mediated the effects of the inde- 
pendent variables on product evaluations (see Figure 4, 
Panel A). We also conducted a similar path analysis for 
WTP, and the results indicated that disgust fully mediated 
the impact of contagion cues on WTP (see Figure 4, Panel 
B). Finally, to test the causal relationship between evalua- 
tions and WTP, we conducted another path analysis. The 


FIGURE 4 
Study 3: Path Analysis 
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results indicated that evaluations mediated the impact of 
contamination cues on WTP for the contaminated product 
(see Figure 4, Panel C). 


Control group. We compared the control group with the 
number-of-contact-sources conditions in a series of one- 
way ANOVAs. The results were significant for all the key 
dependent variables (product evaluations: F(2, 98) = 15.19, 
p < 001; Moontrol = 5.39; WTP: FQ, 98) = 14.90, p < .001; 
Meontro! = $25.45; disgust: F(2, 98) = 16.00, p < .001; 
Meontro) = 1.31). Post hoc tests indicated that the control 
group differed significantly from the one-source and many- 
sources conditions for all the variables (product evaluations: 
control versus one [t(98) = 3.92, p < .001] and control ver- 
sus many [t(98) = 5.50, p < .001]; WTP: control versus one 
[t(98) = 3.76, p < .001] and control versus many [t(98) = 
5.46, p < .001]; disgust: control versus one [t(98) = 3.72, 
p < .001] and control versus many [t(98) = 5.66, p < .001)). 


Discussion of Study 3 


The findings of Study 3 provide additional evidence for our 
general proposition that perceived contact between a con- 
sumer and a product results in other consumers having less 
favorable evaluations of the touched product. Our results 
show support for H4, and SIT, indicating that when con- 
sumers believe that many people have touched a product, 
their evaluations and WTP for the product are significantly 
lower. Rather than signaling the popularity of a product as 
Hay predicted, multiple contact sources seemed to increase 
the level of contamination realized. Our findings are telling, 
given that the experimental procedure may have also sig- 
naled the popularity of the product (i.e., the item was “the 
only one left in stock"). The results for Study 3 also show 
consistent effects for proximity to contact such that partici- 
pants responded more negatively to a product that was situ- 
ated closer to a possible contact location. Finally, as in 
Studies 1 and 2, the results of Study 3 also show that the 
feelings of disgust that consumers experience in response to 
product contamination lead to the changes in product 
evaluations and WTP. 


Conclusions and Implications 


Recent marketing research has found that consumers like to 
use the sense of touch to learn about products (Mooy and 
Robben 2002; Peck and Childers 2003). However, the cur- 
rent research demonstrates that though consumers enjoy 
touching products themselves, they respond negatively 
when other consumers have had contact with the products. 
Using a retail shopping environment, the current research 
provides the first evidence for a theory of consumer con- 
tamination, in which consumers are believed to contaminate 
products simply by having contact with them. In general, 
we find that contamination cues that increase the salience of 
contact between another consumer and a product have a 
negative impact on consumer responses to the product. 


Theoretical Implications 


Contamination cues. Across three field experiments, we 
demonstrate that when proximity to contact is closer and 
when the number of contact sources is higher, consumer 


evaluations, purchase intentions, and WTP for a touched 
product decrease. Notably, the results for the impact of 
proximity to contact hold only when a short period of time 
has elapsed since the product was perceived as being 
touched. When time since contact is longer, the effects of 
consumer contamination appear to dissipate. 

We argue that these contamination cues increase the 
salience of contamination by effectively signaling to the 
consumer that the product has previously come in contact 
with another person or people. It is important to recognize 
that in our research, this contact was implied and not physi- 
cal in nature. As is shown by the null findings from our 
dirty index, although consumers were cognizant that no 
physical contamination was engendered to the product by 
the implied touching, they were still influenced by the con- 
tamination from the other consumer. Indeed, the lingering 
perceived essence of another seems to be enough to stigma- 
tize the product for the consumer. The contamination cues 
we identify appear to activate a contamination heuristic, 
which, though arguably misapplied, has important repercus- 
sions for consumer evaluations and intentions. 

Time and contamination. The identified temporary 
nature of consumer contamination emphasizes a key theo- 
retical finding that builds on previous work in this area 
(Hejmadi, Rozin, and Siegal 2004; Rozin et al. 1989). Prior 
research has examined cases of contamination in which 
people had extremely strong attitudes toward the source of 
contamination. These sources have been life threatening 
(e.g., poison) or have been people about whom others have 
strong, preexisting negative attitudes and associations (e.g., 
Adolf Hitler). After the source has come into contact with 
the object, these strong negative attitudes toward the source 
seem to transfer permanently and are then associated with 
the object (Hejmadi, Rozin, and Siegal 2004; Rozin et al. 
1989). In the current research, however, consumers have no 
preexisting attitudes toward the contamination source, 
because the source is either unfamiliar or completely 
unknown to consumers. Furthermore, consumers recognize 
that the contamination is not life threatening in nature. 
Thus, there is no permanent association to be transferred on 
contact, and any contamination effects that occur are indica- 
tive of a perceived transfer of essence. Our findings qualify 
previous thought by empirically showing that for the type of 
contamination investigated in the current research, an 
increase in temporal distance mitigates the strength of the 
contamination effect. 


Feelings of disgust. Consistent with previous research 
(Rozin, Haidt, and McCauley 1993; Rozin, Markwith, and 
Nemeroff 1992; Rozin, Millman, and Nemeroff 1986), we 
found that disgust operates readily in the domain of con- 
tamination in the consumption context. In particular, the 
results indicate that consumers lower their evaluations of 
touched products because they feel disgusted. by the con- 
tamination from other shoppers. The analyses demonstrat- 
ing full mediation of disgust on evaluations, purchase inten- 
tions, and WIP support this assertion. Importantly, we 
show discriminate validity for disgust by finding null 
effects for other negative emotions. The visceral reaction 
that disgust represents in this context points to the basic 
nature and strength that contamination effects exemplify. 


Managerial Implications 


Store design and merchandising. The consumer con- 
tamination effects demonstrated in each of our three studies 
underscore the importance of management decisions related 
to store layout and design. Although it is critical to enable 
consumers to touch merchandise (Mooy and Robben 2002; 
Peck and Childers 2003), consideration should also be 
given to limiting how and where consumers touch products. 
Effective store design can provide opportunities and con- 
straints with respect to consumer contamination. For exam- 
ple, instead of using product bins or table displays that 
encourage rummaging and searching, firms can effectively 
design shelving and rack displays to minimize consumer 
handling of products. Furthermore, a front-room/back-room 
layout that provides “fresh” products for trial can poten- 
tially mitigate consumers’ contamination perceptions. 
Retail environments in which consumers try on merchan- 
dise (e.g., clothing, footwear) need to be sensitive to con- 
sumers viewing others handling the products. As The Wash- 
ington Post (Pressler 2003) noted recently, effective design 
and management of dressing room areas is an understated 
factor in retailer success. Notably, the article indicates that 
the primary complaint of clothing shoppers 1s that other 
consumers tried on clothing that cluttered the dressing 
room. 

Appropriate merchandising and well-managed product 
displays are also central in effectively addressing contami- 
nation concerns. Cues that a product has previously been 
used or touched, such as ripped tags or disheveled displays, 
raise contamination fears and negatively affect consumer 
evaluations. Managers must make an effort to minimize the 
salience of contamination so that consumers are not moti- 
vated to think that a given product has been contaminated. 
Effective merchandising of products can be useful in allay- 
ing and preventing contamination effects from materializ- 
ing. Indeed, Underhill (2000, p. 179) notes that retailers 
“place the dark shades down where they can be reached eas- 
ily, and display the beiges and pale grays and off-whites up 
high, where they can be seen but not touched. If a sweater 
displayed on a table comes in several shades, you'll always 
find the lighter ones on the bottom and the dark on top, 
where they'll be rubbed and grubbed, but who will know?" 

Service personnel and communication. Our results also 
point to a strong need for proper employee training and ser- 
vice practices. Employees need to be cognizant of the issues 
surrounding consumer contamination and have the ability to 
address related consumer concerns. MAC cosmetics is an 
example of a company that has responded to this opportu- 
nity, turning contamination concerns into a competitive 
advantage in the industry. MAC employees are trained to 
wipe sample lipstick both before and after consumer trial 
and use disposable applicators (e.g., mascara, blush) for 
consumer promotion activities. Employees must be mindful 
to remove the evidence of consumer contamination as 
quickly as possible in a retail environment. Refolding, 
restocking, repackaging, and returning products quickly to 
shelf locations are examples of simple activities that can 
alleviate consumers'contamination concerns. Companies, 
such as Nordstrom and Banana Republic, that encourage 
their personnel to be vigilant in this regard communicate a 
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safe, comfortable, and individualized shopping environment 
in which sales staff are sensitive to specific consumer needs 
and concerns about contamination. 

In a broader sense, a company’s communication with 
consumers should also be sensitive to contamination. 
Advertising, promotions, and other communication efforts 
should be careful not to raise contamination concerns in 
their creatives. Communication that highlights the inci- 
dence of consumer contact and identifies who is touching 
products may result in contamination signals that alienate 
potential customers. 


Policy and procedures. Acceptance of the effects and 
implications of consumer contamination gives way to the 
effective integration of this issue into store systems and 
policies. Policies that are created specifically to respond to 
contamination outcomes provide direction and a means to 
manage the negative implications of this phenomenon. As 
we alluded to previously, effective policies and procedures 
that empower employees to recognize and address con- 
sumer contamination are an effective approach. The devel- 


opment of policies for more physical forms of contamina- - 


tion is also imperative for organizations. For example, Best 
Buy has adopted a classification of “Open Box Products,” 
which describes products that have been handled, opened, 
or have had the packaging disrupted. In this case, the prod- 
uct and warranty are intact, but the policy calls for an 
“Open Box” labeling of the product and a price discount. 
Return policies, warranties, and consumer trial all should be 
examined with an eye toward contamination effects. Recent 
trends in the retail environment of bulimic shopping and 
retail borrowing (Piron and Young 2000) make this a grow- 
ing concern. 


Limitations and Future Research Opportunities 


The limitations of our research are inherent to the experi- 
mental methodology and sample we used in conducting our 
studies. Each of our three studies used the same product 
(i.e., clothing) as the experimental stimuli, followed a simi- 
lar procedure, and was consistent in the retail environment. 
However, as we mentioned previously, because contamina- 
tion effects are expected to differ depending on the salience 
and intimacy of contact (Angyal 1941), the results reported 
herein may be limited only to product categories for which 
consumers have close, intimate contact. Indeed, it seems 
likely that contamination effects are moderated by the level 
of contact or intimacy that consumers have with a product 
while using it: The more contact a product has with a con- 
sumer’s body, the greater are the contamination effects. 
Prior anthropological work supports this notion, demon- 
strating stronger contamination effects for ingestion, the 
most intimate form of contact, than for mere interaction 
with an object (Appadurai 1981). This suggests that con- 
tamination effects in retail contexts could be most prevalent 
for food items, creating a true challenge for grocery retail- 
ers. Further research should test contamination effects 
across different product categories to examine these issues 
more closely. 

Another possible moderating factor of contamination 
effects is product packaging. In the studies we presented 
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herein, products were always free from packaging and com- 
pletely vulnerable to contact with consumers. Thus, it 
would be interesting to investigate whether product packag- 
ing can protect products from consumer contamination and 
whether the level of protection varies by the packaging 
material. Recent work that has examined contamination 
effects among products (Morales and Fitzsimons 2005) has 
found that clear packages facilitate contamination effects 
and opaque packages prevent them. Specifically, this 
research shows that when a consumable product is dis- 
played inside a clear, plastic package and is touching a dis- 
gusting product, the consumable product becomes contami- 
nated, but when the consumable product is displayed inside 
a solid, paper package, evaluations of the product remain 
constant. Whether these findings for packaging hold true for 
consumer contamination, however, remains unclear and is 
left for further research. 

Another area for extension is to investigate the potential 
for positive contamination effects in a consumer context. As 
we mentioned previously, research has found that when a 
contamination source has a positive relationship with the 
recipient (i.e., friend, lover), responses toward contamina- 
tion are more positive in nature than when a relationship 
does not exist. This result is consistent with young sports 
fans keeping a home-run ball that is hit by an admired base- 
ball player or David Beckham’s missed soccer ball selling 
for $16 million on eBay. However, unlike the cases we 
examined in the current work, these examples of positive 
contamination are primarily the result of a symbolic inter- 
action model based on associations rather than a residue 
model resulting from physical contact alone (Nemeroff and 
Rozin 1994). As such, for positive contamination effects to 
occur in consumer settings, the identity of the contamina- 
tion source needs to be known and have a strong, positive 
association with the consumer. In contrast to the negative 
contamination effects demonstrated here, simply knowing 
that a product has been touched by an anonymous source is 
not enough for positive contamination to arise. For positive 
effects, the contamination source should be as important as 
the contact. 

In addition to influencing whether positive or negative 
contamination effects will occur, the source of contamina- 
tion might also serve as a moderating factor in general. For 
example, salespeople are constantly arranging displays and 
moving around products. Do consumers view this touching 
as contamination? How do consumers integrate their rules 
for contamination across different “actors” (i.e., salespeople 
versus other shoppers) in the consumption context? Is 
touching by some consumers actually considered positive, 
whereas touching by other consumers considered extremely 
negative? Further research could attempt to understand how 
touching by anonymous versus different known sources fits 
into the developed theory of consumer contamination. 

Similarly, additional work is also needed to examine 
how the type of retail outlet influences consumer contami- 
nation effects. Consumers might have different ideas for the 
amount of contamination they would expect to find in an 
upscale store such as Nordstrom versus a discount center 
such as Wal-Mart. It may be that some retail outlets gener- 
ate strong contamination effects because physical contact 


among consumers and products is expected to be controlled 
and limited, whereas others do not because previous con- 
sumer contact with products is expected, as in a consign- 
ment store. Such speculation suggests that consumers may 
create rules (and exceptions) or frame information differ- 
ently (Rozin and Nemeroff 1990) depending on the con- 
tamination context. These and other contextual differences 
could be investigated in further research on contamination. 
Finally, in the current research, we consider the impact 
of three types of contamination cues: proximity to contact, 


time elapsed since contact, and number of contact sources. 
In the retail environment, there are likely several other types 
of contamination cues that may be prevalent. For example, 
physical clues (e.g., cut or handwritten price tags or marked 
products) related to contact may all have a role in the real- 
ization of contamination effects. Future investigation should 
broaden our research framework to provide additional 
insight into the antecedents and outcomes of consumer 
contamination. 
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Priya Raghubir & Eric A. Greenleaf 


Ratios in Proportion: What Should 
the Shape of the Package Be? 


Consumers’ reactions to rectangles have implications for package and product design. In two lab studies and an 
analysis of fleld data, the authors find that the ratio of the sides of a rectangular product or package can influence 
purchase intentions and preferences and is related to marketplace demand. In more exploratory inquiries, the 
authors also find that the impact of this ratio on consumers depends on the relative seriousness of the context in 
which the product is used. Furthermore, ratio can also affect product perceptions, and consumers appear to prefer 
a range of contiguous ratios for different contexts rather than a particular ratio. 


ne of the oldest controversies in aesthetics, extend- 
ing back to the ancient Greeks, is whether people 
prefer certain ratios for the sides of rectangles. 
Investigations into this question began in the 1870s with 
Fechner’s work and continue to the present day. The results 
of this research have been mixed. The research has not pro- 
duced definitive conclusions about whether people consis- 
tently prefer particular rectangular ratios (Benjafield 1985; 
Green 1995). 

Rectangular preferences have important implications for 
marketers because so many products and packages are rec- 
tangles. Recent research in the trade press suggests that a 
package’s shape is a critical way for a brand to differentiate 
itself because package design can affect consumers’ pur- 
chase decisions (Sherwood 1999). This has led many firms 
to focus on product and package shape, and small changes 
in package shapes can have a large influence on sales and 
profits (Prince 1994). Many of the “new age" beverages of 
the 1990s (e.g., Snapple, Arizona Iced Tea, Perrier) paid 
close attention to bottle design and are credited with help- 
ing spark a packaging renaissance (Miller 1994). With so 
much attention being given to the shape of products and 
packages, brand managers and designers who are responsi- 
ble for introducing new products or for creating new shapes 
and packages for existing products need guidance on how 
rectangular proportions can affect consumer purchase inten- 
tions, preferences, and demand. 

Marketers have proposed conceptual models of factors 
that may affect consumer reaction to product form (Bloch 
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1995). Bloch’s (1995) model proposes antecedents, media- 
tors, and moderators of the impact of aspects of package 
shape on consumer preferences, intentions, and choice. 
Although Bloch’s model is rich in insights into possible 
influences on consumers’ reactions to product form and 
how consumers process visual information, it is primarily a 
conceptual model. Research has also begun to examine how 
package shape affects the managerially important variables 
of volume perceptions and consumption (e.g., Wansink and 
Van Ittersum 2003). 

The current article examines one particular aspect of 
product design: We take the concept that people have differ- 
ent preferences for different rectangular ratios, an area that 
has received considerable attention in psychology and aes- 
thetics, and apply it empirically to the consumer realm to 
answer questions of interest to marketing managers and 
product designers who deal with rectangular products and 
packages. Our principal objective is to investigate whether 
the ratio of the sides of a rectangular product or package— 
in other words, rectangles of interest to marketers—can 
affect consumer purchase intentions and preferences and to 
examine how ratio is related to marketplace demand. An 
additional objective is to examine, on an exploratory basis, 
additional issues in this domain of interest to marketing 
managers, such as whether ratios affect product percep- 
tions, whether the impact of ratio on purchase intentions 
and preferences and the ratio’s relationship to market share 
depend on the context in which the rectangular product is 
used, and whether consumers tend to prefer a contiguous 
range of rectangular ratios for products and packages rather 
than one particular ratio. 


Prior Research on Preferences for 
Rectangles 


Aestheticians have long been interested in whether people 
find rectangles with certain ratios to be more pleasing than 
other ratios and how a rectangle’s aesthetic qualities might 
be related to the mathematical properties of its ratio (Alberti 
[1485] 1988; Huntley 1970; Lawlor 1982). This line of 
inquiry has also examined how often particular ratios 
appear in nature, such as in the structure of plants and ani- 
mals (Ghyka 1977; Lawlor 1982). 
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Much of this work has focused on the “golden ratio,” or 
the ® (phi) rectangle, which has a ratio of approximately 
1.618. The number ® has numerous special mathematical 
properties that have been studied by mathematicians such as 
Pythagoras, Euclid, and Fibonacci (Herz-Fischler 1987; 
Livio 2002). Some of these properties translate into visual 
properties (see Ghyka 1977; Huntley 1970; Livio 2002). 
The 6 is the only rectangle for which the ratio of the longer 
side to the sum of the sides equals the ratio of the short side 
to the longer side. If a square is removed from a ® rectan- 
gle, the remaining rectangle is still a ® rectangle. Repeating 
this process several times yields an attractive spiral such as 
that found in nautilus shells. 

Advocates of the golden ratio's aesthetic appeal point to 
its alleged use in major works of art and architecture, such 
as the building blocks of the Great Pyramids, the 
Parthenon's facade, and paintings by Georges Seurat (see 
Ghyka 1977; Huntley 1970), whereas others are cautious 
about such claims (Livio 2002). In Bloch's (1995) theoreti- 
cal review of the factors that affect “ideal product form,” he 
posits that the golden ratio may appeal to people because of 
overall "gestalt" qualities that supersede any one particular 
characteristic. Although Bloch acknowledges the differences 
just discussed, he cites the gestalt qualities in “enduringly 
popular architectural forms" (p. 22) and notes that “it could 
prove useful for managers involved with design decisions to 
be acquainted with Gestalt principles ... [and that] firms 
may find it useful to include these considerations" (p. 22). 

scientific investigations of whether people actually pre- 
fer certain rectangles began with Fechner's (1871, 1876; for 
a recent translation, see Fechner 1997) pioneering work, 
which found a preference for the ® rectangle. Other early 
researchers replicated Fechner's results (Lalo 1908; 
Thorndike 1917; for a review, see Green 1995). However, 
research supporting a preference for the ® rectangle, 
including Fechner’s, has been critiqued on methodological 
grounds (for reviews, see Green 1995; Plug 1980). Prefer- 
ences can be influenced by participant instructions 
(Hekkert, Lieke, and Van Wieringen 1994), preference mea- 
sures (McManus 1980; Ohta 1999), elicitation (Hige 1995, 
1997), the range of ratios used and the location of a ratio in 
that range (Benjafield and McFarlane 1997; Godkewitsch 
1974; Piehl 1976), whether rectangles have equal area 
(Benjafield 1976), use of average versus individual prefer- 
ences (McManus and Weatherby 1997; Plug 1980), and 
whether the rectangle is part of a more complex pattern 
(Boselie 1984). 

Our studies control for many of these methodological 
concerns. Furthermore, rather than clearly linking the stud- 
ies’ purposes to rectangular ratios, we use an indirect 
approach that is less likely to create demand artifacts 
(Green 1995; Hóge 1995, 1997; Plug 1980). Study 1 uses a 
between-subjects rather than a within-subjects design; to 
disguise the study's purpose, Study 2 participants make 
additional purchase intention ratings and rankings for 
design alternatives that do not involve rectangles; and Study 
3 uses actual marketplace sales. 
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Rectangles as Objects Used In a Particular 
Consumer Context 


Most studies of rectangular preferences have examined rec- 
tangles as abstract shapes, devoid of any particular meaning 
or use. This limits their application to real-world stimuli 
(Koneni 1997). A major limitation for marketing is that 
consumers are not concerned with buying abstract rectan- 
gles but rather with products and packages that have a rec- 
tangular shape. Marketing managers and product designers 
need to understand how consumers are influenced by the 
proportions of these rectangular products and packages and 
use this knowledge to guide their own decisions about pack- 
aging and product dimensions. Bloch’s (1995) conceptual 
model recognizes the importance of this issue and 
approaches the impact of product form in a consumer set- 
ting rather than treating form abstractly. 

Thus, there is a need for empirical research that exam- 
ines consumer reactions to rectangles as they are applied to 
products and packages, not just rectangles in the abstract 
sense. Because marketers’ concerns extend beyond prefer- 
ences to include consumers’ purchase intentions and 
demand, constructs that have little meaning when applied to 
abstract rectangles, we examine these latter issues. Thus, we 
investigate consumer reactions to rectangular products and 
packages, not abstract rectangles without context. Previous 
work in marketing has examined the distribution of the 
ratios of rectangular print advertisements and how the 
placement of headlines, pictures, text, and brand logo 
within an advertisement affects preferences for the adver- 
tisement (DeHeer and Nobbe 2000). However, we do not 
know of previous work that has examined how the rectan- 
gular ratios themselves affect purchase intentions, product 
preferences, and market share for products. 

The remainder of this article proceeds as follows: Study 
1 tests the hypothesis that in a consumer setting, the ratio of 
a rectangle can affect consumers’ purchase intentions and 
preferences for a particular product. On an exploratory 
basis, this study also tests whether ratio also affects certain 
consumer product perceptions. Study 2 replicates the test of 
this hypothesis for a different product category. On an 
exploratory basis, it also examines whether these purchase 
intentions depend on whether the context in which a prod- 
uct 1s used is relatively serious or relatively frivolous and 
whether higher purchase intent exists for a contiguous range 
of ratios rather than a particular ratio. Study 3 tests whether 
consumer demand for frequently purchased goods sold in 
rectangular boxes is related to rectangular ratio and whether 
the context in which a product is used moderates this 
relationship. 


study 1: Impact of Rectangular 
Ratio on Purchase Intentions, 
Preferences, and Perceptions 


The Impact of Rectangular Ratio on Purchase 
Intentions and Preferences 


Although the evidence that people prefer particular rectan- 
gles has been mixed, consistent throughout this research is 


the notion that preferences indeed differ across ratios; that 
is, participants in these studies do not favor all rectangles 
equally. Applied to the realm of consumer products, this 
suggests that rectangular ratio can affect consumer prefer- 
ences and purchase intentions because both constructs are 
related to demand. 


Hı: The ratio of a rectangle’s sides affects consumers’ (a) pur- 
chase intentions and (b) preferences for rectangular 
products. 


Design and Procedure 


Our sample consists of 108 undergraduate students (56 
females and 52 males) enrolled in an introductory market- 
ing course; they completed the experiment for partial course 
credit. Participants were told that the experiment was a 
study of musical tastes among students at their university. 
They were asked to assume that they had been mailed invi- 
tations with descriptions of two concerts (Concert A and 
Concert B). The words in the description of Concert A, 
titled “Music of Indonesia: Songs Before Dawn,” were 
printed in a square box that was six inches on each side (a 
1:1 ratio). We used this concert as a control to create a ref- 
erence point that people would compare with their judg- 
ments of a target concert (Concert B) and to control for 
individual differences in scale response. The target descrip- 
tion of Concert B, titled “Dream Songs: Sounds of the 
Malaysian Rainforest,” was printed in a horizontal (width > 
height, as opposed to a vertical rectangle [width < height]) 
rectangular box with the ratio of sides either 1.38 or 1.62 
(the golden ratio). Both boxes had the same area and con- 
tained the same description of Concert B (see Figure 1). We 
decided to include the golden ratio because this rectangle 
has been the subject of so much inquiry. 


Purchase intentions for CDs. To measure purchase 
intentions for a product closely related to the concert, we 
asked participants to rate their likelihood of buying a CD of 
the music for Concerts A and B on a seven-point scale (1 = 
"definitely not,” 7 = “definitely”’). 


Relative preference for concerts. To measure relative 
preference for attending Concert A versus Concert B, we 
asked participants to indicate which concert they would pre- 
fer to attend if the two concerts were held at the same time, 
on a six-point scale with intervals labeled “strongly,” “mod- 
erately," and “slightly” prefer Concert A or B. Higher num- 
bers reflect a preference for Concert B. 


Covariates and controls. Because purchase intention for 
the CD of Concert B is one of the dependent variables of 
interest and because ownership of CDs is also a useful indi- 
cator of experience with music, participants answered ques- 
tions about the number of CDs they owned for use as a 
potential covariate; the response categories were as follows: 
10 or fewer = 1, 11-50 = 2, 51-100 = 3, 101—200 = 4, and 
more than 200 = 5. To measure tendency to attend concerts, 
which might also be a useful covariate in concert prefer- 
ences, participants reported how many live music concerts 
they attended in an average year. Participants also reported 
their gender (female = 0, male = 1). To assess whether ratio 
affected a cognitive phenomenon—namely, depth of pro- 


cessing—after a filler task, we administered a surprise 
recall task that was followed by a five-item multiple- 
response “quiz” on the content of the information provided 
for Concert B. 


Results 


Purchase intentions. To examine whether the ratio of 
the target concert description affected purchase intentions 
for the CD of Concert B, we used an analysis of covariance 
(ANCOVA) with purchase intentions as the dependent 
variable. The independent variables were the two different 
ratios of the Concert B invitation and gender, both coded as 
a categorical variable, and the scale for the number of CDs 
owned. The adjusted mean purchase intention for the CD of 
Concert B when the description was in the 1.62 ratio (M = 
3.05) was greater than that for the 1.38 ratio (M = 2.22; 
F(1, 103) = 6.22, p < .05). Gender and CD ownership were 
not significant (Fs < 1.0). Thus, the results support the 
impact of ratio on purchase intentions (H,,). In this exam- 
ple, the 1.62 ratio created higher purchase intentions for the 
CD of Concert B than did the 1.38 ratio. 

We also examined whether the impact of ratio on pur- 
chase intentions depended on an interaction between ratio 
and gender by adding this interaction term to the ANCOVA. 
This interaction term was not significant (F < .01). 


Impact of rectangular ratio on consumer perceptions. 
Research on rectangular preferences in aesthetics and psy- 
chology has not usually examined attribute perceptions, 
because this work has tended to focus on abstract rectan- . 
gles, not on consumer products that can be broken down 
into attributes. Because product perceptions matter to mar- 
keters, we believe that it is worth investigating, on an 
exploratory basis, whether the proportions of a rectangle in 
the context of a product or package affect consumers' per- 
ceptions of that product. 

Although the aesthetic literature on rectangular prefer- 
ences has not formally tested hypotheses that connect par- 
ticular perceptions, this literature often discusses the impor- 
tance of perception in the creation of these preferences. 
Commentators mention that preferred rectangles promote 
perceptions of harmony, proportion, balance, rationality, 
and mathematical beauty (Ghyka 1977; Huntley 1970; 
Lawlor 1982; Livio 2002; Svensson 1977 [who uses a line 
partitioning task]). These perceptions are also relevant to 
many, though by no means all, consumer products. To 
investigate the potential impact of ratio on these percep- 
tions, we also included in Study 1 perceptual measures of 
the concerts that we believed were related to these aesthetic 
perceptions in the literature. We measured relative percep- 
tions of Concert B and Concert A on scales that included 
the items “harmonious,” “stable,” "beautiful," “exciting,” 
and "fast" (1 = "B much less than A,” 4 = "B about the 
same as A,” and 7 = "B much more than A"). We intended 
the last two to be reversed. We call this construct "har- 
mony” to indicate its source in the literature on rectangular 
preferences, which often mentions harmony. However, we 
do not claim that these perceptions encompass all percep- 
tions that may be relevant to rectangular products or pack- 
ages or that harmony is always the dominant perception. 
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FIGURE 1 
Stlmull for Study 1 


Control Concert A: Ratio = 1:1 


Music of Indonesia: Songs Before Dawn 
“Compelling ... Fascinating musical smorgasbord from 
this cultural crossroads.” 





In the Banyuwangi region, located in the eastern end of 
Java, a vibrant and earthy musical genre called 
gandrung is performed. This tradition begins sometime 
around 9 p.m. and ends just before dawn. An 
unmarried female singer performs a beautiful suite of 
songs backed by a small ensemble of musicians who 


play violins and drums. The male guests at this dance 
pay money for the privilege of dancing with this 
mesmerizing songstress. Reticent yet dynamic, this 
call-and-response singing is accompanied by 
compelling rhythms played on percussion instruments 
that express and control the performance of the singers 
and dancers. This performance is by one of the music's 
finest living singers, Gadrung Temu. A recent review in 
the SF Chronicle called this “An enjoyable example of 
thoroughly contemporary and unusual exotica.” 





Target Concert B: Ratlo = 1:1.38 


Dream Songs: Sounds of the Malaysian Rainforest 
“Rhythms that lure you irresistibly away to the land of dreams.” 














The Temiar people of the central Malaysian rainforest are deeply 
spiritual. Inspired by their vibrant natural environment, they have 
developed a musical landscape that evolves from their 
dreamsongs. Dreamsongs form the basis for community-wide 
singing and trance-dancing ceremonies, which celebrate 
marriages in Temiar society Sophisticated and delicate, this 
music contains cherished and exquisite compositions performed 
with an enchanting mix of xylophones, gongs, cymbals, fiddles, 
guitars, and breathtaking vocals. Instrumental and vocal music of 
the Temiar draws listeners into a realm of ornate tonal variations 
and textured rhythms. The performance features Kam 
Maitrasangi, a master of the Temiar dreamsong technique. A 
recent review in the SF Chronicle called this "A good 
introduction to this highly passionate and expressive artform." 
















Target Concert B: Ratio = 1:1.62 


Dream Songs: Sounds of the Malaysian Rainforest 
“Rhythms that lure you irresistibly away to the land of dreams.” 


The Temiar people of the central Malaysian rainforest are deeply 
spiritual. Inspired by their vibrant natural environment, they have 
developed a musical landscape that evolves from their dreamsongs. 
Dreamsongs form the basis for community-wide singing and 
trance-dancing ceremonies, which celebrate marriages in Temuar society. 
Sophisticated and delicate, this music contains cherished and exquisite 
compositions performed with an enchanting mix of xylophones, gongs, 
cymbals, fiddles, guitars, and breathtaking vocals. Instrumental and 
vocal music of the Temiar draws listeners into a realm of ornate tonal 
variations and textured rhythms. The performance features Kam 
Maitrasangi, a master of the Temiar dreamsong technique. A recent 
review in the SF Chronicle called this “A good introduction to this 
highly passionate and expressive artform." 
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Results for perceptual ratings. The five perceptual rat- . 


ings all loaded on a single principal component (largest 
eigenvalue = 2.58, second eigenvalue = .84); absolute load- 
ings ranged from .66 to .83. As we expected, “exciting” and 
“fast” loaded in the opposite direction of the other three 
perceptual scales. After reversing these two items, we aver- 
aged the five items to create a harmony index (à = .75); 
higher values indicate perceptions of greater harmony for 
Concert B than for A. 


Relationship between perceptions and ratio. To examine 
the impact of ratio on the perceptual measure, we per- 
formed an ANCOVA with the perceptual scale as the depen- 
dent variable. The independent variables were the two cate- 
gorical variables (ratio and gender) and CD ownership. The 
adjusted means show that participants rated Concert B as 
higher on the perception measure when the concert descrip- 
tion used the ratio 1.62 (M = 4.70) than when the concert 
description used the ratio 1.38 (M = 4.32; F(1, 100) = 3.89, 
p = .051). The coefficient for gender was not significant 
(F < 1), nor was CD ownership (F = 2.57, p = .11). Thus, 
we find initial support that rectangular ratio can influence 
product perceptions, at least for a set of perceptions that are 
related to those often mentioned in the literature on rectan- 
gular preferences. We also examined whether gender and 
ratio interacted to influence perceptions by adding this 
interaction to the ANCOVA. However, the interaction term 
was not significant (F = .01, p > .5).! 


Impact of ratio on relative preferences. We also exam- 
ined the impact of ratio on relative preferences to attend 
Concert À versus Concert B; we included the potential role 
of perceptions as a mediator in this relationship. We con- 
ducted an ANCOVA on the relative preference of the con- 
certs using the likelihood of attending the concert as the 
covariate and ratio and gender as the two categorical inde- 
pendent variables. We included attendance to control for 
individual differences in the propensity to attend live con- 
certs. The analysis also included the perception measure. As 
we discussed previously, this step determines whether per- 
ceptions are a potential mediator of tbe impact of ratio on 
relative concert preferences. 

Preference for Concert B versus Concert À was margin- 
ally greater when the description of the former was pre- 
sented in the ratio of 1.62 (adjusted M = 4.15) than when it 
was presented in the ratio of 1.38 (adjusted M = 3.68; 
F(1, 99) = 2.88, p = .093). Thus, the direction of preference 
is the same as that for purchase intentions, in support of 
Hs, though the evidence is not as strong as it is for CD pur- 
chase intentions (H,,). Gender, concert attendance, and the 
perceptual measure did not have a significant relationship to 
preference, suggesting that perceptions did not mediate the 


IWe examined whether perceptions mediated the relationship 
between ratio and purchase intentions for the Concert B CD. A 
condition for mediation is that the proposed mediating variable 
must have a significant relationship to the dependent variable 
when the initial independent variable is included 1n the analysis 
(Baron and Kenny 1986). However, including the perceptions in 
the previous ANCOVA did not yield a significant effect for per- 
ceptions (F(1, 99) = .13, p = .72). 


relationship between ratio and preferences. The interaction 
between gender and ratio was not significant (F(1, 99) = 
.23, p = .63). 


Impact of ratio on cognitions. There were no differences 
in the amount of recall of the concert information across 
conditions or in the number of questions that participants 
correctly answered in the surprise quiz task at the end of the 
procedure. Thus, we found no evidence for differential 
depth of processing in the two Concert B ratios. 


Discussion 


In summary, the results of Study 1 show that a difference in 
the ratio of the sides of a rectangular concert description 
can affect purchase intentions for the CD of music from the 
concert and, to a weaker extent, preferences. On an 
exploratory basis, we also found initial support that the ratio 
can affect product perceptions that are related to an index of 
perceptions discussed in the aesthetic literature on 
rectangles. 


Study 2: Purchase Intentions for 
Invitation Cards 

Study 2 has three purposes: First, we test H; in additional 
domains. Second, we examine Hia regarding purchase 
intentions, across a larger set of rectangular ratios than the 
two ratios used in Study 1. Third, on an exploratory basis, 
we examine whether the impact of rectangular ratios on 
purchase intentions depends on the relative seriousness ver- 
sus frivolity of the consumer context in which the product is 
used. We also examine whether consumers have higher pur- 
chase intentions when rectangles are within a contiguous 
range of ratios or when this 1s only a single ratio. 


Test of H, Across a Broader Set of Products and 
Rectangles 


Although the comparison of two ratios in Study 1 creates a 
good starting point, many studies of rectangular preferences 
have included a larger set of ratios. Because rectangular 
consumer products and packages are found in the market- 
place in a large range of ratios, we believe that it is useful to 
test H,, with more rectangles from a larger ratio range. 
Study 2 examines nine ratios, ranging from the square to the 
2:1 rectangle. Furthermore, in Study 2, consumers report 
purchase intentions for four different product variants of a 
rectangular product, along with their favorite and least 
favorite ratio. This method allows for within-person and 
across-person analysis of the impact of rectangular ratio, an 
approach recommended in prior research (Plug 1980). 


Context Effects In Consumer Reactions to 
Rectangles 


On an exploratory basis, Study 2 also examines whether 
consumer reactions to rectangular products and packages 
depend on the context in which the product is used.* Previ- 


2Our use of the term "context" differs from that in the work of 
Benjafield and McFarlane (1997), who use it to describe how pref- 
erences for the QD rectangle are affected by its location within the 
range of all rectangles that participants view. 


Ratios In Proportion / 99 


ous conceptual work on consumer reactions to product form 
(Bloch 1995) has proposed how context, such as situational 
factors and cultural and social context, may affect consumer 
reactions to product form. Thus, we propose a particular 
context that should be relevant for rectangular products and 
packages, namely, whether the product is used in a rela- 
tively more serious versus less serious context. We focus on 
this context effect for the following reasons: As we dis- 
cussed previously, many aesthetic theories on preferences 
for particular rectangles, as well as other shapes, have 
emphasized that certain rectangles have properties such as 
harmony, proportion, balance, rationality, and mathematical 
beauty (Ghyka 1977; Huntley 1970; Lawlor 1982; Livio 
2002). When applied to a consumer perspective, this ratio- 
nale suggests that consumers should prefer these rectangles 
when the product is used in a context that makes the con- 
sumer more sensitive to and appreciative of these particular 
properties. When consumers are in a relatively serious con- 
text, they are more likely to appreciate these qualities than 
when they are in a relatively less serious, frivolous context. 
Whereas many consumer contexts are serious, such as 
going to an art museum, others are more frivolous and may 
even be intended to reduce harmony, balance, rationality, 
and so forth. 

We propose that in these latter, less serious contexts, 
products and packages that use ratios with these mathemat- 
ical and aesthetic properties will not create higher purchase 
intentions than products and packages that use different 
ratios. We still expect purchase intentions to depend on ratio 
in relatively frivolous contexts, but we expect that this rela- 
tionship differs from that in relatively serious contexts. 
Because we do not hypothesize which ratios will create the 
highest purchase intentions in relatively less serious con- 
texts, we consider this investigation exploratory in nature. 
To pursue this line of inquiry, we include both relatively 
more serious and relatively less serious contexts in the 
product variants in Study 2. 


Design and Procedure 


The product of interest in Study 2 was custom-printed invi- 
tations for different events. The instructions read as follows: 
"Imagine that you are selecting invitations at the printer’s 
shop. The printer tells you that you have to make three dif- 
ferent decisions to design the invitations for each occasion.” 
A total of 122 undergraduate students who were enrolled in 
a marketing course completed the experiment for partial 
course credit. 

We manipulated rectangular ratio by asking participants 
to provide purchase intentions for rectangular invitation 
blanks for each of four occasions. The nine blanks varied in 
ratio from 1:1 to 2:1 in steps of approximately .13, which 
included the 1.62 (4D) rectangle. Although Study 2 was not 
intended to focus specifically on the ® ratio, we avoided 
placing the ® ratio in the middle of the range because this 
placement can affect preferences (Benjafield and McFar- 
lane 1997). 

We labeled the card blanks with the letters A-I ın the 
bottom right-hand corner using four different random 
orders; these labels were written to give half of the partici- 
pants horizontal invitations and half of the participants ver- 
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tical invitations. Although eye-tracking studies of how 
people view quadrangles have shown that scanning patterns 
do not vary consistently with quadrangle type, including 
differences in horizontal versus vertical orientation (Ohta 
1999), we believe that it is advisable to balance this orienta- 
tion. Card blanks were shuffled into random order before 
being placed in an envelope. Thus, there were 8 different 
types of envelopes used (4 order of letters x 2 
orientations) x 4 (order of occasions). To increase task real- 
ism and disguise the study's purpose, participants made.two 
other design decisions. The decisions were to choose enve- 
lope color (white, blue, green, or red) and the invitation's 
font (there were five different options). 


The invitation contexts. To examine whether ratio inter- 
acts with context to influence purchase intent, each partici- 
pant was asked to choose cards for four different contexts, 
two of which were relatively more serious and two of which 
were relatively less serious. In a pretest (n = 40), we evalu- 
ated relative seriousness using a seven-point scale, anchored 
by “not at all serious" and “very serious.” The occasions 
chosen to be relatively more serious, which were a piano 
recital of classical piano music (M = 4.91) and a business 
presentation for prospective customers for accounting soft- 
ware (M = 5.24), received higher seriousness ratings than 
the occasions chosen to be relatively less serious, which 
were a three-year-old child’s birthday party (M = 1.92) and 
an outing with adult friends to a three-ring circus (M = 
1.68); participants rated both relatively serious contexts as 
more serious than the relatively less serious contexts (p < 
.05). 


Measures of purchase intent. We measured purchase 
intent for each invitation blank for each of the four occa- 
sions in two ways: First, participants indicated the likeli- 
hood that they would choose each card option (A-I) on a 
six-point scale, anchored by “very likely” (6) and “very 
unlikely” (1). Because critiques of rectangular preference 
research find that rankings and ratings are complementary, 
participants also indicated which of the nine card blanks 
they were “most likely to choose” and “least likely to 
choose.” Rankings preserve interitem distances, whereas 
ratings are less sensitive to aggregation and scale usage 
effects (Green 1995; Plug 1980). 


Measure of relative seriousness of the four occasions. 
Because the relative seriousness of consumer occasions has 
not been measured as a construct, we could not use an 
established scale for this purpose. Accordingly, we gener- 
ated five items to measure this construct. After providing 
purchase intentions, participants rated each of the four 
occasions using five seven-point semantic differential 
scales, anchored by “frivolous/serious,” “orderly/disor- 
derly,” “unified/fragmented,” “volatile/stable,’ and “ratio- 
nal/emotional.” We reverse-coded the first and fourth varia- 
bles (frivolity and volatility) so that final item scores ranged 
from 1 (“relatively more serious”) to 7 (“Telatively less 
serious"). 


Results 


Manipulation check of context manipulation. A factor 
analysis for each of the four occasions confirmed that the 


five variables in the measure of relative seriousness loaded card as their first choice was contingent on the ratio of the 
onto a single factor for each occasion (eigenvalues = 2:28, cards (8) = = 73.56, 48.45, 30.15, and 68.63 for account- 
2.86, 2.23, and 2.34 for birthday, accounting software, cir- ing, piano, circus, and birthday, respectively; ps < .01). The 
cus, and piano concert, respectively). We averaged the five number of participants rating each card as their least 
variables to create an index of relative seriousness for each favorite was also related to ratio (y2(8) = 164.29, 118.13, 
of the four occasions. This measure displayed a reasonable 105.07, and 180.44 for accounting, piano, circus, and birth- 
level of reliability for a new scale (œ = .69, .79, .68, and .66 

for birthday, accounting software, circus, and piano concert, FIGURE 2 . 

respectively). A repeated measures 2 x 2 (context x repli- Study 1: Mean Purchase Intention Ratings for 
cate) analysis of variance (ANOVA) on this measure Cards, by Ratio and Context 

showed that participants rated the two occasions intended to 
be relatively more serious as more serious (accounting = 
1.65; piano concert = 2.56) than the occasions: intended to 
be relatively less serious (circus = 4.92; birthday party = 
5.07; main effect of context: F(1, 121) = 895.25, p < .001). 
The two occasions within each context domain differed 
from each other (main effect of replicate: F(1, 121) = 28.80, 
p < .001), and this difference was greater for the two rela- 
tively more serious occasions (interaction: F(1, 121) = 
70.04, p « .001). 


` Results for purchase intentions. To test Hia, we first 
analyzed the purchase intention ratings using a 9 x 2 x 2 
(ratio X context x replicate) within-subjects, repeated mea- 
sure ANOVA. As H,, predicted, intent ratings for cards dif- 1 31.13 1.26 1.38 1.5 1.62 1.74 1.87 2 
fered by ratio (main effect: F(8, 848) = 64.18, p < .001). Ratios of Sides of Invitation 
Mean purchase intention ratings lor each card and ratio 


appear in Figure 2. - | | 
The rankings data also support H,, (see Table 1). For ——e- Birthday —™- Circus -3k- Piano —A— Accounting Birthday —il- Circus -—-— Piano —Xx-— Accounting 


each occasion, the number of participants who ranked each 


»n = “very likely”) 


Mean Intent Rating 
3 


(1 = "very unlikely 





TABLE 1 
Study 1: Purchase Intentlons for Rectangular Cards by Ratio of Sides (Number of Participants Ranking a 
Card as First or Last Choice) : 


Relatively More Serlous Contexts Relatively Less Serlous Contexts 
Accounting Plano Total Circus Birthday Total 
Number of First-Ranked Cholces 
1:1 10 7 17 23 34 57 
1.13 8 8 16 17 18 35 
1.26 . 26 18 44 19 25 44 
1.38 30 32 62 22 14 36 
1.5 29 24 53 13 15 28 
1.62 10 13 23 7 4 11 
174 . 8 9 12 2 2 4 
1.87 CA 5 9 8 3 11 
2 4 8 12 12 8 20 
Number of Last-Ranked Choices 

1:1 31 28 59 13 7 20 
1.13 8 17 25 13 5 18 
1.26 4 2 6 9 8 17 
1.38 0 3 3 3 3 6 
1.5 3 4 7 6 5 11 
1 62 7 3 10 6 7 13 
1.74 4 8 12 12 11 23 
1 87 16 15 31 13 18 31 
2 51 44 95 48 59 107 


Notes Numbers in bold indicate a frequency that is significantly different across the two senousness contexts at p « 05. 


Ratios in Proportion / 101 


day, respectively; ps < .01). Thus, the purchase intention 
ratings and the first-choice and last-choice rankings all sup- 
port Hia 


Impact of context on purchase intentions. Next, we 
examined whether purchase intentions for the different rec- 
tangular cards depended on whether the card was being 
chosen for a relatively serious or frivolous occasion. In an 
ANOVA of the purchase intention ratings, the main effect 
of ratio was contingent on context (ratio x context interac- 
tion: F(8, 848) = 13.91, p < .001). The overall analysis also 
produced additional significant effects of context 
(F(1, 106) = 15.73, p < .001), a replicate x context interac- 
tion (F(1, 106) = 4.01, p < .05), and a significant three-way 
interaction (F(8, 848) = 8.02, p < .001). The interaction 
effects involving the replicate factor reflect that the two 
relatively more serious contexts differed more in their 
degree of rated seriousness than did the two relatively less 
serious contexts. The latter effect is not of concern to our 
predicted effects, and thus we do not analyze it further. 

The number of participants who ranked each ratio as 
their first choice was also contingent on the four occasions 
(overall ¥2(24) = 71.83,'p < .001), as was the proportion of 
participants who rankeg each ratio as their least favorite 
choice (overall x2(24) & 47.60, p < .001). Thus, across the 
three measures of purchase intentions, first choice, and 
worst choice, the results suggest that preferences for the dif- 
ferent ratios of the cards nded on the relative serious- 
ness of the occasion for which the invitation was being 
selected. "EDD 

Preference for a range of ratios rather than a particular 
rectangle. Prior research has often found that people prefer 
a contiguous range of rectangles rather than a particular 
ratio (Benjafield 1976; McManus 1980; Piehl 1976, 1978; 
Svensson 1977). If this finding applies in a consumer con- 
text, it could be important to marketers and brand managers 
because it implies that it i$ less important to use a particular 
ratio for rectangular products and packages than a ratio in 
the preferred range. In Study 2, the use of nine invitations 
with ratios extending from 1:1 to 2:1 makes it useful for an 
exploratory examination of whether participants had higher 
purchase intents within a range of ratios. We also examine 
each context separately because ‘the preferred range may 
differ for relatively more serious versus less serious 
occasions. a 

If consumefs ,prefez a range of ratios within the range 
we used in Study 2 for the two relatively serious contexts, 
we would expect an inverse U-shaped pattern for the pur- 
chase intent ratings. We checked for this pattern by regress- 
ing the intent ratings for the serious contexts on ratio and 
ratio? using ordinary least squares regression. The standard- 
ized beta coefficients support an inverse U shape. There is a 
positive coefficient for the main term and a negative one for 
the squared term (f, = 3.59, t = 17.39, p < .001; 
Bratio squared = —3.74, t = 18.12, p < .001), indicating that the 
relationship is not linear. Furthermore, the mean intent rat- 
ings in Figure 2 show that for both the accounting presenta- 
tion and the piano concert, the ratios of 1:26—1.62 have rat- 
ings that are fairly close, and these ratings fall off 
monotonically as the ratios become closer to 1:1 and 2:1. 
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The rankings data also show a similar pattern. In the rela- 
tively more serious contexts, ratios in the range of 1.26—1.5 
have a higher number of first choices and a lower number of 
last choices than rectangles outside this range. This range 
does not include the ® rectangle. 

For the two relatively less serious contexts, the regres- 
sion of intention ratings on ratio and ratio? also suggests 
that the relationship is not linear (Bano = .59, t = 2.70, p < 
.01; Bono squared = —9 7, t = —4.39, p < .001). However, the 
mean intent ratings for both the child's birthday party and 
the circus trip do not exhibit an inverse U shape in Figure 2. 
Instead, the nonlinearity is driven by the combination of a 
plateau in the range of 1:1—1.5:1, with ratings decreasing 
for larger, more elongated ratios. Thus, there is evidence of 
a preferred and less-preferred range but not of an inverse U 
shape. Furthermore, the range of 1—1.5 has a higher number 
of first rankings than do higher ratios, but the data for the 
number of last rankings are less consistent. Thus, although 
we caution that these investigations are exploratory in 
nature, Study 2 provides preliminary evidence for a favored 
range rather than a particular favored rectangle for both the 
relatively more serious and the relatively frivolous contexts. 
Furthermore, these ranges and the shape of the relationships 
appear to differ across contexts. 


Discussion 


Study 2 provides additional support for H,,; that is, in a 
consumer context, purchase intentions for rectangular con- 
sumer products depend on the ratio of the rectangle. 
Exploratory investigations also suggest that these intentions 
depend on tbe context of the relative seriousness of the con- 
sumer occasion for which the product is used and that pref- 
erences exist for a range of ratios rather than for a particular 
ratio, especially in relatively serious contexts. 


Study 3: Frequently Purchased 
Grocery Products 


Studies 1 and 2 found that the ratio of rectangular products 
and packages affects consumers' preferences and purchase 
intentions. A managerial implication of this finding is that 
ratio should also affect actual marketplace demand for rec- 
tangular products and packages. Whereas Studies 1 and 2 
focused on purchase intentions and preferences, Study 3 
examines actual marketplace demand for frequently pur- 
chased consumer goods sold in rectangular boxes. Given 
this focus on market demand, we test the following 
hypothesis: 

H5: Marketplace demand for products sold in a rectangular 

box is related to the ratio of the sides of the box. 


The exploratory results of Study 2 supported the notion 
that the relationship between purchase intention ratings and 
rectangular ratio was different for relatively more serious 
versus less serious contexts. Accordingly, we expect that the 
impact of ratio on marketplace demand also depends on this 
context. Thus, Study 3 also tests the following hypothesis: 


H3: The direction of the relationship between marketplace 
demand, as measured by market share, and the ratio of the 


dimensions of rectangular boxes is different for products 
used in relatively more serious versus relatively less seri- 
ous contexts. 


Product Categories 


To test H, and H}, we chose four categories of products that 
are sold in supermarkets in rectangular boxes. To ensure 
comparability across categories, we chose categories in 
which (1) consumers see the package as a two-dimensional 
rectangle on the shelf but see three dimensions when 
inspecting the package off the shelf, (2) package materials 
are similar across brands (e.g., cardboard), (3) there is 
within-category variation in the ratio of the rectangles, and 
(4) there is an external criterion to measure marketplace 
demand. Following the method used in Study 2, we 
pretested four categories that met these four criteria to 
ensure that two of them were perceived as relatively more 
serious and that two were perceived as relatively less seri- 
ous. A pretest (n = 38) showed that soaps and detergents 
were rated as more serious (M = 4.15 and M = 4.41, respec- 
tively, on a seven-point scale anchored by “not at all” [1] 
and “very serious” [7]) than cereal and cookies (M = 3.26 
and M = 2.93, ps < .05). 


Products 


We identified the top eight to ten brands in terms of market 
shares in each category using market share data from the 
Market Share Reporter (Thomson Learning 2002, pp. 273— 
301, 696). The average market share in the two relatively 
more frivolous categories was 3.53%, whereas the average 
for the two relatively less frivolous categories was 9.08%. 
We recorded the dimensions of the height, width, and length 
of the box for each of these brands and calculated the fol- 
lowing three ratios for each brand: 


1. Longest dimension/median dimension (max/mid), 
2. Longest dimension/shortest dimension (max/min), and 
3. Median dimension/shortest dimension (mid/min). 


Note that the Ratio 3 is the inverse of Ratio 1/Ratio 2. Illus- 
trations of some of the brands considered and the variation 
in their shelf facings appear in Figure 3. 


Results 


Relationship between ratio and market share. For each 
of the two sets of product categories—cookies and cereals 
versus soaps and detergents—a separate ordinary least 
squares regression examined the relationship between the 
dependent variable of market share and the independent 
variables of the three ratios of the dimensions. Because 
market shares are percentages, we transformed them into 
log(share/[1 — share]). The results appear in Table 2 for the 
two sets of categories. 

The regression results were significant for both cate- 
gories (Fs = 5.12 and 8.19 for less and more serious cate- 
gories, respectively; ps < .05) and explained a high degree 
of variation (R2 = .490 and .654 for relatively less and more 
serious categories, respectively). This supports the relation- 
ship between package ratio and market share (H3). 


— 


However, the direction of the coefficients differed 
across the two categories, in support of Ha. For the rela- 
tively more serious categories, the parameter estimates for 
max/mid and mid/min were positive (B = 63.55 and f = 
54.05 for max/mid and mid/min, respectively; t 2 4.00 and 
t = 3.80, respectively, ps « .01). In the less serious cate- 
gories, the coefficient pattern was reversed: The coefficients 
for max/mid and mid/min were negative (D = —1.23 and f = 
—1.31 for max/mid and mid/min, respectively; t = —2.81 and 
t = —2.82, ps « .01), suggesting that for these categories, 
higher market shares were associated with lower ratios (i.e., 
closer to a square). Although the three parameter estimates 
for the ratios are all significant for both types of product 
categories, the signs for the relatively more serious cate- 
gories are always opposite those for the relatively less seri- 
ous categories. 


Discussion 


Overall, these results suggest that in the realm of frequently 
purchased supermarket products, the ratios of the dimen- 
sions of a product’s rectangular box are related to its market 
share. However, because of the nature of the marketplace 
data, we do not claim to have identified a direction of 
causality; that is, H5 and H4 may be supported because 
"better" products are more appropriately aesthetically 
designed, but it may be that appropriately aesthetic ratios 
translate into demand in the marketplace. Regardless of the 
direction of causality, these results attest to the importance 
of considering product form in marketing strategy, as Bloch 
(1995) suggests. 


General Discussion 


Our studies found that the ratio of the sides of rectangular 
products and packages affects consumer purchase intentions 
and preferences and that these ratios are related to actual 
marketplace demand for frequently purchased consumer 
goods. Whereas prior work on rectangular preferences in 
psychology and aesthetics has typically examined rectan- 
gles as abstract entities, we examined preferences and pur- 
chase intentions in the specific domain of products and 
packages, which is of interest to marketing managers and 
product designers. Our results suggest that the proportions 
of a rectangular product or package can have a subtle yet 
important effect on consumers. On a more exploratory 
basis, we also found that rectangular ratio can affect con- 
sumer product perceptions and that the impact of ratio on 
purchase intentions and demand can depend on the relative 
seriousness of the context in which a product is used. Fur- 
thermore, consumers appear to favor a contiguous range of 
rectangles rather than a particular rectangle. However, we 
caution that before these results can be generalized, a 
broader set of rectangular products and packages must be 
tested. 

Notably, many of the products we examined in the mar- 
ketplace had ratios that were not highly favored in either 
our laboratory or our field studies. Thus, our findings, far 
from being conventional wisdom among marketing man- 
agers and product designers, may be able to inform the 
decisions of product designers and brand managers and add 
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TABLE 2 
Study 3: Regression Predicting Market Shares of Frequently Purchased Grocery Products (Relatively 
Less Serlous [Cereal and Cookies] Versus More Serious [Soaps and Detergents]) 





Cereal and Cookles 


Soaps and Detergents 








R2 .654 

Number of observations 17 

Overall F 5.12 (p = .013) 8.19 (p = .003) 
Coefficient t-Value Coefflclent t-Value 

Varlable Estimate (Probabllity of t) Estimate (Probabllity of t) 

Intercept -2.19 —1.96 (.068) -81.35 —4.04 (.0014) 

Max/mid —1.23 —2.81 (.013) 63.55 4.00 (.0015) 

Max/min .998 3.76 (.0017) —44.11 —3.86 (.0020) 

Mid/min —1.31 —2.82 (.0122) 54.05 3.80 (.0022) 





to the academic literature on product packaging (Folkes and 
Matta 2004; Krider, Raghubir, and Krishna 2001; Raghubir 
and Krishna 1999; Wansink and Van Ittersum 2003; Yang 
and Raghubir 2005). A brief survey of store brands showed 
that many private labels use a ratio that we found to be less 
liked. Whether this is done intentionally is an interesting 
area for research.? We hope that by demonstrating the rele- 
vance of rectangular preferences to the consumer domain, 
our findings will encourage researchers to explore the aes- 
thetic appeal of other product and packaging variables and 
their impact on brand image, including, but not limited to, 
proportion, shape, color, font, relative salience, and other 
visual characteristics. 

Our studies do not examine whether consumers' reac- 
tions to rectangular products and packages vary with culture 
or nationality. It is possible that consumers' environments, 
including the range of products and package shapes in the 
marketplace, the architectural norms surrounding them, the 
media to which they are exposed, and other cultural differ- 
ences across countries and people, affect their reactions to 
package shapes. Cultural factors may also cause the impact 
of relative seriousness of context to differ across people. 
These issues are particularly important for international 
brand managers who are deciding whether to use a local or 
global international marketing strategy and thus are a poten- 
tially fruitful area for future research. 

We examined how the shape of a rectangle affects prod- 
uct perceptions. Further research could examine additional 
product perceptions. A managerial implication of this find- 
ing is that companies may be able to change product per- 
ceptions by changing the shape of a product or package or 
by making shape more salient. It would also be interesting 


3Safeway, a leading grocery chain on the West Coast of the 
United States, offers crisped rice bars 1n a box with dimensions (in 
inches) of 24 x 12 x 6 (shelf face ratio of box = 2), unsalted top 
crackers in a package with dimensions of 25.25 x 11.25 x 1I 
(ratio = 2.24), and sugar cones in a box with dimensions 25 x 12 x 
6 (ratio = 2.08). Compare this with box dimensions of leading 
national brands: For example, Betty Crocker's pancake mix has 
dimensions of 23.5 x 16.5 x 6 (ratio 21.42), and Orville Reden- 
bacher’s popcorn has dimensions of 16 x 11.5 x 5.5 (ratio = 1.39). 


to examine whether product and package shape affects the 
sensory aspects (e.g., taste) or cognitive aspects (e.g., will- 
ingness to pay) of a product. Furthermore, Study 1 exam- 
ined relatively serious musical concerts and the CDs for 
them. Additional research might examine whether the 
results replicate or reverse if the context is less serious, such 
as Cajun restaurants that encourage a fun atmosphere. 

Our findings for context may help explain why prior 
literature on rectangular preferences has been inconclusive. 
As we noted previously, this research has rarely stated a 
context. Although participants are likely to invoke the rela- 
tively serious context of a lab experiment if provided with 
no other context, variations in a lab's appearance, experi- 
menter demeanor, and instruction wording might all affect 
this context factor. Furthermore, of their own accord, par- 
ticipants often imagine in experiments that rectangles are 
particular objects, and there can be great variety in these 
objects (Ohta 1999). This could create variations in the con- 
text in which these imagined objects are used, adding an 
uncontroled element to studies of preferences for 
rectangles. 

Further research might examine whether the effects 
studied herein are subliminal versus conscious (and if this 
varies across effects) and whether they are best studied with 
indirect versus direct methods of elicitation of purchase 
intentions, preferences, or choices. It is plausible that sub- 
conscious appeals, such as aesthetic beauty, are more likely 
to manifest in indirect tasks, whereas more conscious and 
cognitive processes, such as volume estimation, may be bet- 
ter measured with direct tasks. Such research would be of 
interest given the growing interest in marketing on noncon- 
scious processes (for a review, see Fitzsimons et al. 2002). 

Our results examined how a single context, relatively 
more versus less serious, affects consumer reactions to rec- 
tangular packages and products. Although this context is 
rooted in the reasons proposed for consumers’ preferences 
for certain rectangles, other contexts, such as whether the 
product is used for utilitarian or hedonic purposes, may also 
affect these reactions (e.g., Chandon, Wansink and Laurent 
[2000] show that a sales promotion’s efficacy depends on 
the product type). Furthermore, additional research is 
needed to explore the construct validity of the serious/frivo- 
lous distinction to determine how it applies to a broader 
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range of consumer products. Furthermore, we did not exam- 
ine contexts that may affect how carefully or accurately 
consumers process information regarding rectangular 
shapes. This processing could be affected by color, other 
shapes near the rectangles, or even the rectangle’s material. 

Individual differences in aesthetic preferences can have 
an important impact on aesthetic responses to product 
design and shape (Bloch 1995; Holbrook 1986). Further- 
more, Bloch, Brunel, and Arnold (2003) find empirical sup- 
port that people differ in how important visual product aes- 
thetics are to their “relationship with products” (p. 551); 


they developed a scale of the centrality of visual product 
aesthetics to measure this tendency, based on the dimen- 
sions of “value, acumen, and response intensity” (p. 551). 
Thus, an important area for further research is to examine 
how such individual differences may affect consumer 
responses to rectangular products and packages. Although 
gender did not have a significant impact in Study 1, it may 
affect the impact of rectangular ratio in other product cate- 
gories. Other demographic variables also deserve study; for 
example, age may moderate rectangular consumer prefer- 
ences if people become more serious over time. 
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Stuck in the Past: Why Managers 
Persist with New Product Failures 


In this research, the authors examine the phenomenon of escalation bias in the context of managing new product 
introductions. In particular, they identify three general paths—Decision Involvement Inertia, Decision Involvement 
Distortion, and Belief Inertia Distortion—that can lead managers to escalate their commitments. The authors test 
the relative strength of these paths in driving observed escalation behavior. The results show that involvement with 
the initial decision, a key construct in numerous explanations for escalation behavior (e.g., agency theory, self- 
justification), is not a necessary condition to induce commitment to a losing course of action (i.e., escalation bias). 
Rather, the authors find that the driving force behind escalation behavior is improper use of initial positive beliefs 
in the face of negative new information. This insight has implications for the groundwork necessary for organiza- 
tions to design systems, policies, and procedures to help them avoid the trap of escalation bias that is often asso- 


ciated with major strategic decisions. 


arketing managers are often asked to make major 
decisions, even though the information available to 
them at the time of the decision is incomplete and 
uncertain. Typical examples of such decision situations are 
launching a new product, initiating an everyday-low-pricing 
or a “no-promotions” strategy, opening a new channel of 
distribution, and initiating a major new media campaign. 
Given that the decision environment is often highly uncer- 
tain and the financial stakes large, it is not surprising that 
firms often revisit such decisions after they obtain new 
information to determine whether the firm should continue 
with, modify, or terminate the initially chosen course of 
action. | 

It is well documented that when managers who are pub- 
licly committed to a course of action are asked to reevaluate 
that action, they tend to remain committed to it even when 
the new information indicates that the action should be ter- 
minated. See, for example, studies of the Vietnam War and 
Desert Storm (Lipshitz 1991), Expo 86 (Ross and Staw 
1986), the Apollo moon missions (Mitroff 1974), the 
Campeau—Federated merger, the coffee wars between Philip 
Morris and Procter & Gamble (Bazerman and Neale1992), 
the National Basketball Association draft (Staw and Hoang 
1995), information technology projects (Keil 1995; Keil, 
Mann, and Rai 2000), and a plethora of experimental inves- 
ügations (Arkes and Blumer 1985; Boulding, Morgan, and 
Staelin 1997; Brockner and Rubin 1985; Staw 1976). This 
stylized phenomenon, often referred to as “escalation of 
commitment," can be disastrous for firms, especially in 
today's hypercompetitive markets, which require fast and 
accurate reactions and adaptability on the part of companies. 
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Decisions about new products seem particularly suscep- 
tible to the problem of escalation of commitment. Schmidt 
and Calantone (1998) show that innovative projects tend to 
generate a high leve] of commitment from managers, often 
leading to ongoing investment in failing projects. Industry 
practitioners echo this sentiment. The director of the New 
Products Showcase and Learning Center stated, “It some- 
times takes more courage to kill a product that's going 
nowhere than to sustain it" (Lukas 1998, p. 44). Still, killing 
a new product can be the best course of action, as suggested 
by comments from a former chief executive officer of East- 
man Kodak when speaking to participants in a senior exec- 
utive education program. He indicated that he considered a 
new product team a "success" when members appropriately 
decided to stop investing in a new product idea and labeled 
the project "the world's fastest failure." 

We use the new product introduction setting to explore 
the reasons managers tend to stay with losing courses of 
action and thus forgo the best course of action. We begin by 
developing a model of how managers combine specific 
information about the potential product and their prior busi- 
ness experience to decide whether to launch a new product 
and, if they choose to launch the product, how they decide 
whether to continue with the venture after receiving nega- 
tive new information. We then estimate the parameters of 
this model using data that we obtain from an experiment in 
which managers are asked to make an initial product-launch 
decision and a subsequent reevaluation decision. Using the 
resulting estimates, we provide empirical support for and 
against numerous previous explanations of escalation of 
commitment. In turn, this enables us to address the impor- 
tant question of how firms can reduce managers' tendency 
to stay with losing courses of action. 


Paths to Escalation 
To make the preceding discussion more specific, imagine 
that a manager is provided with a detailed analysis of the 
potential viability of a new product. This analysis contains 
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information about specific aspects of the market—the dis- 
tribution of possible initial market shares, industry growth 
rates, likely responses by competitors, and summary finan- 
cial data, such as the anticipated net present values (NPVs) 
of the venture. The manager uses all of this information to 
form beliefs about what is likely to happen and, on the basis 
of these beliefs, comes to an initial decision about whether 
to launch the new product. 

Assume that the manager in our example decides to 
introduce the product. Moreover, assume that this manager 
is assigned to lead the product launch so that he or she is 
publicly connected to the initial launch decision. Two years 
later, the firm decides to reevaluate the viability of the prod- 
uct. The analyst again provides the manager with informa- 
tion, this time based on new data collected since the launch 
decision. Among other things, the new information contains 
updated figures for realized market share, new estimates of 
industry growth rates, and newly anticipated competitive 
responses. This information is also summarized in terms of 
the anticipated distribution of NPVs, assuming that the firm 
continues with the product. In addition, the analyst provides 
information about the expected NPV of exiting the market 
and using the assets associated with the venture in another 
application. Finally, assume that the “normative” decision 
(i.e., the decision based on the expected outcomes associ- 
ated with sticking with the product relative to the expected 
alternative use of funds) is to pull the plug on the product 
and reinvest the remaining assets. When the manager is 
asked either to stay the course or to reinvest the assets in the 
new application, he or she decides to stay the course, 
demonstrating escalation of commitment. 

Why might escalation bias occur? The extant literature 
can be organized around three general paths that lead to this 
occurrence. The first path implicitly or explicitly assumes 
that because the decision maker was publicly involved with 
the first decision, this involvement directly influences the 
second decision. Returning to our example, this means that 
the manager continues with the product launch despite 
being fully aware of the negative new information because 
of his or her involvement with the initial decision. This may 
happen because the manager fears “losing face" if the initial 
decision is changed (Brockner, Rubin, and Lang 1981) or 
because of the effort required to justify the change (Fox and 
Staw 1979). Another possibility is that the manager uses the 
NPV assessment associated with the first decision as a ref- 
erence point and consequently perceives a lower assessment 
as a loss. Prospect theory posits that people tend to become 
more risk seeking in the domain of losses (Kahneman and 
Tversky 1979). Framing the issue as a loss may lead the 
manager to accept more risks than normal and therefore 
stick with the initial launch decision (Arkes and Blumer 
1985; Bazerman 1984; Whyte 1986). In all of these expla- 
nations, the manager's involvement with the initial decision 
leads to inertia and reluctance to change, even though the 
negative new information may be correctly perceived. Thus, 
we refer to this path as “Decision Involvement Inertia.” 

The manager's involvement with the initial decision 
again plays a critical role for the second major escalation 
path. Here, involvement with the first decision does not 
affect the second decision directly; instead, it improperly 


influences the belief structure underlying the second deci- 
sion (Bacharach, Bamberger, and Mundell 1995; Conlon 
and Garland 1993; Festinger 1957; Samuelson and Zeck- 
hauser 1988; Silver and Mitchell 1990; Wicklund and 
Brehm 1976). The manager, who was involved with the first 
decision, misinterprets the new information so that it is in 
concert with his or her initial decision, resulting in biased 
beliefs that, in turn, lead to the decision to stay the course. 
Therefore, we call this path “Decision Involvement Distor- 
tion" because the manager's involvement with the initial 
decision leads to a distortion in the perception of the new 
information. 

The third path also concerns biased perceptions of the 
new information. However, in this case, the bias comes 
from the person distorting and/or improperly weighting the 
new information to conform to his or her initial beliefs, not 
from his or her involvement with the initial decision. Thus, 
returning to our example, this mechanism suggests that it is 
not necessary for the manager to have made the initial prod- 
uct introduction decision to exhibit commitment escalation 
subsequently. Rather, it is necessary only that the manager 
has formed positive beliefs about the viability of the initial 
product launch, even with no public involvement in the 
launch decision, to induce escalation. In other words, this 
bias does not occur because the manager was involved in 
the initial decision but rather because he or she does not 
sufficiently adjust prior beliefs and thus chooses to continue 
the original course of action. Note that the mechanism for 
these biased beliefs is similar to that associated with a con- 
firmatory bias or sticky prior beliefs (Bolton 2003; Jonas et 
al. 2001; Klayman 1995; Schwenk 1986). Thus, we refer to 
this path of bias as “Belief Inertia Distortion.” 

In summary, the literature on escalation can be orga- 
nized around three basic mechanisms (paths) that lead to 
escalation of commitment. We summarize these mecha- 
nisms in Figure 1. Two of the mechanisms rely on the 
maker of the second decision being involved in the first 
decision. The third relies only on the decision maker ini- 
tially having positive beliefs about the factors that led to the 
first decision. All three mechanisms can cause managers to 
exhibit escalation behavior, but they differ with respect to 
the process that causes the behavior. Moreover, they differ 
in terms of their implications regarding how to prevent 
escalation of commitment. 

Determining which paths are operative is not an easy 
task. All the paths involve complex relationships between 
second-stage outcomes and first-stage explanatory varia- 
bles. It is not enough to demonstrate simple correlations 
between first- and second-stage variables. For example, to 
test Decision Involvement Inertia, it is necessary to show 
that involvement with the initial decision influences the sec- 
ond decision, after we control for the effects of the updated 
second-stage belief structure. Likewise, testing Decision 
Involvement Distortion requires us to show that the man- 
ager's updated belief structure affects the continue/stop 
decision and then that involvement with the initial go/no-go 
decision affects the manager’s updated belief structure, after 
we control for the possible normative and nonnormative 
effects of prior beliefs on updated ones. Finally, to test 
Belief Inertia Distortion, we must show that the manager’s 
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FIGURE 1 
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updated belief structure affects the continue/stop decision 
and that the manager's prior beliefs have a nonnormative 
effect on his or her updated beliefs beyond any predicted 
normative effect on the updated beliefs, independent of any 
involvement with the initial decision. 

The preceding discussion makes it clear that if we want 
to disentangle the potential influences of the first decision 
from the influence of prior beliefs, we must explicitly 
model and measure the belief-updating process. In Heath's 
(1995) review of the literature, he expresses concern that 
most escalation studies fail to control for initial beliefs. 
Consequently, we take extra care to address this issue. 

Given the complexities associated with disentangling 
our effects of interest, we chose to conduct an experiment in 
which we manipulate the manager's involvement with the 
initial decision, the level of the manager's initial (prior) 
beliefs, and the degree of (lack of) market success subse- 
quent to the initial product launch. We then measure initial 
and updated beliefs and observe the second decision. This 
enables us to estimate the parameters of our belief-updating 
model and, on the basis of those parameters, to assess the 
impact of each of the three potential paths on escalation of 
commitment. Formally, we test the following three 
hypotheses: 


H; (Decision Involvement Inertia): When the decision maker's 
updated belief structure is held fixed, being highly 
involved with the decision to choose an initial course of 
action has a direct, positive effect on the next decision, 
thus producing escalation behavior. 

H; (Decision Involvement Distortion): Being highly involved 
with the decision to choose an initial course of action 
biases the decision maker's updated belief structure in the 
direction of supporting the initial course of action. In turn, 
this biased updating process increases the likelihood that 
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the decision maker will stay with the initial course of 
action. 

H; (Belief Inertia Distortion): Independent of any involvement 
with the 1nitial decision, a decision maker's initial positive 
belief biases his or her updated belief structure in the 
direction of the initial belief structure, thus producing 
escalation behavior. 


Note that these three hypotheses are not mutually exclu- 
sive and therefore can operate simultaneously. Furthermore, 
any empirical tests of these hypotheses must distinguish 
between normative and nonnormative updating of beliefs. 
Consequently, in the next section, we describe a model for 
belief updating and, in the process, describe how we can 
distinguish between normative and nonnormative updating 
effects in the model. 


Belief-Updating Model 


Our model of the belief-updating process acknowledges 
that a marketing manager normally receives different and, 
perhaps, conflicting pieces of information about many 
aspects of the environment. We assume that the manager 
integrates this information into an overall assessment that 
guides a decision. Therefore, we model not only the man- 
ager's overall assessment and updating process but also the 
process of updating beliefs about specific underlying 
aspects of the environment that lead to the overall 
assessment. 

We begin with the conceptualization of managerial 
decision making that Boulding and colleagues (1994) pro- 
pose. Their model assumes that all beliefs can be repre- 
sented by probability distributions in which the mean of the 
distribution is the person's belief about the most likely 
occurrence and the variance of the distribution captures the 
person's uncertainty about the particular belief. Boulding 
and colleagues posit two different types of underlying 
beliefs and one summary belief. The underlying beliefs 
refer to the specific levels of the market factors that influ- 
ence performance of the product (also referred to as “what” 
knowledge) and beliefs about the market structure (i.e., how 
each market-factor belief is weighted; also referred to as 
"how" knowledge). This conceptualization has the manager 
combine the two components (factor levels and weights) to 
form "summary" knowledge, that 1s, a summary belief or 
overall evaluation. For example, for a new product decision, 
market-factor beliefs would include all the factors that are 
believed to have an influence on the project's overall suc- 
cess. These market factors could include items such as the 
obtained market share, the price of the new product offer- 
ing, market growth, competitors’ responses, and the cost of 
capital. The manager then uses knowledge of the relevant 
market structure to establish weights for how each market- 
factor belief affects the project's overall success, which is 
captured by the manager's summary belief. Boulding and 
colleagues assume that it is this summary belief that ulti- 
mately determines the manager's decision. 

We formalize this process as follows: Let SB, be the 
mean of manager i's summary belief about the course of 
action at time t, and let FB,, be the mean of the manager's 
belief about the level of the kth market factor that affects 


overall market performance. Let g;,(-) be the manager’s 
mental mapping of how these market factors combine to 
result in overall performance: 


(1) SB, = gi (FB). 


We assume that the manager's mental mapping, g;,(-), 
can be closely approximated by a Fishbein-style linear for- 
mulation (Ajzen and Fishbein 1977). Thus, in Equation 2, 
we replace the function g,,(-) with a vector of importance 
weights (Wke) that captures the manager's perception of the 
importance of each market factor: 


(2) SB, = S wy FBa. 
k 


Conceptually, the assumption of a linear mapping is 
supported by numerous studies that have shown that a linear 
model provides a robust, parsimonious, and paramorphic 
representation of how people combine multiple stimuli 
when forming an overall evaluation (e.g., Ajzen and Fish- 
bein 1977; Bettman 1979; Einhorn 1970; Johnson and 
Meyer 1984; Keeney and Raiffa 1976). Equally important, 
the data-generating process we used to create the informa- 
tion presented to participants was linear. Thus, participants 
who correctly perceived the relationships in the data should 
have used a linear mapping. Empirically, we tested several 
nonlinear specifications for the participants in the study and 
found no indication that a nonlinear specification was nec- 
essary or appropriate. 

Consider now SB,, which represents the manager’s 
summary belief at the continue/stop decision point. Equa- 
tion 2 suggests that there are three potential sources of bias 
of this summary belief: (1) The manager's market-factor 
beliefs, FB, could differ from normatively correct levels; 
(2) the manager's importance weights, Wy, could be dis- 
torted compared with normatively correct weights; and (3) 
when forming a summary belief, the manager could inject 
extraneous, irrelevant information that Equation 2 does not 
capture. For example, if the manager had an argument with 
his or her children over breakfast, an ensuing bad mood 
could affect his or her beliefs. Theoretically, we can capture 
this effect in Equation 2 by augmenting the market-factor- 
beliefs vector to include the manager's mood. As 
researchers, we have access to and can measure only a lim- 
ited set of possible factors, and thus any estimation must try 
to control for unobserved factors. 

Next, we describe how to determine whether any of 
these sources of bias affect the manager's updated beliefs at 
the continue/stop decision stage. In doing so, we are inter- 
ested not in just any difference between the manager's 
beliefs and the normatively correct beliefs but rather in the 
possible systematic biasing effects of the manager's first- 
stage (go/no-go) beliefs (SB,,, FBy,;, w;iy1) and the initial 
decision on his or her second-stage beliefs. 


Updating Market-Factor Bellefs 


We begin by modeling how a manager updates the mean of 
his or her market-factor beliefs (see Equation 3). The model 
includes three terms: (1) the effect of any relevant new 
information that the manager receives about market factors 


(NI), (2) the effect of the manager’s prior market-factor 
belief (FB;,;), and (3) the effect of the manager’s prior sum- 
mary belief (SB;). 


(3) FB io = Gy NI + Oy FB yy + O3,SB,). 


Note that we do not explicitly model the effects of the 
initial decision on the updating process. Rather, we 
hypothesize that if the person’s involvement with the initial 
decision affects the updating process, the coefficients in 
Equation 3 will systematically vary depending on the level 
of involvement. In other words, involvement interacts with 
the right-hand side of Equation 3. In addition, note that if 
we assume that the manager’s beliefs can be described by a 
normal distribution, the first two terms are consistent with 
normative updating, where the coefficients o4, and Op, 
reflect the relative reliabilities of the new information and 
the initial belief, respectively. Conversely, the third term 
represents the hypothesized nonnormative effect of the ini- 
tial summary belief on the updating of market-factor 
beliefs. The rationale behind this last term is the well- 
documented phenomenon that people tend to perceive new 
information as compatible with their prior beliefs, exhibit- 
ing a confirmatory bias (Hoch and Ha 1986). A significant, 
positive coefficient on the initial summary belief is evi- 
dence of biased updating of the market-factor belief.! 


Updating Importance Welghts 


Normatively, importance weights should change only if the 
manager receives new information that specifically indi- 
cates that the model of how market factors interact has 
changed. However, we conjecture that new information that 
pertains only to the levels of market factors may still induce 
a change in importance weights. Specifically, if the man- 
ager's assessment of a market-factor level increases 
(decreases), we suggest that there will be an associated 
increase (decrease) in the importance that he or she accords 
to that market factor. Such a pattern would reflect "elastic" 
importance weights (Boulding, Kalra, and Staelin 1999; 
Hsee 1995). Specifically, the weights would increase in 
value when new information supports a positive initial 
belief and decrease in value when new information contra- 
dicts the positive initial belief. In Equation 4, the third term 
captures this reasoning; if the difference between the man- 
ager's new market-factor belief and prior market-factor 
belief is positive (i.e., the new information is perceived as 
more positive), the importance weight for this factor 
increases. If the difference is negative, the reverse occurs. 
As in Equation 3, the first two terms in Equation 4 represent 
the expected normative effects. 


(4) Wy? = BikNI, + Baw + Pok FB yo — FB). 





‘Failing to find a significant effect on the initial summary belief 
does not mean that the updating 1s normative, because the other 
two coefficients may still be different from their normative values. 
Here, however, we restrict our attention to the more stringent test 
of the significance of the prior summary-belief coefficient. 
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We estimate a slightly different version of Equation 4. It 
will become clearer when we describe the experiment that 
we do not manipulate or provide any new information that 
indicates a change in the market structure (the importance 
weights). Thus, Biy should be zero, and we estimate only 
the Jast two parameters in Equation 4: 


(5) Wi? = Boywai + B3 (FB — FB). 


In this “special case updating" given in Equation 5, B, = 1 
and B4, = 0 would be consistent with normative updating, 
and deviations from these parameter values would reflect 


biased updating. 


Updating the Summary Bellef 


If the manager’s market-factor beliefs and/or importance 
weights at the time of the continue/stop decision are biased, 
then according to Equation 2, the summary belief that the 
manager constructs from those underlying beliefs will be 
biased as well. However, it is possible that the manager’s 
updated summary belief is not constructed in this manner. 
Rather, the manager might update the former summary 
belief on the basis of some macrolevel evaluation of the 
veracity of the new information. Such a model is consistent 
with normative Bayes-nile updating, but at a higher level of 
ageregation. Because the two possible updating models are 
based on the same information, it follows that the two 
should agree in a well-calibrated model. On the basis of this 
principle, we can test for the existence of bias at the 
summary-belief level. Consider a general model that nests 
our two alternative models of summary-belief updating: 


(6) SBy = YiNI + 5SB,, + ¥32W1yp2F Biv. 


If the mean of the summary belief is updated by means 
of Bayes’ rule with the prior summary belief and the new 
information, the 3 term in Equation 6 should provide no 
additional explanatory power. Conversely, if the updated 
belief is constructed on the basis of the updated underlying 
factor beliefs, the first two terms in Equation 6 should pro- 
vide no additional explanatory power. If all three terms 
simultaneously provide explanatory power for the updated 
summary-belief measures, bias is entering the summary 
belief. 

In summary, estimation of Equations 3, 5, and 6 enables 
us to test directly for the existence of bias in the updating 
process and, therefore, the mechanisms underlying escala- 
tion behavior (H,-H3). 


Method 


Before describing our study, we briefly review the require- 
ments needed to test adequately for the occurrence of the 
three proposed mechanisms underlying escalation behavior. 
First, as Northcraft and Wolfe (1984, p. 227) point out, we 
need to "examine decision making situations in which com- 
mitment of further resources is explicitly economically 
inadvisable.” Heath (1995, p. 52) also mentions this need 
and further claims that "previous demonstrations of 'esca- 
lating commitment' with explicit investments did not pro- 
vide relevant information about marginal costs and bene- 
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fits" To address these concerns, we provide the decision 
makers with explicit information about the marginal costs 
and benefits of both continuing with and discontinuing the 
course of action. Furthermore, the information shows that to 
continue is economically inadvisable. Second, because 
involvement with the initial decision and initial beliefs are 
key constructs in the three paths, it is necessary to manipu- 
late both involvement with the first decision and initial 
beliefs. This is not straightforward, because measuring 
beliefs requires the person to be somewhat involved with 
the initial decision. However, using several different 
manipulation checks, we show that even though all our par- 
ticipants are somewhat involved with the initial decision, 
our involvement manipulation leads to significant differ- 
ences in their involvement level. Finally, there is the issue 
of disentangling normative belief updating from biased 
updating. Here, we use our theoretical model to partition 
the two effects. 

More specifically, we conducted an experiment in 
which we asked participants to take the role of management 
in a new product introduction and reevaluation situation. 
The study involved manipulating three variables: the degree 
to which participants were involved with the initial decision 
(high involvement versus low involvement); the extent to 
which the initial information provided a positive initial indi- 
cation about future profits and, thus, positive NPV (very 
positive versus positive); and the degree to which subse- 
quent information was nonconfirmatory (very negative ver- 
sus negative). Together, these three manipulations enable us 
to tease apart the various effects related to H,—H3. 

We used a new product setting similar to our initial 
example that appears in a previous study (Boulding, Mor- 
gan, and Staelin 1997) in which senior-level managers 
exhibited a marked tendency to continue with the product. 
Specifically, between 45% and 100% of the managers con- 
tinued with the product, depending on the particular condi- 
tion. In contrast, when Boulding, Morgan, and Staelin 
(1997) gave the financial information associated with the 
reevaluation decision to a group of 20 academic experts, all 
chose to discontinue the project. This latter finding provides 
an important normative benchmark, suggesting that contin- 
ued commitment to the project in our setting is indeed evi- 
dence of escalation bias. 


Participants 


We obtained responses from 142 participants who com- 
pleted the experiment. Approximately two-thirds of these 
participants were MBA students from four different full- 
time programs, two on the East Coast and two on the West 
Coast. The remaining participants were midlevel managers 
who were enrolled in an executive MBA program at one of 
the East Coast schools. All participants had completed at 
least one full semester of studies that included instruction in 
decision making. Because our sample used MBA students, 
it is logical to ask if this sample can be generalized to the 
population of senior-level managers who normally make 
major investment decisions. To address this issue, we repli- 
cated the control condition used in Boulding, Morgan, and 
Staelin’s (1997) study, which used senior-level managers as 


participants, with a sample of our participants. In our study, 
60% of the participants (12 of 20) in the control condition 
discontinued the product, whereas 66% (27 of 41) discon- 
tinued it in Boulding, Morgan, and Staelin’s study.2 The dif- 
ference in stopping rates between the two studies is not sig- 
nificant (t = .45, p > .30). This point of comparison suggests 
that there is no substantial difference in the way participants 
in the two studies responded to the information provided, 
increasing our confidence that we can generalize our find- 
ings to the population of senior-level managers sampled in 
Boulding, Morgan, and Staelin’s study. 


The Case 


Participants were asked to complete the Quality Valve Com- 
pany case.? The first two pages of the case describe the new 
product, the history of the market, the company, the primary 
competitor, and the investment associated with an improved 
valve targeted at the large-truck emissions-control market. 
The specific initial decision problem was whether the orga- 
nization should invest $2.5 million in new machinery and 
launch the improved valve. | 

As part of the case description, participants learned that 
the financial success of the new product venture depended 
on three key uncertainties: the share of the market the new 
product garners after launch, the industry's growth rate, and 
the competitive-entry response of the major competitor to 
the launch of the new product (which was captured by the 
timing and effectiveness of an innovative competitive offer- 
ing in the future). 

Participants initially received one of two different ana- 
lyst’s reports (positive or very positive) that included pro- 
jections of all the key market factors along with the dis- 
tribution of estimated NPVs, assuming that the product is 
launched. They also were given the expected NPV and 
could easily determine from the NPV distribution the prob- 
ability that the project would yield a positive NPV. In addi- 
tion, the analyst's report provided estimates of the relative 
importance of each of the key uncertainties in determining 
the product's financial success; these estimates are 7096 for 
market share, 22% for competitive-entry response, and 8% 
for market growth. After reading the case description and 
the analyst's report, participants made a launch/no-launch 
decision (high-involvement condition only) and described 
their beliefs about the expected levels of relevant market 
factors and their mean overall summary beliefs. 

Next, participants in the high-involvement group who 
suggested launching the product and all the participants in 
the low-involvement group were given the second part of 
the case. Specifically, they received negative new informa- 
tion about the product's performance during its first two 


2Although this control cell replicates Boulding, Morgan, and 
Staelin's (1997) study, the levels of information that they provide 
differ slightly from those used in this study. Consequently, these 
data cannot be used for further comparisons with other data col- 
lected in this research. 
: 3The case is available on request. 


years on the market. The information included realized val- 
ues for the three key market uncertainties and was presented 
in a balanced fashion. Thus, the realized market share was 
significantly less than anticipated, the realized market 
growth rate was greater than anticipated, and the informa- 
tion about the major competitor's entry response was neu- 
tral in that no new information was given on either the tim- 
ing or the effectiveness of its future offering. In addition, 
participants were told that an uncorrectable production 
problem explained the lower-than-expected market share 
but that the sales force and research-and-development 
groups were very happy about the product introduction. 
Nonetheless, because market share was the most important 
factor in determining profitability and the production prob- 
lem was not correctable, on balance, the information was 
negative. Moreover, the analyst's revised (normatively 
updated) financial report predicted an expected NPV from 
future cash flows that was much lower than initially pro- 
jected. More important, this NPV forecast was substantially 
lower than the guaranteed cash value associated with the 
alternative of dropping the product and selling the equip- 
ment. Thus, as Heath (1995) and Camerer and Weber 
(1999) stress, the marginal costs and benefits of the alterna- 
tive courses of action were made explicit to participants 
before they made the second decision. Specifically, the nor- 
mative decision for the organization based solely on the 
new financial information and the concept of expected val- 
ues was to withdraw the product and sell the equipment. 
This choice was given to participants. After receiving the 
new information, participants were asked to select one of 
two alternatives: continue with the product or withdraw it 
and sell the equipment. 


Experimental Manipulations 


Initial information. 'The initial information provided to 
participants was either positive or very positive. In the very 
positive condition, the expected market share following the 
launch was higher, as was the analyst's assessment of the 
expected NPV from the product. In both conditions, the 
information was sufficiently positive to result in the vast 
majority of participants choosing to launch the product. 


New information. The new information provided to par- 
ticipants was either negative or very negative. In the very 
negative condition, participants were given a realized mar- 
ket share after the launch that was lower than the market 
share given to participants in the negative condition. In 
addition, the analyst's assessment of the expected NPV 
from the product was lower in the very negative condition. 
In both conditions, the information about realized market 
share, the uncorrectable production problem, and the ana- 
lyst’s revised NPV assessment declined sufficiently com- 
pared with the guaranteed cash-out value of stopping the 
project to make stopping the project the normatively correct 
decision. 


Involvement manipulation. 'The theoretical explanations 
underlying the two decision involvement paths suggest that 
the influence of the initial decision on the subsequent con- 
tinue/stop decision is the result of the decision maker's per- 
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sonal involvement in the initial decision (see, e.g., Staw and 
Ross 1987). The literature on escalation bias has manipu- 
lated involvement with a decision by 


1. Manipulating the level of participation the second-stage 
decision maker had with the initial decision by having him 
or her participate (or not) in making the initial decision 
(Barton, Duchon, and Dunegan 1989; Staw 1976), making 
him or her primarily responsible for the decision or respon- 
sible only for making a recommendation to be reviewed by 
others (McAllister, Mitchell, and Beach 1979), and making 
him or her the company president or someone with no con- 
nection to the company (Arkes and Blumer 1985); 

2. Manipulating the personal stake the decision maker had in 
the success of the initial course of action (Arkes and Blumer 
1985) by, for example, implying that there were or were not 
job security implications (Fox and Staw 1979); and 

3. Manipulating how much the decision maker expected to 
need to justify the decision to others (Bobocel and Meyer 
1994; Simonson and Staw 1992). 


We used the first two methods to manipulate involve- 
ment with the initial decision. Specifically, high- 
involvement participants were put in the position of making 
the initial decision to launch or not to launch the product 
after receiving the previously discussed information. These 
participants were instructed that if they launched the prod- 
uct and it succeeded, they would likely be on the fast track 
to a senior management position. They were told that 
choosing not to launch the product would not hurt their 
careers but that they would lose this particular opportunity 
to be on the fast track. Those who chose to launch the prod- 
uct were subsequently told that they were appointed as the 
product’s brand manager. Consequently, they had overall 
responsibility for everything that happened, beginning with 
the decision to launch the product. 

Conversely, low-involvement participants were told that 
the boss, who had a highly centralized decision-making 
style, would make the launch decision. However, the boss 
wanted to know the manager’s beliefs about the key uncer- 
tainties associated with the launch. Furthermore, partici- 
pants in this condition were told that the boss wanted unbi- 
ased opinions, and thus all evaluations about these key 
factors were to be provided anonymously. After they pro- 
vided the requested belief information, these low- 
involvement participants were told that during the two years 
the product was on the market they were assigned to an 
entirely different project and that they were being called 
back to help make the reevaluation decision. Thus, the high- 
and low-involvement conditions differed in terms of (1) the 
level of the decision maker’s involvement in the initial 
launch decision (similar to the method that McAllister, 
Mitchell, and Beach [1979] used), (2) the level of the deci- 
sion maker’s involvement with the product’s performance 
after launch (Arkes and Blumer 1985), and (3) the potential 
career implications of the success of the initial decision for 
the decision maker (Arkes and Blumer 1985). 

To ensure that we successfully manipulated involvement 
with the initial decision, we conducted two independent 
manipulation checks. We provide details of these two 
checks in the Appendix. In summary, for all manipulation 
checks, there were significant differences in the expected 
directions between the two groups with respect to their 
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involvement with the initial decision. Still, we acknowledge 
that the low-involvement situation required the decision 
maker to think hard about the different aspects of the deci- 
sion and, in this way, differed from some prior studies in 
which the decision maker was not provided any information 
about the first decision. However, as Heath (1995) points 
out, it is important to capture the participants' beliefs before 
the first decision is made. This required the participants to 
know about the decision and form beliefs about the ele- 
ments of the decision that were relevant at the time. Conse- 
quently, all participants were at least somewhat “involved” 
with the decision. In the “Results” section, we discuss a 
third "manipulation check" that is based on the participants' 
actual actions. To foreshadow these results, again we found 
that the two groups differed, but this time with respect to 
their decision behavior. This provides additional support for 
our contention that there are significant differences in the 
two involvement groups. 


Design 

We used an incomplete 2 x 2 x 2 design for the study. The 
low-involvement condition was fixed at the positive level of 
the initial information (as opposed to very positive), but it 
was crossed with both levels of new information. Thus, the 
design included only six of the eight possible cells. We used 
this design for parsimony because there are no theoretically 


meaningful predictions about the interaction of involvement 
with the initial decision and the level of new information. 


Measures 


Our theory testing revolved around three different measured 
constructs: market-factor beliefs, importance weights, and 
summary beliefs. After each decision, we took two 
summary-belief measures. They were the manager's assess- 
ment of expected NPV and the probability of success (the 
probability that NPV would be positive). We combined 
these measures after normalizing each measure over the 
sample to have a mean of zero and a standard deviation of 
one. We then summed the two measures for each participant 
to form the summary-belief measure used in our analysis. 

We took measures for each of the three market-factor 
beliefs after both decisions. We measured the market 
growth rate and market-share beliefs using a nine-point 
scale anchored between “very negative” and “very positive" 
(the midpoint was designated as "neutral") Our use of 
affective evaluations rather than actual expectations ensured 
that all participants were on the same "goodness" metric. In 
this way, a higher rating reflects a higher evaluation even if, 
for example, a particular manager believes that a 2096 mar- 
ket share is a great success whereas another considers a 
40% market share a relative failure. For the market-factor 
belief associated with the major competitor's entry 
response, we used two measures: the likelihood of competi- 
tive launch and the effect of a competitive launch. We mea- 
sured both on nine-point scales, normalized them, and com- 
bined them into a single variable that, as with our 
summary-belief measure, captured both the probability of 
the event occurring and the value of the event. 

Finally, we measured importance weights for the com- 
ponents relevant to the second decision by asking partici- 


pants to specify the relative importance of the three market 
factors in determining the project's financial: success. In 
particular, participants divided 100 points between the three 
factors. 


Results 


Because we are interested in situations that involve two 
decisions, we discarded data for the 7 high-involvement 
participants who chose not to launch and for the 5 low- 
involvement participants who, given their beliefs, presum- 
ably would have chosen not to launch.4 Of the remaining 
130 participants, 68 (52%) decided to stay with the losing 
course of action, exhibiting behavior that is compatible with 
an escalation bias. Given this evidence of nonnormative 
behavior, we next report findings that bear on the strength 
of the three nonnormative paths based on our framework. 


Decision Involvement Inertia Results 


The Decision Involvement Inertia path implies that involve- 
ment should directly influence the second decision. To test 
this, we heeded the warnings of Heath (1995) and con- 
trolled for summary beliefs. Table 1 presents the results of 
our logistic regression for three different models. In Model 
l, the only independent variables are two of our experi- 
mental manipulations—initial information and new infor- 
mation—and our measure of the participant’s summary 
beliefs. Note that summary beliefs are highly significant 
and completely mediate any effects of the initial and new 
information. In Model 2, we add a variable that captures the 
experimental manipulation of involvement with the initial 





4All 7 of the 98 participants in the high-involvement conditions 
who chose not to launch the product estimated probability of suc- 
cess to be less than 50%. The average for these seven participants 
was 46%, as opposed to 73% for those who continued. In addition, 
5 of the 44 participants in low-involvement conditions forecasted a 
negative NPV and/or a less than 50% chance of a positive NPV. 
Given this initial negative forecast, we assumed that they would 
not have gone forward with the initial launch and thus eliminated 
them from further analysis. Furthermore, when we compared this 
initial decision not to launch for the two involvement groups, hold- 
ing fixed the level of initial information, we observe that 5 of 59 
did not launch in the high-involvement group, and we assume that 
5, of 44 would not have launched in the low-involvement group. 
These proportions are not significantly different. 


decision. As is immediately eviderit, involvement does not 
reach traditional levels of significance (p = .12). Moreover, 
there are no significant differences between Model 2 and 
Model 1 in the log-likelihood (122.135 versus 124.609) or 
in the predicative accuracy (84.396 versus 82.696). Thus, we 
find little statistical justification for the Decision Involve- 
ment Inertia path. 

A question might arise as to why our involvement mea- 
sure did not affect the second decision, given that numerous 
previous studies have shown. an ‘effect. One explanation is 
that our involvement manipulation did not provide enough 
difference between the two groups in terms of their percep- 
tions of being involved with the first decision. However, 
when we ran a model without the sümmary-belief measure 
but with the involvement measure (Model 3), we found that 
involvement was positively associated with continuing with 
the initial course of action and that this efféct was signifi- 
cant at the .05 level. This result has three major implica- 
tions: First, it provides one more piece of evidence that our 
involvement manipulation workéd. Second, it reproduces 
many previous findings. on involvement in which 
researchers did not control for sutpmary- beliefs. Third, a 
comparison of this result with the insignificant involvement 
effect in Model 2 shows that even thougli involvement may 
be associated with escalation of commitment, being psy- 
chologically and/or socially inyolved with the initial deci- 
Sion is not a necessary condition for escalation to occur. 
Instead, it seems that escalation bias enters the decision 
through the updated belief structure. Consequently, we turn 
our attention to the two paths that. pertain to distortion of 
information and, thus, biased updating. 


Decision Involvement Distortion Results 


Before we present the Decision' Involvement Distortion 
results, note that we tested both distortion paths using some 
variables that we measured instead of manipulated. This 
required us to control for background factors, such as par- 
ticipants’ individual differences, that could be correlated 
with both our independent variables and our dependent 
measure of interest (and thus produce misleading esti- 
mates). We did this using an instrurnental variable (IV) pro- 
cedure, in which the selected instruments are known to be 
free from any individual background factors. The IV proce- 
dure replaces the actual value of a measure with its pre- 








TABLE 1 
Results of Logistlc Regression on the Continue/Stop Decision (N - 126) 
Variable i Model 1 Modei 2 Model 3 
Intercept —4.94* —b.358* —.81* 
(1.041) (1.09) (.395) 
Initial information —.11 —.458 .388 
(.532) (.578) (.442) 
New information 201 .41 
(.439) (.449) (.366) 
Involvement with the Initial decision .85* 
(.517) (.437) 
Updated summary belief .0936* .092* 
(.02) (.0193) 
*p< 05 


Notes: Standard errors are in parentheses. 
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dicted value using the selected instruments, thus purging 
unobserved background factors from the measures. Note 
that the presence of correlation of our independent variables 
with unobserved individual differences is testable. Thus, in 
each equation, we used Hausman’s (1978) test to identify 
whether the IV estimation procedure was required to avoid 
misleading estimates due to unobserved background factors 
that differed across participants and were correlated with 
both our dependent and independent measures. If the test 
indicated the presence of correlation, we used the IV proce- 
dure (this only happened for the summary-belief equation). 
As a result, our findings based on the use of measured 
variables can be interpreted as having the same validity as 
traditional analyses of variance based on manipulated 
variables. 

Decision Involvement Distortion (H5) suggests that 
involvement with the initial decision indirectly affects man- 
agers' decisions by altering the way they update their 
beliefs. If this is true, we would observe different patterns 
for the updating process for:the two involvement groups. 
Consequently we tested this hypothesis by determining 
whether the coefficient vectors, both slope and intercept, 
varied significantly between the two groups for each of the 
belief-structure updating equations. 

We began by testing the summary-belief estimation 
equation (Equation 6). The appropraate F test indicated that 
the estimated intercept and slope coefficients did not vary 
between the two involvement groups for this equation 
(F4. 119 = .76). Next, we estimated the updating equations 
for the three market-factor beliefs for the same two involve- 
ment groups (Equation 3). Again, we failed to find any evi- 
dence of a difference in updating between the two groups 
(market share: F4, 122 = 1.51; competitive-entry response: 
F4, 127 = 1.19; and market growth: F4. 122 = 1.53). Finally, 
we estimated the updating equation for the importance 
weights associated with the market-factor beliefs. Here, we 
estimated a single updating equation (Equation 5) for the 
importance weights because the weights were constrained 


by a constant sum scale and therefore were interrelated. 
Again, we failed to find significant differences in updating 
for the two groups (Fs. 46? = 1.14). In summary, we failed to 
hind any difference in updating of beliefs due to involve- 
ment with the initial decision, and therefore we found no 
support for the path associated with Decision Involvement 
Distortion. 


Bellef inertia Distortlon Results 


Our previously reported logistic analysis suggests that 
updated summary beliefs are the driving forces behind the 
second continue/stop decision. Because we found that 
involvement does not affect the coefficients in the updating 
equations, we used the combined sample to estimate the 
three  market-factor-belief-updating equations, the 
importance-weight-updating equation, and the summary- 
belief-updating equation. In each case, we tested H4 by 
examining the nonnormative aspects of the equations. 


Market-factor beliefs. We report the results for the 
market-factor-belief-updating equations in Table 2. These 
results indicate support for Belief Inertia Distortion. In sup- 
port of H3, we find that the effect of the initial summary 
belief is positive and significant in both the market-share- 
updating (p < .05) and the market-growth-updating (p < 
.01) equations. In addition, as we expected, the initial 
summary-belief effect is not significant in the competitive- 
entry-response-updating equation. Because there was no 
new information provided about the major competitor's 
entry response, participants would not be expected to 
engage in an updating process (biased or unbiased) regard- 
ing the competitive response. 


Importance weights. We report the results from the 
importance-weight-updating equation in Table 3. Again, 
these results demonstrate support for Belief Inertia Distor- 
tion. Consistent with H5, we found a significant (p < .001) 
increase (decrease) in importance weights for market fac- 
tors when participants perceived the new information as 


TABLE 2 
Results of the Market-Factor-Belief-Updating Equations 





Market-Factor Bellef 








Competitive-Entry 
Market Share Market Growth Response 
Variable (N = 129) (N = 129) (N = 128) 
Consistent with Normative Updating 
Intercept 3.047*** 5.647*** 3.032*** 
(.592) (.646) (.464) 
New informaton .665** —088 ` —.169 
(.244) (.258) (.227) 
Initial market-factor belief .108 .149* .948*** 
(.075) (.088) (.078) 
Consistent with Blased Updating 
Initial summary bellef o .265** ~.084 
| (.386) (.102) (.361) 
"p< 05, 
**p < 01. 
"p < .001. 


Notes Standard errors are in parentheses. 
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TABLE 3 
Results of the Importance-Weights Equation 
(N = 366) | 
Variable 
Consistent with Normative Updating 
Initial Importance weight .764* 
(.029) 


Consistent with Blased Updating 
Helative positive or negative new 2.234* 
component information (.644) 


*p < 001. 
Notes: Standard errors are in parentheses, no intercept regression 
ls present in the model. i 


more (less) positive than their perception of the initial 
market-factor information. 

Summary beliefs. We present the estimation results of 
the summary-belief-updating equation in Table 4. These 
results show that the effect of the initial summary belief is 
positive and significant (p < .001), as is the effect of inte- 
gration of the participants’ updated market-factor beliefs 
and importance weights (p < .001). As we noted in our dis- 
cussion of Equation 6, for a well-calibrated manager, one 
but not both of these effects should be significant. Both 
terms are highly significant, which is a clear indication of 
bias entering the updated summary belief. 

In summary, we find strong support for Belief Inertia 
Distortion. Moreover, the distortion appears concurrently at 
multiple levels—that is, at the aggregate level of summary 
beliefs and at the more disaggregated level of both compo- 
nent beliefs and importance weights. 


Alternative Explanations for the Data 


Previous theories of escalation, such as self-justification, 
face saving, and framing of options, have pointed to 
involvement with the initial decision as the main driver of 
escalation behavior. Our results are compatible with these 
previous theories in that we find that high-involvement 
decision makers are more likely to stay committed to a los- 
ing course of action than low-involvement decision makers. 
However, our results suggest that involvement with the ini- 


TABLE 4 | 
Results of the Summary-Bellef-Updating Equation 
(N = 124) 
Variable | 
Intercept 34.664* 
(12.162) 
New information 2.871 
(30.151) 
Initial summary bellef& .251** 
(.066) 
Zwol Pag? 217" 
(.041) 
*p < .01. 
**p < .001. 


aVarlable is instrumented. 
Notes: Standard errors are in parentheses 


tial decision is not a necessary condition for escalation 
behavior. Instead, we find that biased belief updating is the 
driving force behind escalation behavior. Indeed, we find 
that participants in the low-involvement conditions who 
formed initially favorable opinions of the new products 
exhibited the same behavior as participants in the high- 
involvement condition. 

Given the limited role of biased belief updating in previ- 
ous studies, it might be expected that at least some, if not 
most, of the escalation effects would operate through the 
paths associated with decision involvement. However, our 
results counter much of the extant literature. Consequently, 
we explore possible alternative explanations for our 
findings. 

Perhaps the simplest explanation for our finding of no 
effects from Decision Involvement Inertia and Decision 
Involvement Distortion is that our involvement manipula- 
tion did not provide meaningful separation between the two 
groups. This separation is critical because the explanations 
associated with these two paths rest on the assumption that 
the reevaluation decision makers differ in terms of involve- 
ment with the initial decision. Importantly, our three inde- 
pendent manipulation checks indicated significant differ- 
ences between the two groups in terms of both the 
measured level of involvement and behaviors associated 
with different levels of involvement. Thus, we reject the 
idea that the lack of support for the explanations associated 
with these two involvement paths is due to a failed involve- 
ment manipulation. 

Another possible alternative explanation for the lack of 
support for the decision involvement paths is the possibility 
of reverse causality in some of the paths. In this scenario, 
involvement/responsibility for the initial decision still acts 
as the "trigger" for escalation bias. The logic is as follows: 
Responsibility for the initial decision causes the decision 
maker to stay the course. Having made this decision, the 
decision maker modifies his or her beliefs to bring them 
into accord with the decision. That is, the causal arrow runs 
from the second decision to the updated beliefs. 

To explore the feasibility of this account, we searched 
for evidence that would be compatible with this reverse 
causality argument. Specifically, the reverse causality inter- 
pretation implies that initial involvement with the decision 
must be correlated with the updated beliefs after we remove 
the mediating role of the subsequent decision. However, as 
we show in our analysis of the Decision Involvement Dis- 
tortion path, the degree of involvement with the initial deci- 
sion is independent of updated summary beliefs, market- 
factor beliefs, and weights. Therefore, the conjecture of 
reverse causality between the second decision and updated 
beliefs is incompatible with our data. 


Implications 
In this article, we provide an integrative framework that 
organizes the existing literature around three possible paths 
to escalation behavior. Although our methodology allows 
for the simultaneous conceptual and empirical existence of 
all three paths, our results suggest that the driving force 
behind escalation behavior in our study was not involve- 
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ment with the initial decision but rather biased belief updat- 
ing that outweighs initial positive beliefs. 

In considering the theoretical implications of our find- 
ings, we acknowledge that these findings are based on one 
laboratory study, albeit with a diverse set of participants. 
Thus, although we believe that these findings generalize to 
other populations and other organizational settings (e.g., the 
National Basketball Association draft; hiring decisions; 
other strategic marketing decisions, such as selecting an ad 
campaign or choosing a new channel of distribution), we 
caution readers to draw their own conclusions about the 
extent to which our findings generalize. In addition, we 
acknowledge that all our participants had some involvement 
with the initial decision, and thus our study differed from 
some prior studies in which the noninvolved group had no 
contact whatsoever with the initial decision. That said, our 
findings suggest that Decision Involvement Inertia (involve- 
ment with the initial decision directly affecting subsequent 
decisions) is not the major driving force behind escalation. 
This casts doubt on several commonly invoked explanations 
for escalation, such as a framing effect, agency theory, and 
public or self-justification. 

We acknowledge that this is a highly unexpected con- 
clusion. Because the lack of a Decision Involvement Inertia 
effect was so surprising, we took extra care in our analysis 
to ensure that no such effect was being masked by either 
our method or our analysis. Having taken these steps, we 
point to two factors in our methodological approach that 
could account for why our findings differ from much of the 
existing escalation literature. First, with regard to escalation 
research, Northcraft and Wolfe (1984), Heath (1995), and 
Camerer and Weber (1999) point to the importance of hav- 
ing an explicit normative benchmark and note the absence 
of such benchmarks in most escalation studies. From this 
starting point, they conclude that though there are a large 
number of prior studies that examine escalation of commit- 
ment, these studies provide little empirical evidence in sup- 
port of the existence of escalation bias. Therefore, we took 
great pains to provide information about marginal costs and 
benefits for each option, and in our setting, it is clear that 
the continuation decision is nonnormative. Thus, we believe 
that our study provides clear evidence of nonnormative 
escalation behavior. Second, Heath (1995) argues for the 
importance of disentangling the role and measurement of 
beliefs from the role and measurement of making the initial 
decision. Our study is the first to do this, enabling us to con- 
trol for beliefs when testing for the effects of involvement 
with the initial decision. . 

We also find no support for the hypothesis of Decision 
Involvement Distortion (involvement with the initial deci- 
sion leads to biased belief updating). Although this path has 
received much less attention in the literature, this result is 
important because it again shows that involvement with the 
initial decision is not the root cause for escalation behavior. 

Instead, our results point to the powerful influence and 
complexity of biased belief updating, which weights initial 
positive beliefs too heavily (the Belief Inertia Distortion 
path). Note that the biasing influences on belief updating 
enter the process at many different levels. Specifically, we 
find that prior beliefs nonnormatively affect not only the 
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updated market-factor beliefs but also the importance 
weights placed on those factors and the summary beliefs. 
Thus, all these sources of bias enter into the updating 
process concurrently. 

Note also that these nonnormative updating results are 
compatible with Boulding, Kalra, and Staelin's (1999) 
results. This compatibility is notable because their setting 
was very different. In that study, consumers were the infor- 
mation processors, and longitudinal evaluations of service 
quality were used instead of updated evaluations of a new 
product introduction. Consequently, we conclude that the 
model of belief updating presented in our article should be 
useful in understanding belief updating in a wide range of 
situations. Our findings should be of interest to researchers 
who study belief formation and updating as well as those 
who are interested in understanding escalation. In general, 
the results point out that people have difficulty recognizing 
redundancies in the information they use to form overall 
beliefs. This is of particular importance to both marketing 
scientists who design marketing decision support systems 
and marketing researchers who are interested in understand- 
ing how consumers make significant decisions over time. 

From a management perspective, the real issue is how to 
prevent escalation behavior in situations such as the intro- 
duction of a new product or the start of a new strategic 
alliance. Our findings should be helpful in determining 
which approaches have the most promise in preventing 
escalation behavior. However, because escalation was 
caused by a biased belief-updating process, our findings 
highlight the inherent difficulty in trying to eradicate esca- 
lation behavior completely. This may provide some expla- 
nation for why previous studies examining deescalation 
strategies produced mixed results (Boulding, Morgan, and 
Staelin 1997; Keil and Robey 1999; McNamara, Moon, and 
Bromiley 2002; Simonson and Staw 1992). The findings 
from those studies, in conjunction with our findings, lead us 
to suggest the following methods for preventing escalation 
behavior: (1) Change the organizational structure such that 
continue/stop decisions are made by someone with no prior 
beliefs about the project; (2) use stopping rules that are 
based on objective data; and (3) accept that decision makers 
hold biased beliefs, and institute policies and procedures 
that minimize the adverse effects of these biased beliefs. 
Such a policy could be an educational program that stresses 
the idea that managers tend to examine data with the goal of 
making the world appear consistent with their own views of 
reality (Carlson and Russo 2001). Specifically, it could 
stress the need for a decision maker to document how he or 
she might be wrong, thus forcing him or her to attend to 
negative information. 

However, even these suggestions are not foolproof. For 
example, Bolton (2003) shows that such analytic reasoning 
is unlikely to eliminate bias from sticky priors if these pri- 
ors were generated by nonanalytic thinking. Likewise, 
when McNamara, Moon, and Bromiley (2002) tried to use 
the approach of changing decision makers, they were able 
to attenuate but not eliminate escalation in a commercial 
lending situation. Similarly, Boulding, Morgan, and Staelin 
(1997) found that stopping rules adopted at the time of the 
initial decision also attenuated escalation but that some 


decision makers simply ignored their own stated stopping 
rule when making subsequent decisions. This willingness to 
change the rules is similar to what Keil, Mann, and Rai 
(2000) found in the domain of information technology pro- 
jects. They found that the best explanation for escalation in 
such projects was what they referred to as the “completion 
effect"—a tendency to substitute the goal of completing the 
project for the goal of maximizing economic returns. 
Together, these studies indicate that implementing organiza- 
tional solutions will not be easy and that managers need to 
be doubly alert to the possibility of escalation bias. More- 
over, research is needed to understand the underlying dri- 
vers of biased belief updating to help identify better ways to 
reduce escalation behavior in managerial settings. For 
example, research on decision processes at the team or 
organization level may point to additional factors that might 
help individuals and, therefore, firms avoid the trap of esca- 
lation bias. 


Appendix 


Manipulation Check 15 


In the first manipulation check, we collected data from 59 
participants from the same MBA student pool previously 
described, varying only the high-involvement (29 partici- 
pants) and low-involvement (30 participants) conditions. 
Participants completed the same initial decision/evaluation 
part as in the main study. After reading the part in the case 
that informed them that the company decided to proceed 
with the product introduction, they were asked to indicate 
how well ten different questions (see the “Manipulation 
Check Questions”) reflected their beliefs. The questions 
assessed (on a nine-point scale ranging from “low” to 
"high") how much respondents felt publicly committed, 
accountable, and responsible for the product launch.6 

Each of the ten items is significantly different (p « .001 
or better) between the low- and the high-involvement 
groups and in the expected direction. If we sum the items 
and divide by ten, the mean of the low-involvement group is 
5.1 and the mean of the high-involvement group is 7.2. 
These means are significantly different (t = 6.84, p « .001), 
and these results indicate significantly different involve- 
ment in the low- and high-involvement groups. 


Manipulation Check 2 


À total of 44 participants taken from the same MBA student 
pool and evenly divided between the high- and the low- 
involvement conditions completed the same initial decision/ 
evaluation part as in the main study. After finishing this 
decision/evaluation task, they were asked to fill in a debrief- 
ing sheet that instructed them to "describe why you think 
the Quality Valve Company should or should not launch the 
new product." 


SIn both manipulation check studies, we collected data in the 
positive prior information/negative new information conditions. 

6Cronbach’s alpha for the ten items 1s .94. Principal component 
factor analysis indicates a single factor solution. 


Prior research indicates that people who feel account- 
able for (i.e., psychologically and/or socially involved with) 
a decision have a need to justify their position. Therefore, 
they generate attitude-consistent thoughts, which lead to 
more (less) thoughts that support (oppose) their initial atti- 
tude (Huber and Seiser 2001; Lambert et al. 1996; 
Schlenker 1980; Tetlock 1985; Tetlock, Skitka, and 
Boettger 1989). Thus, we expect the high-involvement 
respondents to mention more arguments in support of and 
fewer arguments opposed to their decision than the low- 
involvement respondents. In addition, evidence in the litera- 
ture suggests that people who are accountable for their deci- 
sions write more thorough justifications (Koonce, 
Anderson, and Marchant 1995), are more likely to qualify 
their opinions, and exhibit more recognition of the trade- 
offs involved (Lord 1992). Thus, we expect the respondents 
in the high-involvement condition to report relatively more 
arguments that make explicit trade-offs between the pluses 
and minuses of the decision. 

Two coders, who were blind to the hypotheses, placed 
participants' arguments into the following three categories: 
(1) arguments that support the initial decision, (2) argu- 
ments that do not support the initial decision, and (3) argu- 
ments that make explicit trade-offs between supporting and 
nonsupporting aspects. The percentage of agreement 
between the coders was 8496, with a Cohen's (1960) kappa 
of .73 and a Perreault and Leigh (1989) reliability index of 
.87 (95% confidence interval of .81—.93). All measures indi- 
cate a reasonable level of agreement between the two 
coders. 

We used the F test for differences in count data (Kanji 
1993, p. 51) to test for differences between our manipulated 
conditions. As we expected, the number of supporting argu- 
ments made by the high-involvement group was signifi- 
cantly greater than the number made by the low- 
involvement group (F67, 110 = 1.62, p < .05), and the pattern 
was reversed for nonsupporting arguments (F5o 54 = 1.77, 
p < .05).8 As we also expected, the high-involvement group 
produced significantly more explicit trade-off arguments 
than the low-involvement group (Fs. 33 = 8.00, p < .001). 


Manipulation Check 1 Questions 


Respondents were asked to assess “How well do the follow- 
ing statements reflect your beliefs" on a nine-point scale 
ranging from "completely disagree" to "completely agree" 
for the following ten statements: 


1. If the project succeeds, I will be given substantial credit. 


2. People will associate the success or failure of the project 
with me. 


7The coders actually coded arguments as for or against the 
launch. For obvious reasons, we did not want the coders to be 
aware of the respondent's actual decision. By examining partici- 
pants' launch decisions later, we were able to ascertain whether the 
arguments were supporting or not. 

8All reported results are based on the average of the two coders’ 
values. Analysis based on each coder separately yields the same 
results. 
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3. I am largely responsible for the decision to launch the 
product. 


4. The outcome of this project will have a major impact on 
my career. 


5. If the project fails, I will be given substantial blame. 
6. People will attribute the decision to me. 


7. I feel responsible for the chosen course of action. 
8. I feel attached to the decision to launch the product. 
9. My input into the decision process was public. 


10 If the project fails, top management will believe that I gave 
poor advice. 
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The Evolution of Loyalty Intentions 


The drivers of customer loyalty intentions are dynamic. What remains unclear is how these intentions evolve 
through the introduction and growth phases of a life cycle. Using a longitudinal study of cellular phone customers, 
the authors demonstrate that loyalty intentions are a function of perceived value early in the life cycle. Over time, 
more affective attitudes toward the brand and the relationship with the company come to mediate the effects of 
value on intentions. The results suggest that from the introduction to the growth stage of a life cycle, managers 
must adapt from improving value per se to measuring and managing relationships and brands directly. 


roviding customers with perceived value or customer 
satisfaction is widely recognized as a means of 
improving loyalty intentions (Fornell et al. 1996; 
Zeithaml, Berry, and Parasuraman 1996) and actual reten- 
tion (Bolton 1998; Bolton and Lemon 1999; Mittal and 
Kamakura 2001). However, research demonstrates that 
these relationships are potentially complex and dynamic 
and that the drivers of intentions change and evolve over 
time (Mittal, Kumar, and Tsiros 1999; Slotegraaf and Inman 
2004). Such research focuses on relatively mature product 
or service categories, such as automobiles and credit cards. 
What remains unclear is how the drivers of intentions 
evolve for a new-to-market offering. 

A better understanding of these dynamics is essential 
for researchers and managers alike. Jt enables researchers to 
focus on the most important predictors of performance 
while helping managers design marketing programs that 
effectively adapt to customers through the diffusion 
process. This article builds on dynamic marketing models in 
two important ways: First, we explore the drivers of inten- 
tions for a panel of loyal customers who purchased and 
used a new-to-market offering— specifically, a new- 
generation cellular phone technology—through the intro- 
duction and growth phases of a life cycle. Second, we incor- 
porate two particular attitudes that have not been studied 
previously in this line of research: the affective commitment 
toward maintaining a relationship with the company 
(Fullerton 2003; Garbarino and Johnson 1999; Verhoef 
2003) and brand equity (Keller 1993; Rust, Zeithaml, and 
Lemon 2000). 

We first describe research on the dynamics of loyalty 
intentions and then develop our conceptual model and 
hypotheses. Using attitude theory, we argue that perceived 
value, as an overall evaluation of performance given price 
or prices paid, has a more direct influence on customers’ 
intentions early in a life cycle when customers are gathering 
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information, forming opinions about brands and relation- 
ships, and trying to reduce risk (Heilman, Bowman, and 
Wright 2000). Over time, attitudes toward the brand and 
relationship should come to mediate the effect of value on 
intentions (Oliver 1999). We test our model on a panel of 
cellular phone customers who were loyal to a particular 
brand over a four-year period that coincided with the intro- 
duction and growth phases of the life cycle. The results 
illustrate how loyalty intentions evolve from being purely 
value based early in the diffusion process to being brand 
and relationship based as the market evolves. The study has 
important implications for managers who are charged with 
allocating resources to improve product benefits, brand per- 
ceptions, and customer relationships over time. 


The Evolution of Loyalty Intentions 


The vast majority of work relating perceived performance 
to intentions has been cross-sectional. However, recent 
research has suggested that intention drivers vary consider- 
ably over time. Mittal, Kumar, and Tsiros (1999) compare 
the drivers of perceived performance and subsequent loy- 
alty intentions for new vehicle owners 3 to 4 months after 
purchase and again 21 months later. They find that service 
satisfaction has a greater impact on intentions early in the 
relationship, whereas product satisfaction has a greater 
impact later in the relationship. In a subsequent study, Mit- 
tal, Katrichis, and Kumar (2001) examine the drivers of 
customer satisfaction for people who owned a credit card 
for less than a year versus more than a year. They find that 
the weight customers place on different performance attrib- 
utes shifts on the basis of their tenure with a firm. Again, in 
an automotive context, Slotegraaf and Inman (2004) study 
changes in the effects of resolvable attributes (attributes that 
can be fixed or repaired under warranty, such as a faulty 
part) versus irresolvable attributes (attributes that cannot be 
fixed or changed regardless of the warranty, such as size or 
gas mileage). They find that resolvable attributes have 
greater influence on satisfaction over time, whereas irre- 
solvable attributes have less influence. 

These studies reinforce the need for a dynamic perspec- 
tive. Yet the settings for these studies involve relatively 
mature product categories (e.g., automobiles, credit cards). 
Logically, loyalty intentions should be particularly dynamic 
for a new-to-market offering. Theory suggests a specific 
dynamic for such offerings, for which more affective-based 
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attitudes come to mediate the effects of performance per- 
ceptions on intentions over time. Following traditional atti- 
tude models (Bagozzi and Warshaw 1990), we expect that 
the degree of mediation from perceived value, to brand and 
relationship attitudes, to intentions varies systematically as 
the market evolves. As the customers’ experience with a 
particular offering grows over time, attitudes toward brands 
and relationships should become stronger, more “top of 
mind” or accessible, persistent, resistant to change, and 
likely to guide intentions and subsequent behavior (Fazio, 
Powell, and Williams 1989; Priester et al. 2004). 

We use customers’ overall perceived value to capture 
perceptions of performance. As we noted previously, two 
important attitude-type constructs have emerged in the 
literature as possible mediators of the value—intentions rela- 
tionship. The first is affective commitment (Morgan and 
Hunt 1994; Verhoef 2003), or the factors that create sticki- 
ness in a relationship. This commitment is affected directly 
by the degree of personal interaction between a customer 
and a company and how the company manages the cus- 
tomer's account over time (Bendapudi and Leone 2002). 
The second is brand equity, which is influenced by factors 
such as repeated performance and satisfaction (Keller 1993; 
Selnes 1993), word of mouth or the “buzz” about the brand 
(Rosen 2000), the degree of identification with the brand, 
and its relevance to a customer's situation (Aaker 2004; 
Keller 2003). 


Perceived Value 


Perceived value is a customer's overall evaluation of what 
he or she receives compared with what he or she gives up or 
pays (Bolton and Drew 1991). Similar constructs in the 
marketing literature include payment equity (Bolton and 
Lemon 1999), distributive equity or fairness (Olsen and 
Johnson 2003), and reciprocity (Bagozzi 1975). Following 
the work of Bolton and Lemon (1999), we conceptualize 
perceived value as a broad construct that encompasses per- 
ceptions of quality given price and inputs versus outputs 
relative to the competition. We do not include overall cus- 
tomer satisfaction in our study. Empirically, however, per- 
ceived value and customer satisfaction are closely related 
constructs in the literature (Bolton and Lemon 1999; For- 
nell et al. 1996). Moreover, when customers are in an 
information-gathering or problem-solving mode, as should 
be the case in the early stages of a life cycle, value or equity 
perceptions are more relevant (Olsen and Johnson 2003). 


Affective Commitment 


Research on relationship commitment distinguishes 
between affective commitment and calculative'commitment 
(Bendapudi and Berry 1997; Hansen, Sandvik, and Selnes 
2003; Johnson et al. 2001). Our focus is on affective com- 
mitment, which is a “hotter” or more emotional factor 
related to the degree to which a customer identifies and is 
personally involved with a company and the resulting 
degree of trust and commitment (Bendapudi and Berry 
1997; Garbarino and Johnson 1999; Morgan and Hunt 
1994). Similar constructs used to describe exchange rela- 
tionships include friendship, rapport, and trust (Fullerton 
2003). Initially developed to explain employee commitment 


to organizations or work groups, the concept applies to con- 
sumption relationships as well (Gruen, Summers, and Acito 
2000). In a recent study of financial services, Verhoef 
(2003) demonstrates direct effects of affective commitment 
on actual bebavior, both relationship maintenance (reten- 
tion) and relationship development (share of a customer's 
business). Although both perceived value or payment equity 
and customer satisfaction were positive antecedents of 
affective commitment, they did not directly affect behavior. 

Calculative commitment, also called continuance com- 
mitment (Fullerton 2003), captures the "colder" or more 
rational, economic-based dependence on product benefits 
that is due to a lack of choice or switching costs that make 
it difficult to change suppliers (Anderson and Weitz 1992; 
Dwyer, Schurr, and Oh 1987). The company survey we use 
in this study contains only measures of affective commit- 
ment. However, we expect that some significant calculative 
commitment or switching costs exist and influence behav- 
ior, including the cost of the cellular phone itself and the 
inability of customers to transfer their numbers from phone 
to phone (Gustafsson, Johnson, and Roos 2005). 


Brand Equity 


Brand equity is defined as "the differential effect of brand 
knowledge on consumer response to the marketing of a 
brand" (Keller 1993, p. 8). Central to this definition is the 
concept of differential effect. When loyalty intentions are 
explained, this is the effect that brands have beyond the rest 
of the value proposition. In our conceptual model, which we 
describe subsequently, the two major non-brand-related 
effects are overall perceived value (products and services 
received for price or prices paid) and affective commitment, 
or the strength of the relationship with a company. Research 
shows that brand equity has a direct effect on intentions and 
mediates the effects of quality and satisfaction on intentions 
to various degrees (Selnes 1993). 

Particularly relevant to brand equity is the degree of per- 
sonal identification with a brand and its relevance to a cus- 
tomer's situation (Aaker 2004; Keller 2003). It is this iden- 
tification or personal “fit” with a brand that helps create the 
differential effect of brand equity on intentions beyond the 
effects of performance or relationships per se. Keller (1993) 
describes two approaches to measuring brand equity: indi- 
rectly through brand knowledge (brand awareness and 
image) and directly through the impact of brand knowledge 
on customer responses. Our study uses elements of both 
approaches. We measure brand equity as a latent variable 
using multiple measures, such as the degree to which the 
brand fits a customer's personality and lifestyle. We then 
determine whether the construct explains loyalty intentions 
beyond the influences of perceived value and affective 
commitment. 


Conceptual Model 


Our conceptual model in Figure 1 posits that perceived 
value builds loyalty intentions both directly and indirectly 
through the creation of brand equity and affective commit- 
ment. The prediction that brand equity has greater influence 
on intentions over time is consistent with other research on 
the evolution of brand preferences. Heilman, Bowman, and 
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FIGURE 1 
The Evolution of Loyalty Intentions: Conceptual Model 
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Wright (2000) examine how brand preferences and 
responses to marketing activities evolve for customers who 
are new to established product categories (disposable dia- 
pers and towels). They find little evidence of brand loyalty 
early in customers' experiences, but as customers gather 
more information and as perceived risk subsides, they buy 
the brands they prefer. 

Our conceptual model is consistent with Rust, Zeithaml, 
and Lemon's (2000) customer equity model, which includes 
three categorically different ways to increase customer 
equity: value equity, brand equity, and retention equity. 
Value equity is a customer's relatively cognitive perception 
of value, brand equity includes subjective appraisals of a 
brand that are not captured by objective performance, and 
retention equity is that which is gained from relationship- 
building and retention programs. For example, using data 
from the airline industry, Rust, Lemon, and Zeithaml (2004) 
show the projected returns to an organization from invest- 
ments in various customer equity drivers (e.g., quality, 
advertising, loyalty programs). Our model is also consistent 
with the logic that underlies Berry and Parasuraman's 
(1991) framework for customer retention strategies. 
Accordingly, early in a relationship, customers are tied to a 
firm primarily through product or service benefits and 
financial incentives (perceived value). Stronger relation- 
ships grow through personal relationships, social bonds, 
and customer intimacy. 


Temporal Effects and Consumption Systems 


Prior studies have demonstrated that the drivers of inten- 
tions are dynamic; that is, they vary over time. What these 
studies do not show is how these drivers evolve as cus- 
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tomers repeatedly use and experience an offering through 
the introduction and growth stages of a life cycle. An 
important factor to consider when modeling this evolution 
is the existence of temporal or carryover effects from one 
period to another. Early on, Oliver (1980) demonstrated the 
time dependence of attitudes and behavioral intentions in a 
satisfaction model. He showed how attitudes in one period 
are a function of both customer satisfaction in that period 
and attitudes in the previous period. In turn, behavioral 
intentions in a given period are a function of attitudes in 
that period and prior period intentions. Similarly, Bolton 
and Drew (1991) find temporal effects for both attitudes 
and overall quality perceptions. 

The time dependence of evaluations, attitudes, and 
intentions is a central feature of Mittal, Kumar, and Tsiros's 
(1999) consumption-system approach. They define (p. 89) a 
consumption system as “a bundle of goods and services that 
are consumed over time in multiple consumption episodes." 
Temporal effects in a consumption system are similar to the 
effects in Oliver's (1980) and Bolton and Drew's (1991) 
studies, in which perceptions, attitudes, and intentions in 
one period serve as anchors for the same constructs in sub- 
sequent periods. Evaluations of perceived value, brand 
equity, affective commitment, and loyalty intentions are not 
constructed anew each period. Rather, they are updated ver- 
sions of prior evaluations (Helson 1948; Oliver 1980). 

Another feature of the consumption-system approach is 
crossover effects, or the degree to which perceived perfor- 
mance in one part of the system (as with the product) 
affects behavioral intentions in another part of the system 
(as with the service) over time. Because our data come from 
the cellular phone provider and focus on loyalty intentions 


to the phone, we focus on capturing temporal effects rather 
than crossover effects in the consumption-systém approach. 
Our conceptual model incorporates the temporal effects of 
evaluations of value, brand equity, affective commitment, 
and loyalty intentions in one period on the same evaluations 
in a subsequent period. Because the model controls for tem- 
poral effects in the system, we gain a better understanding 
of the influence of perceived value, brands, and relation- 
ships on intentions. 


Research Hypotheses 


To summarize, we predict that perceived value has a more 
direct effect on intentions early in a life cycle. As the life 
cycle evolves, more affective attitudes, captured by the 
brand equity and affective commitment constructs in our 
model, should have greater influence. Moreover, the brand 
and relationship constructs should come to .mediate the 
effects of perceived value on intentions. By studying an 
evolving market, we are in a unique position to gain an 
understanding of how loyalty drivers grow and change as 
competition grows. 

We state these predictions formally as hypotheses. Our 
first hypothesis is related to the direct effect of perceived 
value on intentions over time. We expect that perceived per- 
formance has a direct effect on intentions that decreases 
over time as other factors (i.e., brands and relationships) 
come to influence intentions. 


H: The perceived value of an offering has a direct, positive 
effect on loyalty intentions that decreases as a market 


grows. 


As part of the mediation argument, we further predict that 
an offering's perceived value should build brand equity and 
affective commitment. 
H5: The perceived value of an offering has a direct, positive 
effect on brand equity that increases as a market grows. 
Ha: The perceived value of an offering has a direct, positive 
effect on affective commitment that increases as a market 
grows. 


As we argue in Hj, the direct effect of perceived value 
on loyalty intentions should decrease over time as the brand 
and relationship constructs grow and come to influence 
intentions. H4 and H; capture the increase in the effects of 
brand equity and relationship commitment on loyalty inten- 
tions that are consistent with the mediation argument: 

H4: Brand equity has a direct, positive effect on loyalty inten- 

tions that increases as a market grows. 


Hs: Affective commitment has a direct, positive effect on loy- 
alty intentions that increases as a market grows. 


Although not stated as formal hypotheses, consistent 
with Mittal, Kumar, and Tsiros's (1999) findings, we expect 
significant temporal effects in the consumption system. 
That is, perceived value, brand equity, affective commit- 
ment, and loyalty intentions in one period should affect the 
same construct in a subsequent period. All our predictions 
are time dependent. As a market continues to mature, the 
influence of perceived value in relation to brands and rela- 
tionships should stabilize. As we describe in the next sec- 
tion, because we estimate the model over three periods, 


there are a relatively large number of measures, constructs, 
and relationships involved. Given the possible number of 
alternative specifications, we do not posit and test any alter- 
native models per se. Rather, we implicitly examine alterna- 
tive specifications through our analysis of the size and sig- 
nificance of the path coefficients and how they change over 
time. We also conduct formal tests of mediation among our 
constructs (Baron and Kenny 1986). 


Empirical Study of the Cellular 
Phone Market 


We test the hypotheses using data from a commercial track- 
ing study conducted for a global supplier of mobile commu- 
nications in the German market. The study was fielded at 
three points in time that were approximately two years 
apart: September 1996, August 1998, and October 2000. 
Figure 2 shows the penetration rate over time for the cellu- 
lar phone market that was surveyed. The 1996 and 1998 
surveys coincide with the introductory and early growth 
phases of the market. By October 2000, the market had wit- 
nessed dramatic growth and an increase in competition. 
First-generation cellular phones were simply devices for 
making calls on a single network. The second-generation 
cellular phones, which we study here, represent a new-to- 
market offering in several respects, including the use of 
multiband technology (users are able to lock into multiple 
networks), short message system capability, a Web inter- 
face, a significantly lighter weight phone, longer battery 
life, and various service features. Several fundamental 
changes in the German cellular phone market stimulated the 
development and diffusion of this new offering. Before 
1989, Deutsche Telekom (DT) monopolized the market, 
which limited market penetration. From 1989 through 1998, 
there was a stepwise withdrawal of government influence 
on DT and a reduction in entry barriers for competitors. On 
August 1, 1996, the market was officially opened for any 
network provider. By 1998, four major network providers 


FIGURE 2 
Penetration Rate by Year for Cellular Phones in 
Germany 
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(DT, Mannesmann, E-Plus, and Viag Intercom) were collab- 
orating with the cellular phone hardware providers (Nokia, 
Samsung, Motorola, Siemens, and Sony-Ericsson) to pro- 
vide product/service offerings. Although the network and 
hardware providers were interchangeable, our data focus on 
the hardware provider. 

The original 1996 sample included 5486 respondents. 
The sample size was reduced because of panel mortality to 
4234 respondents in 1998 and to 3148 respondents in 2000. 
Another 248 respondents were removed from the sample 
because of high levels of missing data and outliers. This left 
a sample of 2990 respondents with usable responses in each 
period; each respondent owned a cellular phone and was 
interviewed in person. Of these customers, 1472 (49%) 
were loyal to the same brand of phone in all three periods. It 
is important for our analysis to focus on these brand-loyal 
customers for two reasons: First, theoretically, our goal is to 
capture the evolution of loyalty intentions for just such a 
group. Second, empirically, the carryover effects in the con- 
sumption system are more meaningful when the same brand 
is involved. The market shares of the different brands 
owned by the sample in each wave are similar to the shares 
of each brand in the marketplace. Of the 1472 loyal cus- 
tomers, 63% were male (37% were female), 60% had at 
least one child at home, and 31% had some university edu- 
cation. Of 2990 loyal and nonloyal customers, 68% were 
male, 48% had a child at home, and 30% had some univer- 
sity education. This indicates that the loyal customer sam- 
ple includes somewhat more females and at least one child 
at home. Income and age information is available only by 
ranges in the survey, but in each case, the distributions were 
almost identical. 

By definition, behavioral loyalty or retention is invariant 
across our test sample, making intentions the ultimate 
dependent variable in our model. However, these intentions 
may not be related to actual behavior (Verhoef 2003). A 
simple analysis examines whether our intentions are related 
to behavior. Beginning with our entire sample of 2990 
usable responses in Year 1 (1996), we extracted the first 
principal component from the loyalty intention measures. 
An analysis of variance reveals that the intentions of the 
loyal customers in our test sample were indeed higher in the 
first wave of the survey than were the intentions of those 
who subsequently switched (principal component values of 
.043 versus —.041; F = 5.285, p < .05). 

There are both advantages and disadvantages associated 
with using the same sample over time. The important theo- 
retical advantage is that time translates directly into accu- 
mulated customer experience, resulting in risk reduction for 
the brand-loyal customers in our test sample. The disadvan- 
tage is that we focus on what may be a very particular cus- 
tomer population, namely, technology upgraders or early 
adopters who readily embrace new technologies (Parasura- 
man 2000). Differences observed over time may be due to 
the loyal sample simply being different from other cus- 
tomers. Arguably, the advantages of using the sample out- 
weigh the disadvantages. First, our theoretical arguments 
rest on the assumption that customers remain loyal to a par- 
ticular cellular phone. Second, there is no readily available 
argument to suggest why the intention drivers would evolve 
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as predicted simply because this particular sample is 
involved. However, given the potentially unique nature of 
the sample, it is important that we focus on relative changes 
in intention drivers over time. 


Survey Measures 


The survey measures we used to estimate the conceptual 
model appear in Table 1. Respondents rated all measures on 
scales ranging from 1 ("completely disagree") to 7 (“com- 
pletely agree"). The perceived value measures include the 
performance received for the price paid, whether the offer- 
ing was a good deal compared with others in the market- 
place, whether the price was fair, and whether the offering 
was a great value overall. We measured brand equity, affec- 
tive commitment, and loyalty intentions using five survey 
measures each. The brand equity measures include whether 
the brand reflects customers' personal lifestyles, whether 
the brand fits their personality, and brand identification. The 
affective commitment measures include customers’ com- 
mitment to maintaining a relationship with the manufac- 
turer and personal interactions with the manufacturer (pro- 
vision of feedback, interactive events, and other dialogue 
with the manufacturer). Finally, the loyalty intentions 
include repurchase intentions and recommending the cellu- 
lar phone to others. 


Model Specification, Estimation, and Evaluation 


We specify all the constructs in the model using reflective 
indicators from Table 1 (Fornell and Cha 1994). The esti- 
mation method should be consistent with the complexity of 
the model, which in this case involves 12 latent constructs 
with carryover effects. Bagozzi and Yi (1994) and Fornell 
and Cha (1994) suggest that partial least squares (PLS) is 
well suited to the estimation of a complex structural equa- 
tion model. It integrates aspects of principal components 
analysis with multiple regression (Wold 1982). When 
reflective measures are involved, the procedure extracts the 
first principal component from each subset of measures for 
the various latent variables and uses these principal compo- 
nents within a system of regression models. The algorithm 
then adjusts the principal component weights to maximize 
the predictive power of the model. 

Typically, PLS models are evaluated on four key crite- 
ria: (1) the reliability of the constructs, (2) the discriminant 
validity of the constructs, (3) the size and significance of 
the path coefficients, and (4) the ability of the model to pre- 
dict, in this case loyalty intentions (Hulland 1999). Regard- 
ing construct reliability, the measurement loadings should 
exceed .707 to ensure that at least half of the variance in the 
observed variable is shared with the construct. This reliabil- 
ity criterion is referred to as communality or, in the case of 
the standardized results we report here, average variance 
extracted (AVE; see Fornell and Larcker 1981). Table 1 
reports the AVE from the PLS results for each construct for 
each of the three periods studied. For all 12 constructs in 
the model, AVE exceeds the reliability criterion. To evaluate 
the discriminant validity of the constructs, we examined 
whether the communalities or AVE measures for any two 
constructs that are related in the conceptual model exceed 
their squared correlations (Fornell and Larcker 1981). This 


TABLE 1 
Model Constructs, Survey Measures, and AVE by Time Period 





Construct Survey Measures AVE 

Perceived value 1. The celi phone is a good level of performance for the money I pay. t = 686 
2. The cell phone Is a good deal relative to other offers available in the market. t - 741 
3. The price of my cell phone is more than fair for the performance | receive. ^ — 649 
4. The cell phone is a great value. eus 

Brand equity 1. The brand reflects my personal lifestyle. 
2. The brand and my personality flt. t4 = .688 
3. | can Identify with the brand. to = .655 
4. If the brand were a person, | would like to take him or her out for dinner. t3 = .594 
5. | would like to wear clothing with the logo of my cell phone brand on it. 

Affective commitment 1.1 want to continue my relationship with the cell phone manufacturer. 
2. The cell phone manufacturer is interested in how | use my cell phone. t; = .634 
3. If the cell phone manufacturer were a person, | would like to have him or her as a friend. t» = .670 
4. | give feedback about my evaluations of the cell phone regularly. ts = .607 
5. Occasionally the cell phone dealer arranges events to show new products. 

Loyalty intentions 1. Next time | will definitely buy thls cell phone (or its successor) again. 
2. If | lose my cell phone | will definitely buy it again. t4 = .650 
3. If | got any cell phone for free, | would choose my cell phone. to = .640 
4. | recommend my cell phone to other people. t3 = .548 
5. | talk to other people about my cell phone. 


condition is satisfied in every case (not shown); thus, the 
discriminant validity of the constructs is also supported. 


Results 


The path coefficients for the overall model appear in Figure 
3. To test the statistical significance of the paths and prepare 
for subsequent formal tests of mediation, we used the mea- 
surement weights from the PLS model to operationalize the 
latent variables and rerun the final regression models. These 
models completely replicate the PLS results. We use the 
standard errors from these regressions to test for signifi- 
cance and significant differences in the paths over time. The 
only nonsignificant path in the overall model is the effect of 
affective commitment in t, on loyalty intentions in t. Given 
the nature of the hypotheses, the model results are best 
understood by first examining the significance of paths 
involving constructs in t, and then determining whether the 
effects in the model change significantly from period to 
period. These results appear in Table 2. ' 

The results for t, (early in the diffusion of the cellular 
phones) show how perceived value has the only significant, 
positive effect on loyalty intentions. Althoügh this per- 
ceived value makes some positive contribution to both 
affective commitment and brand equity, neither of these 
constructs makes a positive contribution to loyalty inten- 
tions. The unexpected results in this first period are that 
both affective commitment and brand equity have small 
negative effects on intentions, such that the negative effect 
of brand equity is significant. Consistent with Mittal, 
Kumar, and Tsiros's (1999) consumption-systems approach, 
all four carryover effects from t, to t? are significant (see 
Figure 3 and Table 2). The largest carryover effect is for 


perceived value (.263), followed by brand equity (.236), 
affective commitment (.173), and intentions (.167). 

Moving from t, to ty, the effect of perceived value on 
both affective commitment and brand equity increases. 
Important to the mediation argument, there is a significant 
decrease in the direct effect of value on intentions (from 
.356 to .302) combined with a significant increase in the 
effect of affective commitment on intentions (from —.024 to 
.188) and brand equity on intentions (from —.062 to .057; 
see Table 2). Consistent with the mediation argument, by t», 
the more affective attitudes in the model begin to mediate 
the effects of perceived value on intentions. Again, the 
carryover effects from t» to t4 are positive and significant, 
but note the changes that occur. The carryover effects for 
affective commitment (.087) and brand equity (.061) from 
t; to t4 are both significantly lower than the same effects 
from t, to ty. This suggests that both the commitment and 
the image constructs are evolving as more competitors enter 
the market. 

By t4, perceived value continues to build relationship 
commitment and brand image, though the impact on brands 
is lower from t; to t4. Importantly, the direct effect of per- 
ceived value on intentions continues to drop significantly 
(from .302 to .115), whereas the effects of both affective 
commitment and brand equity increase (for commitment, 
from .188 to .225; for brand, from .057 to .197). By ts, the 
effects of affective commitment and brand equity on inten- 
tions are each greater than the direct effect of perceived 
value on intentions. Again, this is consistent with the pre- 
diction that more affective attitudes come to mediate the 
effects of perceived value on intentions as the market 
evolves. The R2 for loyalty intentions is .124 for tj, .201 for 
tj, and .171 for t4. There are several reasons that the vari- 
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TABLE 2 
Changes In Path Coefficients 


— ——— st SSS 


Change Significant Change Significant 

Significant from Change? from Change? 
Path Coefficient t In t, t, to t (t, to to) t; to t4 (t; to t3) 
Value — commitment .059 Yes 143 Yes .038 No 
Value — brand .150 Yes .058 Yes —,052 Yes 
Value intentions .356 Yes —,054 Yes —,187 Yes 
Commitment intentions —.024 No 212 Yes .037 Yes 
Brand intentions —,062 Yes .119 Yes .140 Yes 
Current value — future value .263 Yes —.023 No — — 
Current commitment — future commitment 173 Yes —.086 Yes — — 
Current brand — future brand .236 Yes —.175 Yes — — 
Current intentions > future Intentions .167 Yes —.031 No — — 


Notes: Significance is based on p « .05. 


ance explained in loyalty intention is relatively low. This is 
a volatile period in the market's evolution, and we examine 
only the hardware side of the product/service bundle. We 
also expect that the variance explained increases as the mar- 
ket matures and buyers become loyal to particular phones 
and network providers. 


Additional Analyses and Tests 


We conducted three additional tests on the latent variable 
scores. First, we examined whether there were any system- 
atic changes in the levels of the perceived value, affective 
commitment, brand equity, or intention constructs using a 
series of analysis of variance models and time as a three- 
Jevel factor. Considering the large sample size involved, the 
differences were minimal. There were no significant differ- 
ences from year to year for perceived value, affective com- 
mitment, or loyalty intentions (cumulative averages of 
4.272, 4.231, and 4.257, respectively). The only difference 
from year to year was for brand equity, which increased sig- 
nificantly from t, (4.081) and t» (4.125) to t4 (4.252; F = 
6.908, p « .001). This is consistent with the increased 
impact of brand equity in the PLS results. Otherwise, the 
means are stable from period to period. 

Second, we examined whether the relationships involv- 
ing intentions were nonlinear, which could affect the inter- 
pretation of our model results. As a check, we used the 
latent variable scores to estimate linear and possible qua- 
dratic and cubic relationships between each of the three dri- 
vers of loyalty (perceived value, affective commitment, and 
brand equity) and loyalty intentions. In all three cases, only 
the linear relationship is significant; all the relationships in 
the model are essentially linear. An explanation for this 
finding is that our study focuses on the early stages of loy- 
alty formation. We suspect that nonlinearities are more 
likely to occur in more mature markets in which higher lev- 
els of trust and commitment have evolved (Mittal and 
Kamakura 2001). 

Third, we used Baron and Kenny's (1986) analysis to 
test for mediation among our latent constructs. With respect 
to our conceptual model, there are six separate sets of 


regression models used to test for mediation. For each year, 
we estimate the effect of perceived value on affective com- 
mitment (brand equity), the effect of perceived value on 
intentions, and the effect of affective commitment (brand 
equity) on intentions. Assuming that all relationships are 
significant, we then regress both perceived value and affec- 
tive commitment (brand equity) together on intentions. 
Mediation is supported if the effect of value on intentions 
drops significantly. Given the number of equations 
involved, we summarize the test results here. In t;, there is 
no support for mediation involving either affective commit- 
ment or brand equity. In t», the tests support partial media- 
tion for the relationship from perceived value, to affective 
commitment, to intentions but not for the relationship from 
value, to brand equity, to intentions. By t5, the tests support 
partial mediation for both sets of relationships. In general, 
these results are consistent with the PLS analysis. 


Results Summary 


We summarize our findings in terms of our hypotheses. In 
support of H,, we find that perceived value has a positive 
direct effect on loyalty intentions that decreases over time. 
In support of H5, value has positive effects on affective 
commitment that grow over time. The support for H3 is 
mixed. Although the positive effect of value on brand equity 
grows from t, to t», it declines again in t4. In each year, 
however, the effect is significantly positive. Affective com- 
mitment has no significant effect on intentions in t;, but the 
effect becomes progressively positive over time. Likewise, 
brand equity has a small negative effect on intention in t}, 
which becomes progressively positive over time. Thus, H4 
and Hs are partially supported. When we combine the 
analysis of path coefficients with the formal tests of media- 
tion, there is robust support for the prediction that intentions 
are predominantly value driven early in a life cycle and are 
increasingly mediated by attitudes toward the brand and 
relationship over time. Finally, consistent with Mittal, 
Kumar, and Tsiros's (1999) consumption-system frame- 
Work, our results support significant carryover effects from 
year to year for all our latent variables. 
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Discussion and Implications 


Recent dynamic models in marketing confirm that the dri- 
vers of loyalty intentions evolve over time. Yet these studies 
are limited to relatively mature product and service cate- 
gories. Our study provides two important contributions to 
the understanding of the evolution of customer evaluations, 
attitudes, and intentions. First, we demonstrate how the dri- 
vers of intentions evolve for a panel of loyal customers who 
purchased and used a new-to-market offering, cellular 
phone technology, through the introduction and growth 
phases of a life cycle. Second, we demonstrate how two 
constructs that have not been included in previous studies, 
affective commitment and brand equity, come to mediate 
the effects of perceived value on intentions over time. 

We find that cognitive perceptions of overall value drive 
loyalty intentions early in the diffusion process. As the mar- 
ket grows and customer experience accumulates, more 
affective attitudes toward the relationship and brand come 
to drive intentions. Our predictions and findings are consis- 
tent with attitude theory; performance beliefs have more 
direct effects on behavioral intentions when experience is 
low, whereas attitudes come to mediate the effects of per- 
formance beliefs on intentions as experience increases. An 
important theoretical implication is that affective commit- 
ment and brand equity are effective ways to operationalize 
the affect-based mediators of perceived value on intentions. 

Our findings both replicate and extend our understand- 
ing of consumption systems (Mittal, Kumar and Tsiros 
1999). Consistent with previous studies, we find that perfor- 
mance perceptions (perceived value) and intentions in one 
period serve as anchors for the same perceptions and inten- 
tions in subsequent periods. Building on previous studies, 
we introduce two new constructs to the consumption sys- 
tem, affective commitment and brand equity. One important 
finding is that both the commitment and the brand equity 
constructs exhibit significant carryover effects. Another 
finding is that, when we control for carryover effects, affec- 
tive commitment and brand equity come to mediate the 
effects of perceived performance on intentions over time. 

The results have important implications for managers 
who are responsible for improving perceived value and 
managers who are responsible for building brands and rela- 
tionships. Early in the introduction and growth of an offer- 
ing, managers should maintain a focus on improving value. 
With a focus on value creation, a foundation is laid for 
building brands and relationships. As a market grows, both 
brands and relationships become more important. Notably, 
brands and relationships had small negative impacts on loy- 
alty intentions in the first phase of our study, perhaps 
because our sample comprised technology upgraders or 
early adopters. As the market grew to include a variety of 
competitors, the influence of brands and relationships on 
intentions became positive and eventually exceeded the 
direct, positive influence of perceived value. 

With respect to relationship commitment, the implica- 
tion is that the personal relationship between a customer 
and a company should be measured, nurtured, and managed 
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effectively (Bendapudi and Leone 2002) through a com- 
pany’s customer relationship management system. For 
brand equity, the implication is that the degree to which 
customers identify with a particular brand and find it rele- 
vant to their situation should also be measured and managed 
effectively. Our results provide insight into how much time 
it takes for brands and relationships to influence intentions. 
Within a period of two years, affective commitment gains a 
sizable influence on intentions. Within a period of four 
years, both commitment and brand equity have direct 
effects on intentions that exceed the direct effect of per- 
ceived value. Note, however, that perceived value continues 
to influence loyalty through the mediating effects of the 
brand and relationship. 

In other words, managers must recognize how life 
inside a cellular phone or the technology per se is replaced 
by life outside the cellular phone over time. Early in the dif- 
fusion process, technology determines the product policy of 
cellular phone manufacturers. As the technology becomes 
accepted, other forms of differentiation, such as colors and 
design, become increasingly important. Parallel to this, 
whether these designs, colors, and associated brand names 
fit a customer’s own personality become critical. Adaptation 
to this evolution requires shifting from a focus on how a 
technology serves a need to a focus on other types of mar- 
keting expertise, specifically the building of brands and 
relationships. Several factors should affect this shift. The 
first factor is customer involvement; the more customers are 
involved in a particular product category, the more they 
seek brand and relationship benefits when the product ben- 
efits become standard. The second factor is competition; 
more competitive markets have more players that attempt to 
differentiate themselves. The third factor is core product 
Standardization; the more inherently homogeneous the 
product, the more companies must focus on brands and 
relationships as a source of differentiation. 

The most important contribution of this study is that it 
improves the basic understanding of how perceived value, 
brands, and relationships come to influence one another and 
loyalty intentions over the introduction and growth phases 
of a life cycle. A limitation of our study is that we focus on 
the cellular phone rather than the entire product/service 
bundle. An important reason for this is that the survey is 
from the perspective of the cellular phone manufacturer in 
an environment in which cellular phone brands and service 
providers are interchangeable. Another limitation is that our 
panel is limited to consumers who enter a market early and 
remain loyal to a particular brand. The results should also 
be replicated in other contexts before generalizations can 
emerge. It will be particularly important to follow the cate- 
gory forward as it reaches maturity to help understand how 
the drivers of both intentions and behavior stabilize. Finally, 
our measures of relationship commitment focus on affective 
commitment. Because the market for cellular phones 
includes significant switching barriers, it is important to 
include calculative commitment when further testing the 
model. 
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Converting Web Site Visitors into 
Buyers: How Web Site Investment 
Increases Consumer Trusting Beliefs 
and Online Purchase Intentions 


The authors investigate the impact of Web site design investments on consumers’ trusting beliefs and online pur- 
chase intentions. Such investments signal the component of trusting beliefs that is most strongly related to online 
purchase intentions: ability. These effects were strongest when consumers' goals were to search rather than to 


browse and when purchases involved risk. 


of faith. We type in our credit card numbers and 

other personal information in order to make pur- 
chases over the Internet and trust that this information will 
not be used in unauthorized or fraudulent ways" (Bargh and 
McKenna 2004, p. 585). Firms have responded to such con- 
sumer concerns by investing in Web site security, which has 
become a multibillion dollar industry (eMarketer 2005). Yet 
even experienced online buyers view purchasing online as 
risky (eMarketer.com 2005; Forrester 2005).: Indeed, con- 
sumers' perceptions of the risks involved in providing per- 
sonal information online often contrast the views of security 
experts, causing these consumers to avoid online activities 
that are actually safe (Dunn 2004). This avoidance has led 
some experts to speculate that the immediate threat to e- 
commerce is consumers' perceptions (eMarketer 2005; 
Hoffman, Novak, and Peralta 1999; Rust, Kannan, and Peng 
2002). Thus, although it is necessary, investing in back-end 
technology to protect consumers' information and ensure 
that e-commerce transactions run smoothly is not sufficient. 
Firms, particularly those attempting to convert visitors to 
buyers, still face the challenge of establishing consumers' 
trust online. Thus, it is important to understand how compa- 
nies can communicate their trustworthiness to consumers. 


6 É n important ways, using the Internet involves a leap 
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Marketing managers face the challenge of establishing 
consumers’ trust in a variety of contexts, but doing so in 
computer-mediated environments such as the Internet may 
be particularly difficult (Naquin and Paulson 2003). A com- 
mon approach is to post explicit statements that assure cus- 
tomers that personal data will be discreetly used and pro- 
tected (1.e., privacy and security statements). However, 
evidence on the effectiveness of such statements is contra- 
dictory. Whereas some research has shown that such state- 
ments help instill consumer confidence in e-commerce sites 
(Palmer, Bailey, and Faraj 2000), others suggest that they 
are not necessarily the most important predictor of online 
trust (Montoya-Weiss, Voss, and Grewal 2003; Sultan et al. 
2002). Findings from a recent large-scale study suggest that 
despite consumers' claims that privacy policies are impor- 
tant for establishing credibility, consumers refer instead to 
"surface" elements, such as Web site design (Fogg et al. 
2002). 

In this article, we develop a conceptual framework for 
understanding how marketing signals influence consumers’ 
trust in an e-commerce setting. We clarify important dis- 
tinctions related to trust that have been largely overlooked 
in the literature but are key to understanding how online 
firms can best convert visitors to buyers. Specifically, we 
argue that different Web site signals can influence different 
beliefs about a firm's trustworthiness, which in turn have 
differential effects on online purchase intentions. Further- 
more, these effects might vary according to the consumers' 
purpose for visiting the site and the level of risk they per- 
ceive in the purchase. Consistent with prior research (e.g., 
McKnight, Choudhury, and Kacmar 2002), we conceptual- 
ize online purchase intentions in terms of customer acquisi- 
tion—that is, consumers’ intentions to make an initial 
online purchase from a firm, despite their online purchase 
history with other firms. We begin with a review of the trust 
literature. We then present and test our conceptual frame- 
work and conclude with a discussion of the theoretical, 
managerial, and public policy implications of these 
findings. 


Journal of Marketing 
Vol. 70 (Apri! 2006), 133—148 


Trust 


Trust has been defined as “a willingness to rely on an 
exchange partner in whom one has confidence” (Moorman, 
Zaltman, and Deshpandé 1992, p. 315); “a generalized 
expectancy held by an individual that the word, promise, 
oral or written statement of another individual or group can 
be relied upon” (Rotter 1980, p. 1); and “a belief in a per- 
son’s competence to perform a specific task under specific 
circumstances” (Sitkin and Roth 1993, p. 373). Reflected in 
these and other definitions of trust is a cognitive aspect (i.e., 
trusting beliefs) and a behavioral aspect (i.e., trusting inten- 
tions) (Kim et al. 2004; Moorman, Zaltman, and Deshpandé 
1992). 

Trusting beliefs represent a “sentiment, or expectation 
about an exchange partner's trustworthiness" (Moorman, 
Deshpandé, and Zaltman 1993, p. 315). Although various 
trusting beliefs have been studied in the literature, the 
majority can be conceptually clustered into three dimen- 
sions: ability, benevolence, and integrity (McKnight, 
Choudhury, and Kacmar 2002). "Ability beliefs" reflect 
consumers' confidence that the firm has the skills necessary 
to perform the job (Mayer, Davis, and Schoorman 1995), 
“benevolence beliefs" reflect confidence that the firm has a 
positive orientation toward its consumers beyond an “ego- 
centric profit motive" (Mayer, Davis, and Schoorman 1995, 
p. 717), and "integrity beliefs" reflect confidence that the 
firm adheres to a set of moral principles or professional 
standards that guide its interactions with customers. These 
trusting beliefs are related, yet distinct. For example, con- 
sumers may believe that the firm cares about its customers 
and thus intends to deliver a smooth, error-free transaction 
(i.e., the firm is benevolent), but they may also believe that 
the firm lacks the ability to do so. Likewise, although 
integrity and benevolence beliefs are similar, the former 
focuses on meeting objective standards of corporate citizen- 
ship, and the latter focuses on customer welfare that goes 
beyond normal business activity. For example, despite con- 
sumers' beliefs that the firm follows a professional code of 
conduct (i.e., has integrity), they may still question the 
firm's genuine concern for its customers (i.e., its 
benevolence). 

. Although ability, benevolence, and integrity beliefs are 
acknowledged as conceptually distinct (e.g., Kumar, Scheer, 
and Steenkamp 1995), they are often combined into a 
global measure of trusting beliefs (e.g., Doney and Cannon 
1997). Whereas combining these beliefs into a single varia- 
ble is a parsimonious approach to studying trust, it can 
make it difficult to identify what action should be taken to 
build trust (Smith and Barclay 1997). Because a global 
measure likely obscures the reason certain signals are more 
effective than others in affecting online purchase intentions, 
we treat each trusting belief separately. 

Trusting intentions represent "a willingness to make 
oneself vulnerable to another in the presence of risk" (Kim 
et al. 2004, p. 105). What distinguishes trusting intentions 
from other types of behavioral intentions is that they 
involve risk (Moorman, Zaltman, and Deshpandé 1992). As 
reflected in the opening quotation, purchasing online 
involves risk, especially when a person lacks experience 
with the online firm. Specifically, the consumer must be 
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willing to transfer resources (e.g., credit card and other per- 
sonal information) to the online firm, the consequences of 
which could be damaging. For example, among the real 
and/or perceived risks are that the firm may overcharge, fail 
to deliver the product, deliver an inferior product, or fail to 
protect personal information. To the extent that consumers 
are concerned about these and other risks of purchasing 
online, online purchase intentions reflect trusting intentions. 

The distinction between trusting beliefs and trusting 
intentions has been acknowledged by some researchers 
(e.g., Moorman, Zaltman, and Deshpandé 1992; Sirdesh- 
mukh, Singh, and Sabo] 2002) but ignored by others who 
have studied only trusting beliefs, implicitly assuming that 
these beliefs imply trust (e.g., Doney and Cannon 1997; 
Ganeson 1994; Kumar, Scheer, and Steenkamp 1995; 
Mayer, Davis, and Schoorman 1995; Morgan and Hunt 
1994). For example, Morgan and Hunt (1994) argue that 
trusting beliefs are sufficient for measuring trust because 
such beliefs imply that trusting intentions will follow. In 
contrast, Moorman, Zaltman, and Deshpandé (1992) argue 
that trust is limited when trusting beliefs do not accompany 
a corresponding trusting intention or when trusting inten- 
tions occur without corresponding trusting beliefs (e.g., 
under conditions of coercion or limited alternatives). In 
other words, these researchers argue that both trusting 
beliefs and trusting intentions must be present for trust to 
exist. Likewise, we argue that trusting beliefs are a neces- 
sary but not sufficient condition for trust to exist, because 
increasing trusting beliefs will not always have a corre- 
sponding positive effect on trusting intentions. 


Conceptual Framework 


Drawing from research on trust (Mayer, Davis, and Schoor- 
man 1995; Moorman, Deshpandé, and Zaltman 1993), con- 
sumer goals (e.g., Hoffman and Novak 1996), and market- 
ing signals (e.g., Kirmani and Wright 1989; Prabhu and 
Stewart 2001), we develop a conceptual framework for 
understanding how different signals influence ability, 
benevolence, and integrity beliefs and thus influence online 
purchase intentions (see Figure 1). Because our objective is 
to understand how to increase consumers' willingness to 
buy online, we focus on those whose goal is most consistent 
with buying online, namely, consumers who search for 
product information (or searchers; Hoffman and Novak 
1996; Moe 2003; Schlosser 2003). Indeed, searchers think 
about and are persuaded more by product information 
(Schlosser 2003) and have higher visitor-to-buyer conver- 
sion rates than those who do not search (Moe 2003). We 
begin by examining the relationship between searchers' 
trusting beliefs and intentions. 


Searchers' Trusting Bellefs and Online Purchase 
Intentions 


Searching reflects purposive, task-specific behaviors, such 
as the planned acquisition of information during prepur- 
chase deliberation (Hoffman and Novak 1996; Janiszewski 
1998). Similar to those who read a text to find an answer to 
a question (Rosenblatt 1978), searchers are likely motivated 
to find the nght answer efficiently. Such a fact-gathering, 


i FIGURE 1 
Conceptual Framework of the Effect of Online Signals on Trusting Beliefs and Intentions 
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knowledge-seeking stance is typically outcome oriented, 
concentrated, impersonal, and objective (Rosenblatt 1978). 
Given this performance orientation, we expect that when 
considering whether to purchase online, searchers will 
focus on the trusting belief that is most relevant to perfor- 
mance: ability (Mayer, Davis, and Schoorman 1995). If this 
is the case, searchers' ability beliefs should largely influ- 
ence their online purchase intentions. In contrast, their 
beliefs about the firm's trustworthiness on non- 
performance-related dimensions (i.e., benevolence and 
integrity) should have relatively little effect on their online 
purchase intentions. 


H,: Searchers’ online purchase intentions depend on their 
beliefs about the firm's ability rather than their beliefs 
about the firm's benevolence or integrity. 


Signaling Ability Through Web Site Investment 

If trust in a firm’s abilities is critical to increasing online 
purchase intentions, a fundamental question is, How can 
firms use online cues to communicate that their abilities can 
be trusted? To address this question, we draw from research 
on marketing signals. Signals are the actions or announce- 
ments that convey a firm’s abilities and intentions (Porter 
1980). Marketers often use observable signals (e.g., price, 
warranties, advertising expenditures) to communicate the 
level of some unobservable quality (e.g., product quality; 
Kirmani and Rao 2000). Signaling may be especially 
important in an online purchasing context because of the 
inherent asymmetry of relevant information between buyers 
and sellers. Specifically, information about the quality of a 
given online transaction is generally unobservable by con- 
sumers before purchase. 


Investing in Web site design may be one observable sig- 
nal that firms can use to communicate their abilities and 
boost searchers’ online purchase intentions. Indeed, con- 
sumers can distinguish between expensive and inexpensive 
marketing tactics, such as ad production elements (Kirmani 
and Wright 1989). Moreover, they make inferences about 
companies (e.g., the company’s ability to make quality 
products and its credibility) based on these perceived mar- 
keting expenditures (for a review, see Kirmani and Rao 
2000). The attribution literature provides insight into these 
findings (Kirmani and Wright 1989). Specifically, when 
interpreting others’ performance, people infer that investing 
time and energy promotes success (Weiner 1986). Likewise, 
consumers will likely infer that a firm that has invested in 
Web site design can successfully handle online transactions. 

As in prior research, we consider investment in broad 
terms (Kirmani and Wright 1989); that is, investment 
reflects expenditures of time, money, and effort to Web site 
design. Importantly, it refers to investments in the front-end 
(design) elements of a Web site (i.e., its observable charac- 
teristics) and not back-end technologies, such as order ful- 
fillment software, security encryption, and firewall capabil- 
ities, which are typically unobservable before purchase. Yet 
because ability is a stable, internal characteristic (Weiner 
1972), people will likely generalize their trust in a firm’s 
ability in one area (design) to other related areas (e.g., order 
fulfillment). Thus, instead of being purely cosmetic, Web 
site design likely communicates important performance 
information. However, it likely communicates less about the 
firm’s goodwill, ethics, values, or intentions to mislead than 


it does about the firm’s ability.! Consequently, consumers 


lThis assumes conditions of normality. Because a poorly 
designed Web site (e.g., one with hyperlinks that do not work) 
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will likely use perceived Web site investment to infer a 
firm’s ability more than its benevolence or integrity. If this 
is the case and ability beliefs largely influence searchers’ 
online purchase intentions (Hi), it follows that 
Hz: Searchers’ online purchase intentions will be higher at a 
high-investment Web site than at a Jow-investment Web 
site. 
H4: Beliefs about the firm's ability will mediate the relation- 


ship between site 1nvestment and searchers' online pur- 
chase intentions. 


The first two studies test these hypotheses, and the last 
two studies examine two potential moderators (goal and 
perceived risk). Furthermore, to test the generalizability of 
these effects, we varied across studies the samples recruited 
(students versus nonstudents), companies (a fictitious ver- 
sus well-known firm), and products (home furnishings and 
accessories versus cameras). 


Study 1 


Method 


Sample and design. The sample consisted of 111 
respondents who participated in exchange for $10 and were 
recruited through an electronic and a printed newsletter dis- 
tributed to university employees. The sample was 6896 
female, with a mean age of 37.5 and a mean income of 
$35,000 to $49,999. Respondents had a median education 
of four years of college and used the Internet an average of 
four to six times per week. We randomly assigned respon- 
dents to a high- or low-investment site. 

Web site investment manipulation. We manipulated Web 
site investment through the presence of sophisticated Web 
site technology and visual design elements. Specifically, the 
high-investment site had a white background, sophisticated 
fonts (images for the navigation bar; Garamond font), and 
an enhanced zoom feature created with Design Within 
Reach. This enabled users to zoom in on any part of an 
image and to choose among three preselected zooms that 
executed automatically with a single click. In contrast, the 
low-investment site featured the default background color 
and font (gray; Times New Roman) and a limited zoom fea- 
ture, which, when clicked, simply provided a larger view of 
the focal product. The content and layout of both sites were 
identical. 

To test the effectiveness of this manipulation, 43 under- 
graduates viewed screen captures of the homepage (font 
and background color were used to manipulate investment) 
and a zoom page (technology was used to manipulate 
investment). The order was counterbalanced, and partici- 
pants viewed only those pages specific to the high- or low- 
investment site. After viewing each page, participants 
reported how much time, effort, and money they believed 
the firm invested in each page on a scale from 1 (“very lit- 
tle") to 7 (“a great deal"). We averaged the responses to 
these items (as > .94) and analyzed them with a 2 (invest- 
ment: high versus low) x 2 (page: zoom versus home) x 2 


likely violates consumers' expectations about norms of conduct 
for e-commerce firms, it will likely raise concerns about the firm's 
benevolence and integrity. 
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(order: viewed home or zoom page first) analysis of vari- 
ance (ANOVA). In support of the investment manipulation, 
participants perceived the high-investment site as requiring 
greater investment than the low-investment site (Ms — 4.09 
versus 3.18; F(1, 39) = 9.02, p « .01). Furthermore, the 
effectiveness of the investment manipulation was unaffected 
by the type or order of page viewed (Fs(1, 39) « 3.35, not 
significant [n.s.]). Thus, it appears that font, background 
color, and use of technology communicate investment. Par- 
ticipants also reported how informative, entertaining, and 
well organized they found the pages to be on the same 
seven-point scale. As we expected, investment did not affect 
these variables (Fs(1, 39) « 1.97, n.s.). 


Procedure. Participants sat at a computer and received a 
paper booklet. The first page instructed participants that 
they would be visiting a site for a new firm called Urban- 
Furniture (UF) and to limit their visit to the homepage and 
living-room sections of the site. All participants were told to 
imagine that they wanted to purchase contemporary furni- 
ture and were considering UF. As in prior research 
(Schlosser 2003), to instill a searching goal, participants 
were asked to write down two questions they had for UF 
about its products before visiting the site. 

Participants then visited either the high- or the low- 
investment site, which was preloaded on their computer. 
Afterward, they completed the survey, which contained 
three items that measured their online purchase intentions 
(œ = .91) and a modification of Mayer and Davis's (1999) 
scale of trustworthiness (for the measures used in this and 
the other studies, see the Appendix). This scale measured 
beliefs about UF's ability (a = .90), benevolence (a = .88), 
and integrity (a = .71). Then, to test the effectiveness of the 
investment manipulation in the main experiment, partici- 
pants completed the three-item Web site investment scale 
(see the Appendix; & = .95). 

At the end of the survey, participants reported their edu- 
cation and income levels as well as how often they used the 
Internet to purchase goods in the last six months on a scale 
ranging from 1 (“not at all") to 7 (“quite often"). We 
included these measures to control for individual differ- 
ences in Internet experience both directly (i.e., through self- 
reported use of the Web) and indirectly (1.e., using demo- 
graphic variables associated with Web use). To control for 
variance due to mechanical error, we asked participants to 
report the extent to which they encountered problems at the 
UF site (e.g., error messages, server delays, crashing) on a 
scale from 1 (“not at all") to 7 (“quite often"). 


Results 


Manipulation checks. We analyzed the investment 
manipulation with a one-way analysis of covariance 
(ANCOVA), controlling for reported problems with the site, 
prior Web purchase history, education, and income. In sup- 
port of the manipulation, perceived Web site investment was 
higher among those in the high-investment condition than 
among those in the low-investment condition (Ms = 3.46 
versus 1.97; F(1, 108) 2 32.45, p « .0001). 


Trusting beliefs and online purchase intentions. We 
used hierarchical regression to test H,. We first modeled 


, online purchase intentions as a function of ability beliefs. 


As we predicted, ability significantly influenced these 
intentions (B = .27, t(108) = 2.88, p < .005; R? = .07, 
F(1, 108) = 8.31, p < .01; see Table 1). Furthermore, the 
addition of benevolence and integrity beliefs to the model 
did not contribute significantly to explaining searchers’ 
online purchase intentions (AR? = .007, F(2, 106) < 1). We 
replicated this pattern of results with a stepwise regression 
analysis, which identifies the best subset of belief variables 
that predict online purchase intentions. Thus, regardless of 
the regression procedures we used, the results are consistent 
with H,. Furthermore, given the inherent correlation 
between trusting beliefs, we tested for multicollinearity by 
examining the maximum variance inflation factor (VIF). 
Multicollinearity is problematic for interpreting regression 
analyses when the maximum VIF is greater than 10 (Neter, 
Wasserman, and Kutner 1990). We found that multi- 
collinearity was not an issue in the preceding analysis (the 
maximum VIF = 3.10). 


Web site investment, trusting beliefs, and. online pur- 
chase intentions. We analyzed ability beliefs with a one- 
way ANCOVA. As we expected, ability beliefs were higher 
for those who visited the high-investment site than for those 
who visited the low-investment site (Ms = 2.41 versus 1.80; 
F(1, 109) = 13.27, p < .005). In contrast, Web site invest- 
ment did not affect benevolence or integrity beliefs 
(Fs(1, 109) « 2.00, n.s.). Thus, Web site investment is more 
effective in communicating the trustworthiness of a firm's 
ability than its benevolence and integrity. 

In support of H», online purchase intentions were higher 
among those who visited the high-investment site than 
among those who visited the low-investment site (Ms — 
—.13 versus —.78; F(1, 109) = 4.06, p < .05). To test whether 
ability beliefs mediated this effect (H3), we added ability 
beliefs as a covariate to the ANCOVA. Consistent with the 
requirements for mediation (Baron and Kenny 1986), abil- 
ity was significant (F(1, 108) = 4.31, p « .05), and the 
investment effect became nonsignificant (F(1, 108) = 1.48, 


n.s.). We found further support for H; using the criteria that 
Sobel (1982) endorsed for testing mediation (Goodman I 
test statistic — 2.22; p « .05). Because investment did not 
affect benevolence and integrity beliefs, these beliefs cannot 
be considered mediators. 


Conclusions 


Study 1 provides support for H,—-H3. Specifically, 
searchers’ online purchase intentions were influenced by 
their trust in the firm’s ability rather than their trust in its 
benevolence and integrity. As a result, ability signals (i.e., 
Web site investment) influenced tbeir online purchase inten- 
tions. Furthermore, their trust in the firm's ability mediated 
this effect. 

It is possible that searchers' online purchase intentions 
were influenced by their ability beliefs rather than by their 
benevolence and integrity beliefs because there were no sig- 
nals regarding the firm's benevolence and integrity. In the 
presence of such signals, ability beliefs may have less influ- 
ence. Indeed, the impact of a given signal is weakened 
when other, more relevant signals are present (Kirmani and 
Wright 1989). If benevolence and integrity signals are more 
relevant, the effect of investment on searchers' online pur- 
chase intentions should be weaker when such signals are 
present. However, if ability is more relevant, investment 
should affect searchers' online purchase intentions regard- 
less of whether benevolence and integrity signals are pre- 
sent. We directly test this in Study 2 by manipulating the 
presence and strength of a firm's privacy and security state- 
ment. Because benevolence represents the firm's orientation 
toward customers and integrity represents whether the firm 
will do what it promises (Mayer, Davis, and Schoorman 
1995), one method of signaling a firm's benevolence and 
integrity may be through formal statements of its intentions 
to consumers, such as through privacy and security state- 
ments. If so, such statements should affect consumers' 
beliefs about a firm's benevolence and integrity rather than 
its ability. Yet if ability is a stronger driver of searchers' 
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online purchase intentions (H4), investment should affect 
searchers' online purchase intentions regardless of whether 
a privacy and security statement is present. That is, H} and 
H4 should be supported even when such a statement is 
provided. 


Study 2 


Method 


Sample and design. A total of 79 undergraduate students 
participated in exchange for extra course credit. We ran- 
domly assigned participants to one of six conditions in a 2 
(investment) x 3 (privacy/security statement: strong or weak 
versus absent) design.2 We included the absent condition to 
examine whether the presence of a strong statement is better 
than having no statement. We included the weak condition 
to examine whether merely having a privacy/security state- 
ment might signal benevolence and integrity or whether the 
contents of the statement influence such beliefs. 


Privacy/security statement manipulation. We con- 
structed the strong and weak privacy/security statements on 
the basis of a content analysis of the privacy/security state- 
ments gathered from more than 25 sites. For the strong 
statement, explicit information was available about how UF 
collects and uses customer information. There was also a 
promise of confidentiality, a contact number, an opt-in fea- 
ture, encryption information, and a 10096 guarantee against 
information theft. In contrast, the weak statement informed 
the consumer that personal information would be collected 
and made available to other vendors that "are offering prod- 
ucts we feel are of interest to you." There was no opportu- 
nity to opt in or out of such correspondence. Consumers 
were also informed that UF "tries to safely transmit your 
account information." No account protection or guarantee 
was offered. 

To pretest this manipulation, 37 undergraduates read 
either the strong or the weak statement and rated its strength 
on six items. Among the items were "I believe Urban- 
Furniture is concerned about my privacy" and "I believe 
that Urban-Furniture is concerned about the security of my 
financial information." Participants responded on a scale 
ranging from 1 (“disagree strongly") to 7 ("agree 
strongly"), and we averaged the responses (o = .94). In sup- 
port of the privacy/security manipulation, participants 
agreed more that UF would preserve their privacy and secu- 
rity when they read the strong privacy/security statement 
than when they read the weak statement (Ms = 5.56 versus 
2.61; F(1, 36) = 100.53, p < .01). 


Procedure. The procedure and survey were the same as 
in Study 1, with a few exceptions. To increase involvement, 
participants were told to imagine that they accepted a job in 
Manhattan after graduation and were searching for living- 
room furniture for their apartment. After viewing the site, 
participants in the strong and weak privacy/security condi- 
tions read these statements before completing the survey. To 


2Our analysis was limited to either strong or weak privacy and 
security statements. Thus, for ease of exposition, we use the term 
“privacy/security” to describe levels of privacy and security. 
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test whether reading such statements might artificially 
increase participants' risk perceptions of shopping online, 
we added seven items to the end of the survey that mea- 
sured such concerns (see the Appendix). We averaged 
responses to provide a perceived risk score (œ = .89). 
Because the sample is homogeneous in terms of education 
and income, we deleted these demographic questions. In 
addition, because we measured online purchase experience 
in Study 1 with a single item that captures only recent 
online purchase experience, we added an item that mea- 
sures general online purchase experience (i.e., how often 
participants shop online) on a scale from 1 ("not at all") to 7 
(“quite often"). We averaged these items to capture online 
purchase experience (r = .78). 


Results 


Manipulation checks. In support of the investment 
manipulation, a 2 (investment) x 3 (privacy/security state- 
ment) ANOVA yielded a significant investment effect 
(F(1, 73) = 6.84, p = .01): Participants perceived the high- 
investment site as requiring a greater investment than the 
low-investment site (Ms — 3.36 versus 2.48). No other 
effects were significant (Fs(1, 73) « 1). 

To examine whether reading a privacy/security state- 
ment might increase participants' perceived risks of shop- 
ping online, we compared risk perceptions using a 2 x 3 
ANOVA. None of the effects were significant (the invest- 
ment effect: F(1, 73) = 1.22, n.s.; the direct and interactive 
effects of the privacy/security statement: Fs(2, 73) « 2.11, 
n.s.). On average, participants perceived buying online as 
risky (M = 5.71, which is significantly higher than the mid- 
point of 4; higher numbers reflect greater perceived risk, 
t(78) = 14.16, p < .01). 

Trusting beliefs and online purchase intentions. As in 
Study 1, we tested H, using hierarchical regression. Às we 
predicted, ability beliefs influenced searchers' online pur- 
chase intentions (B = .39, t(77) = 3.73, p < .005; R? = .15, 
F(1, 77) = 15.57, p « .01, see Table 1). Furthermore, the 
addition of benevolence and integrity beliefs to the model 
did not significantly contribute to explaining online pur- 
chase intentions (AR? = .002, F(2, 75) « 1). A stepwise 
regression analysis yielded the same results. Multicollinear- 
ity was not a problem (the maximum VIF = 2.58). 


Web site investment and trusting beliefs. Here and else- 
where, we analyzed the data with a 2 (investment) x 3 
(privacy/security statement) ANCOVA, controlling for 
online purchase experience and problems with the site. 
Ability beliefs were higher for those who visited the high- 
investment site than for those who visited the low- 
investment site (Ms = 2.68 versus 2.13; F(1, 69) = 6.54, p « 
.05), thus replicating the results of Study 1. No other effects 
on ability beliefs were significant (Fs(1, 69) < 2.34, n.s.). 
Furthermore, as we expected, Web site investment did not 
affect benevolence or integrity beliefs (Fs(1, 68) « 2.29, 
n.S.). 

Whereas investment appears to communicate a firm's 
ability but not its benevolence and integrity, privacy/ 
security statements appear to communicate benevolence 
and integrity but not ability: The privacy/security effect was 
significant for benevolence beliefs (F(2, 69) = 3.30, p « .05) 


and integrity beliefs (F(2, 69) = 4.03, p < .05) but not for 
ability beliefs (F(2, 69) = 2.34, n.s.). Benevolence and 
integrity beliefs were significantly higher among those who 
read a strong statement than among those who read a weak 
statement (benevolence: Ms = 3.04 versus 2.65; F(2, 68) = 
2.89, p < .05 [one-tailed test]; integrity: Ms = 3.19 versus 
2.66; F(2, 68) = 7.12, p < .05). These beliefs were also 
higher for those who received a strong statement than for 
those who received no statement (benevolence: Ms = 3.04 
versus 2.44; F(2, 68) = 6.55, p < .05; integrity: Ms = 3.19 
versus 2.76; F(2, 68) = 5.02, p < .05). However, the mere 
presence of a privacy/security statement does not appear to 
signal a firm’s benevolence and integrity; the difference 
between a weak statement and no statement was not signifi- 
cant at p < .05. Jt appears that both the presence and the 
strength of the statement signal the firm’s benevolence and 
integrity. 

Web site investment and online purchase intentions. We 
analyzed online purchase intentions with a 2 x 3 ANCOVA. 
In support of H», online purchase intentions were higher 
among those who visited the high-investment site than 
among those who visited the low-investment site (Ms = 
—.16 versus —.79; F(1, 69) = 3.06, p < .05 [one-tailed test]). 
A privacy/security statement did not moderate this effect 
(F(2, 69) « 1). Thus, regardless of whether benevolence and 
integrity signals (1.e., privacy/security statements) were pre- 
sent, ability signals (i.e., Web site investment) significantly 
influenced searchers' online purchase intentions. 

The only other significant effect was a privacy/security 
effect (F(1, 69) = 4.95, p = .01). In support of the argument 
that benevolence and integrity signals (e.g., a strong 
privacy/security statement) should have little effect on 
searchers' online purchase intentions, online purchase 
intentions did not differ between those who received the 
strong privacy/security statement and those who received no 
statement (Ms = —36 versus .09; F(1, 46) = 1.10, n.s.). 
However, online purchase intentions were lower among 
those who received a weak privacy/security statement than 
among those who received a strong statement (Ms = —1.15 
versus —.36; F(1, 41) = 3.82, p « .05 [one-tailed test]) or no 
statement (Ms = —1.15 versus .09; F(1, 49) = 8.58, p < .01). 
This finding is consistent with existing research that nega- 
tive cues regarding a firm's character tend to be unexpected, 
violating established norms about "business as usual" 
(Garfinkel 1963). In such situations, people appear unwill- 
ing to buy online from the firm. 

To test whether ability beliefs mediate the investment 
effect on online purchase intentions (H3), we added ability 
beliefs as a covariate to the 2 x 3 ANCOVA: Consistent 
with the requirements for mediation, ability was a signifi- 
cant covariate (F(1, 68) = 8.47, p « .01), and the investment 
effect became nonsignificant (F(1, 68) « 1). We replicated 
this finding with the Sobel (1982) test (Goodman I test sta- 
tistic = 1.93; p = .05). As in Study 1, benevolence and 
integrity beliefs cannot be considered mediators, because 
investment did not significantly affect these beliefs. How- 
ever, it is possible that benevolence and integrity beliefs 
mediate the privacy/security effect on online purchase 
intentions. To test this, we added these beliefs as covariates 


to the 2 x 3 ANCOVA. Inconsistent with the requirements 
for mediation but in support of the prediction that searchers' 
online purchase intentions are influenced by their ability 
rather than by their benevolence and integrity beliefs (H;), 
neither of these beliefs were significant (Fs(1, 67) « 2.02, 
n.s.), and the privacy/security effect on online purchase 
intentions remained significant (F(2, 67) = 5.75, p < .01). 


Concluslons 


Replicating the results of Study 1, we found that searchers' 
ability beliefs, rather than their benevolence and integrity 
beliefs, influenced their online purchase intentions. As a 
result, ability signals (Web site investment) influenced their 
online purchase intentions more than did signals of the 
firm's benevolence and integrity (the presence of a strong 
privacy/security statement). 

Because the objective of this research is to predict 
online purchase intentions, thus far our focus has been on 
people whose goals are most consistent with prepurchase 
deliberation, namely, searchers. For those with this goal, 
ability beliefs are a stronger driver of online purchase inten- 
tions than are benevolence and integrity beliefs. However, 
there may be goals that highlight the importance of a differ- 
ent component of trust than ability beliefs. Specifically, for 
those whose goal for visiting the site is more personal and 
less outcome oriented (i.e., browsers), a different pattern of 
results may emerge. Browsing is a moment-by-moment 
activity rather than a search process for a specific piece of 
information (Janiszewski 1998). As such, it is exploratory 
(Moe 2003) and reflects recreational behavior (Hoffman 
and Novak 1996). Similar to those who read a text for enter- 
tainment (Rosenblatt 1978), browsers are likely focused on 
what they are "living through" during their site visits. Thus, 
whereas searchers tend to be more objective and outcome 
oriented and thus are likely to disengage from having a per- 
sonal experience with the site, browsers' experiences are 
likely more personal. 

We argue that distinguishing between these goals has 
important implications for the relative impact of each trust- 
ing belief on online purchase intentions. Whereas searchers 
focus on performance and thus base their online purchase 
intentions on their ability beliefs, browsers likely have a 
more personal experience with the site and thus will be 
influenced by the most personal aspect of trust: benevo- 
lence beliefs. Benevolence beliefs reflect consumers' beliefs 
that the firm cares about their welfare and well-being (e.g., 
“I trust that the firm is concerned about my wants and 
needs, even if doing so results in profit reductions"), 
whereas integrity beliefs reflect beliefs about the firm's 
moral standards, regardless of how it feels about the indi- 
vidual (e.g., "I trust that the firm is guided by sound busi- 
ness principles and standards"). Likewise, ability beliefs 
reflect beliefs about the firm's expertise, regardless of how 
it feels about the individual (e.g., “I trust that the firm has 
the necessary skills to be successful"). Thus, because 
browsers' experiences are highly personal, their online pur- 
chase intentions should depend on the most personal 
dimension of trust (benevolence) rather than on the less per- 
sonal dimensions (i.e., ability and integrity). 
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H4: Browsers’ online purchase intentions depend on their 
beliefs about the firm's benevolence rather than their 
beliefs about the firm's ability or integrity. 


We also propose that the distinction between searching 
and browsing has important implications for the influence 
of Web site investment on online purchase intentions. 
Whereas prior research has demonstrated the importance of 
separating signals from interpretation, because not everyone 
interprets a signal in the same manner (Prabhu and Stewart 
2001), it may be equally important to separate signal inter- 
pretation from response because even similarly interpreted 
signals may lead to meaningfully different responses. In 
particular, we argue that though both searchers and 
browsers will likely interpret investment as signaling the 
firm's abilities, the impact of this belief on their online pur- 
chase intentions will vary. Specifically, if searchers' online 
purchase intentions are affected by their ability beliefs, abil- 
ity signals (or Web site investment) should influence their 
online purchase intentions. However, if browsers' online 
purchase intentions are unaffected by their ability beliefs, 
ability signals (even if interpreted as such) should have rel- 
atively little influence on their online purchase intentions. 
Consequently, we hypothesize the following: 


Hs: Web site investment influences searchers’ but not 
browsers' online purchase intentions. 


Because the firm studied thus far was an unknown Internet- 
only firm, an additional objective of Study 3 was to repli- 
cate the results for searchers with a well-established firm 
that sells a different set of products (electronics rather than 
home furnishings and accessories). 


Study 3 


Method 

Sample and design. A total of 152 undergraduate stu- 
dents participated in exchange for extra course credit. We 
randomly assigned participants to one of four conditions in 
a 2 (investment) x 2 (goal: searching versus browsing) 
experimental design. 

Web site investment manipulation. In contrast to Studies 
l and 2, we manipulated investment using only technology. 
Specifically, for the high-investment site, we used Macro- 
media's Shockwave technology to allow participants to 
experience an online demonstration and to roll over the 
product image to gather additional information about its 
features. For the low-investment site, we conveyed the same 
information through text and static graphics rather than 
through the use of this technology. 


Procedure. The procedure was similar to that used in 
Study 2, with a few exceptions. Each participant was seated 
at a computer terminal, which contained the instructions, 
the site, and the survey. Participants were told that they 
would be visiting a portion of Kodak’s site devoted to a spe- 
cific model of digital camera. Those assigned to browse 
were instructed to “have fun, looking at whatever you con- 
sider interesting and/or entertaining.” Those assigned to 
search were instructed that before doing so, they should 
type two questions they have for Kodak about this digital 
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camera. To ensure that both groups would be attending to 
information relevant to them, we did not specify what to 
look for.’These instructions are identical to those used in 
prior research to instill a searching versus browsing goal 
(Schlosser 2003). Participants then visited the site. To con- 
trol for the amount of time browsers versus searchers spent 
at the site and any possible impact of this on the dependent 
variables, all participants viewed the site for five minutes, 
which is comparable to the time imposed in prior research 
(Schlosser 2003). We did not investigate privacy/security 
statements in this study. 

After visiting the site, participants completed the online 
survey, which was identical to that used in Study 2, except 
that the risk questions were replaced by the goal manipula- 
tion check. Participants were asked the extent to which their 
time at the site was spent looking for specific information (a 
searching activity) or looking to be entertained (a browsing 
activity) on a scale from 0 ("not at all”) to 5 (“a lot"). In 
addition to the Web site investment items, participants rated 
how informative they found the site to be on a scale from 1 
(“very little") to 7 (“a great deal"). We also made a slight 
change to the trusting beliefs measure: We asked partici- 
pants to focus on the trustworthiness of Kodak's Internet 
marketing department. We made this change to direct par- 
ticipants' attention toward the e-commerce side of the firm 
rather than the firm in general or the brand. For example, 
participants may trust the offline firm's ability, benevolence, 
and integrity, but they may not believe that its Internet mar- 
keting managers share these same qualities. 


Results 


Manipulation checks. We analyzed the manipulation 
check items with a 2 (investment) x 2 (goal) ANOVA. In 
support of the goal manipulation, searchers reported spend- 
ing more time searching (Ms = 2.52 versus 1.86; 
F(1, 150) = 14.08, p < .01) and less time browsing (Ms = 
1.31 versus 1.97; F(1, 150) = 13.80, p « .01) than did 
browsers. In support of the investment manipulation, the 
high-investment site was perceived as a greater investment 
than the low-investment site (Ms = 4.51 versus 2.58; 
FU, 148) = 88.76, p < .01). This effect emerged for both 
browsers (Ms = 4.15 versus 2.34; F(1, 74) = 20.19, p « .01) 
and searchers (Ms = 4.86 versus 2.10; F(1, 74) = 85.14, p « 
.01). No other effects were significant. The sites did not dif- 
fer in perceived informativeness (F(1, 148) « 1). 


Trusting beliefs and online purchase intentions. To test 
whether ability beliefs explain searchers' online purchase 
intentions (Hj) and whether benevolence beliefs explain 
browsers' online purchase intentions (H4), we conducted 
separate hierarchical regression analyses for searchers and 
browsers; we modeled ability beliefs as a function of online 
purchase intentions (Model 1) before adding benevolence 
and integrity beliefs (Model 2). As we predicted, ability 
beliefs were significantly related to searchers' online pur- 
chase intentions (D = .42, t(74) = 3.96, p < .01; R? = .18, 
F(1, 74) 2 15.70, p « .01; see Table 2). The addition of 
benevolence and integrity beliefs to the model did not sig- 
nificantly contribute to searchers' online purchase inten- 
tions (AR? = .03, F(2, 72) = 1.50, n.s.). For browsers, ability 


, TABLE 2 
Study 4: Hierarchical Regression Analysis of Trusting Beliefs on Online Purchase Intentions When Risk Is 
High Versus Low 


A: Results from a Hierarchical Regression Analysis 


Model 1 Model 2 
Social Risk Variable | p F for R? B AR2 F for AR2 
High Product attitudes .60** 7.91™ .58** .02 1.80 
Autotellc NFT —.09 —.14 
Instrumental NFT —.03 —.05 
Online buying .16 .06 
Site problems .10 .17 
Ability .29** .24* 
Integrity .18 
Low Product attitudes .A4** 2.40** .A1** .02 1.12 
Autotelic NFT —.12 —.10 
Instrumental NFT .02 .00 
Online buying .05 .14 
Site problems .09 .01 
Ability .08 .17 
integrity —.17 
B: Results from a Stepwise Regresslon Analysis 
High Product attitudes .64** 33.92** b6** .07 5.94** 
Ability .27** 
Low Product attitudes 48** 13.47 
*p < .10. 


“p< .05. 
Notes: NFT 2 Need for touch. 


beliefs alone did not significantly explain their online pur- 
chase intentions (R? = .02, F(1, 74) = 1.62, n.s.). However, 
the addition of benevolence and integrity beliefs to the 
model significantly contributed to browsers' online pur- 
chase intentions (AR? = .10, F(2, 72) = 4.28, p < .05). Con- 
sistent with H4, browsers' benevolence beliefs were signifi- 
cantly related to their online purchase intentions (B = .33, 
t(74) = 2.70, p « .01), whereas ability and integrity beliefs 
were not (t(74) « 1). Multicollinearity was not a problem in 
the analyses (maximum VIFs < 1.50). 

Web site investment, trusting beliefs, and online pur- 
chase intentions. We analyzed each belief with a 2 x 2 
ANOVA. As we expected, ability beliefs were higher 
among those who visited the high-investment site than 
among those who visited the low-investment site (Ms — 
3.5] versus 2.99; F(1, 148) = 21.74, p < .01). Investment 
did not affect benevolence and integrity beliefs 
(Fs(1, 148) < 1). Furthermore, people’s goals, for visiting 
the site neither directly affected nor interacted ' with invest- 
ment to influence their ability, benevolence, or integrity 
beliefs (Fs(1, 148) < 2.27, n.s.). Thus, investment appears to 
signal a specific trusting belief (i.e., ability rather than 
benevolence or integrity), replicating the results of Studies 
1 and 2 with a different set of stimuli. Moreover, this find- 
ing persisted despite differences in people’s goals. 

In support of Hs, searchers had higher online purchase 
intentions after visiting the high-investment site than after 
visiting the low-investment site (Ms = —03 versus —.76; 
F(1, 74) = 4.92, p < .05), whereas browsers’ online purchase 


intentions were unaffected by investment (Ms = —.48 versus 
—.08 at the high- and low-investment sites; F(1, 74) = 1.26, 
n.s.). This investment x goal interaction was significant 
(F(1, 148) = 5.56, p < .05). 

In addition to demonstrating that the site effect on 
online purchase intentions varies across goals, we tested 
whether the mediating effects of ability vary across goals 
using the procedure that Baron and Kenny (1986) outline. 
Specifically, we regressed online purchase intentions on 
goal, site, goal x ability, and site x ability, and we found a 
significant goal x ability effect (t(146) = 1.82, p « .05 [one- 
tailed test]). For searchers, mediation was supported: Abil- 
ity was significant (F(1, 73) = 10.45, p « 01), and the 
investment effect became nonsignificant (F(1, 73) « 1). A 
Sobel (1982) test supports this finding (Goodman I test sta- 
tistic — 2.74, p « .01). However, for browsers, mediation 
was not supported: Ability had little effect on online pur- 
chase intentions (F(1, 73) = 2.64, n.s.). Furthermore, 
because investment did not affect benevolence and integrity 
beliefs, they cannot be considered mediators. 


Conclusions 


In Study 3, we examined a boundary condition for the effect 
of Web site investment on online purchase intentions: con- 
sumers' goals for visiting sites. We found that the effects of 
Web site investment and ability on online purchase inten- 
tions are specific to searchers and do not generalize to 
browsers. For browsers, the most personal component of 
trust (i.e., benevolence rather than ability) influences their 
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online purchase intentions. Consequently, although 
browsers recognized that Web site investment signals abil- 
ity, it had relatively little influence on their online purchase 
intentions. 

A remaining question is whether the findings are really 
a matter of trust. Recall that trusting intentions involve risk 
(Moorman, Zaltman, and Deshpandé 1992), which causes 
people to consult their trusting beliefs to determine whether 
to perform the trusting behavior. If the findings are driven 
by trust, Web site investment should influence consumers’ 
online purchase intentions under conditions of risk. 
Although buying online is typically risky, certain situational 
factors (e.g., buying a gift for a significant versus an 
insignificant other) can make buying online more or less 
risky. When there is relatively little risk, buying online 
should involve relatively little trust. Consequently, people 
are less likely to consult their trusting beliefs when deciding 
how to act. Indeed, the degree of trust necessary to influ- 
ence behavior has an approximate linear relationship to the 
degree of risk involved (Corritore, Kracher, and Wieden- 
beck 2003; Mayer, Davis, and Schoorman 1995). Thus, 
when the purchasing scenario involves relatively little risk, 
ability beliefs (and signals designed to influence such 
beliefs) should have relatively little effect on searchers’ 
online purchase intentions. 


Hg: Ability beliefs affect searchers’ online purchase intentions 
only when buying involves risk. 

H7: Web site investment affects searchers’ online purchase 
intentions when buying involves risk. 


Another objective for Study 4 was to test the importance 
of trust in determining online purchase intentions versus 
online purchase experience and a desire to examine prod- 
ucts physically before purchase. For people who are moti- 
vated to touch products, barriers to touch can decrease con- 
fidence in product evaluation, though conveying haptic 
information through text and graphics can help (Peck and 
Childers 2003b). If trust is critical in determining online 
purchase intentions, ability beliefs should be related to 
online purchase intentions even when we account for these 
other factors. 


Study 4 


Method 

Sample and design. A total of 98 undergraduate students 
participated in exchange for extra course credit. We ran- 
domly assigned them to one of four conditions in a 2 
(investment: high- versus low-investment site) x 2 (risk 
high versus low) factorial design. They visited the UF site 
used in Studies 1 and 2. 

Risk manipulation. We manipulated risk by varying 
social risk. Specifically, the high-risk scenario was as fol- 
lows: "Imagine that you graduated and obtained your dream 
job at a firm. You are invited to your boss’ house-warming 
party, which takes place in a week.” The low-risk scenario 
was as follows: “Imagine that you are invited to a former 
roommate’s house-warming party, which takes place in a 
month.” Participants in both scenarios were told that this 
person likes modern furniture and accessories, such as those 
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offered at UF’s site, and that they intend to buy a home 
accessory (e.g., a vase) as a house-warming gift for this 
person. 

To test the effectiveness of these scenarios, 41 under- 
graduate students read ‘either scenario and then rated how 
nervous and concerned they would be about making this 
purchase and how risky they considered this purchase on a 
scale from 1 to 7 (higher numbers reflected greater risk). 
We averaged these items to form a perceived risk score (a = 
./6). AS we expected, perceived risk was higher among 
those who read the high-risk scenario than among those 
who read the low-risk scenario (Ms = 4.22 versus 3.12; 
F(1, 37) = 7.12, p = .01). Participants then looked at screen 
captures of the high- or low-investment site and answered 
the Web site investment items (à = .94; see the Appendix). 
As we expected, perceived investment was higher for the 
high-investment site than for the low-investment site (Ms — 
5.12 versus 3.81; F(1, 37) = 14.78, p « .01). Moreover, risk 
did not directly affect or moderate these investment percep- 
tions (Fs(1, 37) « 1.39, n.s.). 


Procedure. 'The procedure and survey were the same as 
that used in Study 2, with a few exceptions. At the begin- 
ning of the experiment, participants read either the high- or 
the low-risk scenario before receiving the search instruc- 
tions. Participants did not receive a privacy/security 
statement. ' 

Unlike Study 2, online purchase intentions were specific 
to home accessories (the gift they were to buy). Participants 
also reported their attitudes toward UF’s home accessories 
to control for any individual differences in their liking of 
UF's offerings (see the Appendix). In addition, to test the 
robustness of our effects related to ability versus integrity 
beliefs, we replaced Mayer and Davis’s (1995) scale with 
an adaptation of Moorman, Deshpandé, and Zaltman’s 
(1993) ability and integrity items.? Three items measured 
ability beliefs (œ = .93), and two items measured integrity 
beliefs (r = .31; see the Appendix). Finally, to test the 
impact of trust on online purchase intentions relative to con- 
sumers' desire to touch products before purchase, we 
administered Peck and Childers's (2003a) need for touch 
(NFT) scale, which measures the need for autotelic and 
instrumental touch (see the Appendix). 


Results 


Trusting beliefs and online purchase intentions. To test 
Hg, we conducted a hierarchical regression analysis sepa- 
rately for searchers in the high- and low-risk conditions: we 
modeled ability beliefs and variables believed to influence 
online purchase intentions (attitudes toward home acces- 
sories, both types of NFT, online purchase experience, and 
site problems) as a function of online purchase intentions 
(Model 1) before adding integrity beliefs (Model 2). As we 
predicted, for searchers in the high-risk condition, ability 
beliefs were significantly related to online purchase inten- 
tions (B = .29, t(48) = 2.39, p < .05; R2 = .52, F(6, 43) = 
7.91, p < .01; see Table 2). The only other significant varia- 
ble was attitudes toward home accessories (B = .60, t(48) = 





3Moorman, Deshpandé, and Zaltman (1993) do not measure 
benevolence beliefs; thus, we do not address them here. 


5.06, p « .01). The addition of integrity beliefs.to the model 
did not significantly contribute to explaining online pur- 
chase intentions (AR? = .02, F(1, 42) « 1). For searchers in 
the low-risk condition, the only significant variable from 
Model 1 was attitudes toward home accessories (B — .44, 
t(45) = 2.88, p < .01; R2 = .27, F(6, 40) = 2.40, p < .05). The 
addition of integrity beliefs did not significantly contribute 
to explaining online purchase intentions (AR? = .02, 
F(1, 39) = 1.12, n.s.). Consistent with Heg, ability beliefs 
influenced online purchase intentions only when risk was 
high. When risk was low, neither ability nor integrity beliefs 
could explain online purchase intentions. Moreover, ability 
beliefs is an important variable in explaining searchers' 
trusting intentions: In the high-risk condition, ability beliefs 
explained more variance in their intentions to buy online 
than did individual differences in NFT or online purchase 
experience. À stepwise regression led to the same conclu- 
sions: Furthermore, multicollinearity was not an issue: The 
maximum VIFs were below 1.99. 

Web site investment, trusting beliefs, and online pur- 
chase intentions. For both beliefs, we conducted a 2 x 2 
ANCOVA. As we expected, ability beliefs were higher 
among those who visited the high-investment site than 
among those who visited the low-investment site (Ms = 
4.01 versus 2.84; F(1, 90) = 14.82, p « .01). Investment did 
not affect integrity beliefs (F(1, 90) = 3.30, n.s.). Further- 
more, risk had neither a direct nor a moderating effect on 
either belief (Fs(1, 90) « 1). Thus, regardless of purchase 
risk, investment communicates a firm's ability. 

For online purchase intentions for home accessories (the 
gift), a 2 x 2 ANCOVA yielded a significant investment X 
risk interaction (F(1, 88) = 6.06, p « .05). Consistent with 
H5, when risk was high, searchers’ online purchase inten- 
tions were higher at the high-investment site than at the 
low-investment site (Ms = .76 versus —.38; t(49) = 2.64, p « 
.05), whereas when risk was low, investment had little effect 
on their online purchase intentions (Ms = .46 versus .82 at 
the high- and low-investment sites; t(47) « 1). 

In addition to demonstrating that the site effect on 
online purchase intentions varies across risk, we tested 
whether the mediating effects of ability vary across levels of 
risk by regressing online purchase intentions on risk, site, 
risk x ability, and site x ability. As we expected, the risk X 
ability effect was significant (t(96) = 1.93, p ='.057). In the 
high-risk condition, mediation was supported: Ability was a 
significant covariate (F(1, 43) = 8.67, p < 01), whereas the 
previously significant site effect was reduced to nonsignifi- 
cance (F(1, 43) « 1). The Sobel (1982) test provides further 
support for this finding (Goodman I test statistic = 2.13, p « 
.05). In the low-risk condition, however, mediation was not 
supported: It did not affect online purchase intentions 
(F(1, 40) « 1). As in Studies 1—3, integrity cannot be con- 
sidered a mediator, because investment did not'significantly 
affect this variable. 


Discussion 


Collectively, these studies clarify the role of trust in predict- 
ing online purchase intentions. Across four studies with dif- 
ferent samples, companies, and products sold, we demon- 


strate that Web site investment influences searchers' inten- 
tions to buy online by influencing one component of trust- 
ing beliefs, ability (versus benevolence and integrity). Five 
conclusions can be drawn from our findings: First, they pro- 
vide empirical support for the position that trusting beliefs 
should be considered separately from trusting intentions 
(Moorman, Deshpandé, and Zaltman 1993; Moorman, Zalt- 
man, and Deshpandé 1992). Although definitions of trust 
typically contain both (e.g., Rotter 1980), most empirical 
research on consumer trust implicitly assumes that measur- 
ing trusting beliefs alone is sufficient (e.g., Morgan and 
Hunt 1994). However, our research suggests that it is 
important to measure trusting intentions because building 
trusting beliefs does not necessarily lead to higher trusting 
intentions. For example, although the presence (versus 
absence) of a strong privacy/security statement increased 
searchers' benevolence and integrity beliefs in Study 2, it 
did not lead to higher online purchase intentions. Likewise, 
in Study 3, Web site investment increased browsers' ability 
beliefs but not their online purchase intentions. 

Second, our findings are consistent with the view that 
risk is a necessary (but not a sufficient) condition for trust 
(Moorman, Deshpandé, and Zaltman 1993; Moorman, Zalt- 
man, and Deshpandé 1992). Specifically, in Study 4, we 
demonstrate that Web site investment and trusting beliefs 
influence online purchase intentions only when risk is high, 
or when the behavior requires trust. In contrast, when risk is 
low (and, thus, the behavior does not require trust), online 
purchase intentions were largely unaffected by Web site 
investment and trusting beliefs. 

Third, we demonstrate the importance of considering 
the differential impact of trusting beliefs on trusting inten- 
tions. As we demonstrated across four studies, trusting 
beliefs do not have an equally strong influence on online 
purchase intentions. For example, in Study 2, we find that 
among searchers, gains in ability beliefs led to higher online 
purchase intentions, whereas gains in benevolence and 
integrity beliefs did not. Consequently, signals that were 
effective in building benevolence and integrity beliefs (i.e., 
the presence of a strong privacy/security statement) were 
less effective than ability signals (i.e., Web site investment) 
in increasing online purchase intentions. Together, these 
findings suggest that it is important for an online firm to 
identify the most influential trusting belief in order to iden- 
tify the signal that will most effectively increase trusting 
intentions. 

Fourth, our research contributes to the signaling litera- 
ture. Prior research has demonstrated the importance of sep- 
arating signals from interpretation because not all con- 
sumers interpret signals in the same way (Prabhu and 
Stewart 2001). Our research builds on this by demonstrating 
that even when consumers interpret signals as a firm 
intends, their responses may vary significantly. For exam- 
ple, although searchers and browsers (Study 3) and those 
perceiving high and low risk (Study 4) interpreted Web site 
investment as a signal of ability, Web site investment signif- 
icantly affected only the online purchase intentions of 
searchers and for those perceiving high risk. 

Fifth, these findings are also helpful in explaining what 
appear to be contradictory findings regarding the effective- 
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ness of privacy/security statements relative to Web site 
design. Our research suggests that though privacy/security 
statements can influence certain trusting beliefs (i.e., benev- 
olence and integrity beliefs), they are less effective in 
increasing searchers’ willingness to buy online. Conse- 
quently, even when privacy/security statements improve 
beliefs about a firm's trustworthiness, they can still be rela- 
tively less effective in converting visitors to buyers. These 
findings also help explain why Web site design plays such 
an important role in online purchase intentions. Instead of 
serving a purely aesthetic function, it signals that a firm's 
ability can be trusted, which we found to be the most sig- 
nificant driver of searchers' online purchase intentions. 


Managerial Implications 


The prevailing wisdom has been that concerns about buying 
online would diminish as consumers gain experience buy- 
ing online (eMarketer 2005). Thus, marketing managers 
may wonder whether they need to invest in establishing 
trust as consumers gain online purchase experience. Con- 
trary to this prevailing wisdom, however, recent reports 
indicate that consumers at all levels of Internet experience 
are increasingly cautious about buying online (eMarketer 
2005; Forrester 2005). Compounding this problem is the 
notion that the actual risks associated with being online 
may even increase over time (Rust, Kannan, and Peng 
2002). Consistent with the view that firms need to establish 
trust regardless of their visitors' online buying experience at 
other sites, we found that Web site investment had a signifi- 
cant effect on consumers' online purchase intentions by 
influencing their ability beliefs even when we controlled for 
online purchase experience (Studies 1—4). 

Likewise, marketing managers of established offline 
firms may wonder whether they need to invest 1n Web site 
design to establish trust online. Our findings suggest that 
establishing trust is important for both an established offline 
firm (Kodak) and an unknown pure play (UF). It may seem 
surprising that consumers did not trust the ability of a 
known firm to bandle online transactions successfully, yet 
this finding may reflect well-founded consumer skepticism 
about whether firms have the same abilities online as they 
do offline. Indeed, recently publicized examples of delivery 
failures by known firms (e.g., Toys "R" Us during the 2003 
Christmas season) likely contribute to such concerns. Part 
of consumers' caution may also stem from their awareness 
that offline firms may outsource e-commerce functions, 
thus making it difficult to predict the quality of the online 
transaction on the basis of their offline experiences with the 
firm. 

To use Web site investment effectively, however, man- 
agers should identify the reason most consumers visit their 
site. Our results indicate that Web site investment is effec- 
tive in boosting online purchase 1ntentions when visitors are 
searchers. Thus, if most visitors are searchers, Web site 
investment is warranted, but if most visitors are browsers, 
such investments may be ineffective. If the site attracts 
both, it may be best to invest mainly in pages that are most 
likely to attract searchers. For example, compared with 
browsers, searchers tend to "drill down" beyond category- 
level pages (e.g., a page featuring all the living-room furni- 
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ture that a store offers) to specific product pages (e.g., the 
pages on the Bay Club chair; Moe 2003). Thus, investment 
in such technology as Shockwave or Design Within Reach 
may be most effective at the product level rather than the 
category (or home page) level. Firms can identify the seg- 
ment visiting their sites by using clickstream data 
(searchers spend more time at product and search pages 
than browsers do; Moe 2003), usability studies, or more tra- 
ditional market research techniques, such as online surveys 
or panel studies. 

It appears that Web site investment is effective for pur- 
chases that involve risk, particularly social risk. For exam- 
ple, our findings suggest that sites selling gifts for important 
social occasions (e.g., weddings, birthdays, formal business 
functions) or holidays (e.g., Christmas, Valentine's Day) 
may benefit more from Web site investment than sites sell- 
ing products with less risk (e.g., office supplies). For firms 
selling products that vary in social risk, investing in Web 
site design would likely be most effective at the product- 
level pages for products that are most likely to carry high 
social risk (e.g., jewelry). 

Our focus has been on how to convert visitors into those 
who are willing to buy online. However, the firm may have 
other objectives, such as creating goodwill. Because Web 
site investment did not significantly affect benevolence or 
integrity beliefs in any of our studies, this tactic is likely to 
be ineffective in meeting such objectives. The findings from 
Study 3 suggest that privacy/security statements are better 
than Web site investment for influencing these trusting 
beliefs. It would be valuable to examine how other signals 
(e.g., announcements of investments in social causes) might 
communicate benevolence and integrity as well. 

This research also has important public policy implica- 
tions. Although signals can be used to communicate a firm's 
true abilities and intentions, they can also be used to mis- 
lead (Prabhu and Stewart 2001). Unscrupulous retailers 
may take advantage of consumers by offering high- 
investment sites for inferior products or an inferior online 
transaction or, worse, by stealing the consumer's identity. 
Indeed, according to the National Consumer League (2005), 
the second most common Internet scam complaint was 
about products that were never delivered or were misrepre- 
sented at an e-commerce site. Yet many of the Federal Trade 
Commission's (FTC's) initiatives focus on e-commerce 
problems at the technological level by recommending better 
encryption and firewall technology (e.g., FTC 2003). 
Although such investments are important, our findings sug- 
gest that e-commerce problems are a marketing issue as 
well. Specifically, our research indicates that consumers 
infer from investments in site design that the firm's abilities 
can be trusted. As a result, they are more willing to buy 
from the firm. Although a firm with a well-designed site has 
demonstrated an investment in site design, it does not fol- 
low that the firm itself has these capabilities. Oftentimes, 
site design and order fulfillment are outsourced and thus do 
not reflect the firm's online abilities. Furthermore, even if 
the firm developed the site internally, it does not follow that 
its abilities in this area will generalize to other areas, such 
as order fulfillment. However, because consumers appear to 
infer that a firm with a well-designed site can be trusted, 


unscrupulous firms could use Web site investment to 
deceive through implication. i 

As with regulating other e-commerce practices, there is 
debate about whether government intervention is necessary 
or whether industry self-regulation is sufficient (Rust, Kan- 
nan, and Peng 2002). Currently, the FTC has taken a caveat 
emptor stance toward such issues. Although the FTC pro- 
vides standards for what to include in a privacy/security 
statement (e.g., notifying consumers of a firm’s information 
practices and ensuring the security of the information col- 
lected) and advises consumers to consult these statements 
before buying (www.ftc.gov), our results suggest that even 
when consumers read these statements, theit online pur- 
chase intentions are largely unaffected by them unless they 
‘are noticeably weak. Instead, consumers’ intentions to buy 
online from a vendor are affected more by its Web site 
investment. Thus, the FTC’s suggestions may not fully pro- 
tect consumers from being victimized. Conversely, industry 
self-regulation may be sufficient. Investing in Web site 
design may not be profitable for most low-quality firms, 
thus discouraging such firms from doing so. Publicly visi- 
ble signals that require high up-front investments before any 
sales transactions rely on repeat purchases and word-of- 
mouth recommendations to recoup such sunk costs (Kir- 
mani and Rao 2000). Thus, even if low-quality firms can 
induce trial with a high-investment site, the firm’s true abil- 
ities would be revealed. Consequently, future purchases 
would be unlikely, making it difficult for low-quality firms 
to recover Web site expenditures. 


Limitations and Future Directions 


As with any study, the findings should be considered in 
light of their limitations. For example, instead of being part 
"of the site, the privacy/security statements used were 
included in the survey booklets. This enabled us (1) to 
ensure that those in the strong and weak privacy/security 
conditions were exposed to the statement and (2) to control 
the timing of when they read the statement relative to expe- 
riencing the site. Although such gains in control come at the 
expense of face validity, this likely provided a more conser- 
vative test of our predictions. That is, by placing the state- 
ments in the survey booklet and exposing participants to 
them after their visit to the site (but immediately before 
reporting their online purchase intentions), we likely 
increased their salience. Yet despite this, investment had a 
stronger effect on online purchase intentions than did the 
presence of a strong privacy/security statement. Still, it 
would be beneficial to examine what, if any, effect a 


privacy/security statement would have if its exposure was 
allowed to vary. Perhaps the first hurdle is to establish trust 
in the firm’s ability. Only then might searchers consult such 
statements. 

Although we studied several products (i.e., furniture and 
accessories, which are likely to be evaluated more on their 
experiential qualities than on their search qualities, and dig- 
ital cameras, which are likely to be evaluated more on their 
search qualities than on their experiential qualities), all 
were high-ticket items. It would be fruitful to examine 
whether such results generalize to inexpensive items, such 
as CDs or books. For such products, the risk of buying 
online might be reduced, thus reducing the effect of invest- 
ment on online purchase intentions. It would also be fruitful 
to examine whether Web site investment has the greatest 
influence on trusting intentions for sites that feature ser- 
vices, such as banking or travel. When transactions are 
more personal in nature, perhaps trust in the firm's benevo- 
lence would be as important as, if not more important than, 
trust in its ability. 

Note that participants did not have prior experience buy- 
ing online from either UF or Kodak (which did not sell 
directly from its site at the time of Study 3). It would be 
worthwhile to examine how online purchase experience 
with a firm might influence consumers' interpretations of 
the firm's signals, as well as their trusting beliefs and inten- 
tions. Furthermore, we examined only signals that can be 
observed and experienced, but other types of signals are 
also possible (Kirmani and Wright 1989). For example, 
firms might signal their Web site investments through 
announcements in press releases or on the site itself. How- 
ever, because such announcements require relatively little 
cost (and thus could be used by both high- and low-quality 
firms), they may be less effective than site investment in 
building trust. This would be a worthwhile future research 
direction. 


Conclusion 


Despite these limitations, we believe that our framework 
provides meaningful insights into how firms can most effec- 
tively signal their trustworthiness to convert online visitors 
to buyers. Furthermore, it contributes to the trust literature 
by demonstrating the importance of treating trusting beliefs 
as (1) a multidimensional construct and (2) a necessary but 
not sufficient condition for trust. Similarly, it contributes to 
the signaling literature by demonstrating that even if con- 
sumers interpret signals as a firm intends, their responses 


can vary significantly. 
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APPENDIX 
Measures and Items 





Measures and Items Source 


Online Purchase Intentions (Trusting Intentions) New scale 
"Unlikely/likely" (—3 to +3) 
"Impossible/possible" (—3 to +3) 
"Improbable/probable" (—3 to +3) 


Trusting Bellefs 

Ability (scale used in Studies 1-3). > Mayer and Davis (1999) 
Urban-Furniture seems very capable of performing online transactions. 

Urban-Furniture appears to be successful at the things It tries to do. 

Urban-Furniture seems to have much knowledge about what needs to be done to fulfill 
online transactions. 

| feel very confident about Urban-Furniture's online skills. 

Urban-Furniture appears to have speclallzed capabilities that can Increase Its perfor- 
mance with online transactions. 

Urban-Furniture appears to be well qualified In the area of e-commerce. 

Benevolence (scale used in Studies 1—3} b Mayer and Davis (1999) 
Urban-Furniture seems very concerned about my welfare. 

My needs and desires appear to be important to Urban-Furnlture. 

It doesn't seem that Urban-Furniture would knowingly do anything to hurt me. 
Urban-Furniture seems to really look out for what is important to me. 
Urban-Furniture appears to go out of its way to help me. 

Integrity (scale used in Studies 1—3}. b Mayer and Davis (1999) 
Urban-Furniture seems to have a strong sense of Justice. 

Urban-Furnrture appears to try hard to be fair In dealings with others. 
Urban-Furniture’s actions and behaviors are not very consistent. (reverse scored) 
| like Urban-Furniture's values. 

Sound principles seem to guide Urban-Furniture's behavior. 

Ability (scale used In Study 4) Moorman, Deshpandé, 
"Nonexpert/expert" (1 to 7) and Zaltman (1993) 
"Untrained/trained" (1 to 7) 

"Inexperienced/experienced" (1 to 7) 

Integrity (scale used in Study 4) Moorman, Deshpandé, 
“No integrity/integrity" (1 to 7) and Zaltman (1993) 
Urban-Furniture does not have a great deal of integrity.: (reverse scored) 


Web Site Investment New scale 
The amount of time Invested into developing this website seems to be: “very little/a great 
deal.” (1 to 7) 
The amount of effort devoted to developing this website seems to be: “very little/a great 
deal.” (1 to 7) ' 
The amount of money invested into developing this website seems to be: “very IIttle/a 
great deal? (1 to 7) 


Online Risk Perceptions (scale used in Study 2) New scale 
Shopping online is risky. 
Providing credit card information online is risky. 
Providing personal information (i e., social security number and mothers maiden name) 
online Is risky. 
Purchasing tems online Is risky. 
Providing my e-mall address and phone number online Is risky. 
Registering online is risky. 
It is riskier to shop online for a product than to shop offline for It. 


Attitudes Toward Home Accessories (scale used in Study 4) Í New scale 
"Bad/good" (-3 to +3) 
“Unpleasant/pleasantť (-3 to +3) 
"Dislike/like" (—3 to +3) 


146 / Journal of Marketing, April 2006 


APPENDIX 
Continued 
eee 


Measures and Items 


Source 


NFT (scale used In Study 4) 
Autotelic NFT 
Touching products can be fun. 


Peck and Childers 
(2003a) 


| like to touch products even if | have no intention of buying them. 


When browsing in stores, I like to touch a lot of products. 


When walking through stores, | can’t help touching all kinds of products 
When browsing in stores, rt is Important for me to handle all kinds of products 


| find myself touching all kinds of products in stores. 
Instrumental NFT 


| place more trust in products that can be touched before purchase. 

| feel more comfortable purchasing a product after physically examining it. 

| feel more confident making a purchase after touching a product. 

If | can't touch a product In the store, | am reluctant to purchase the product. 

The only way to make sure a product is worth buying is to actually touch tt. 

There are many products that | would only buy if | could handle them before purchase. 


aMeasured on a scale from 1 (“disagree strongly") to,5 (“agree strongly’). 


bin Study 3, we replaced Urban-Furniture with Kodak. 


‘Measured on a scale from 1 (“disagree strongly") to 7 ("agree strongly"). 
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From the Editor: The Maturation of 
Marketing as an Academic Discipline 


70th volume, and the changes in the journal reflect the 

changeará in the marketing field. In short, this 1s a tran- 
sitional period for the marketing discipline, one that reflects 
the maturation of marketing thought and a growing self- 
confidence in the discipline's ability to create important 
new knowledge on its own. 


; | ournal of Marketing (JM) is now in the middle of its 


Marketing as an Applied Discipline 


Unlike many of the core disciplines from which marketing 
borrows (e.g., economics, psychology, anthropology), mar- 
keting arose as a discipline taught in business schools, pri- 
marily for the purpose of helping marketing managers do 
their jobs better. This applied orientation for the field ties 
academic thought in marketing to the real world, either 
directly or indirectly. The business world benefits tremen- 
dously from academic publications in journals such as JM, 
as a visit to any sophisticated practitioner conference, such 
as the American Marketing Association (AMA) Advanced 
Research Techniques Forum, the AMA Frontiers in Service 
Conference, or any Marketing Science Institute conference, 
will quickly confirm. This applied orientation also has its 
advantages for marketing academics, in that the perceived 
usefulness of their work has led to higher salaries compared 
with professors in the humanities or the pure sciences. 


Science Versus Technology 


The applied nature of marketing has created some internal 
conflicts in the academic field because much of the acade- 
mic culture of marketing has been borrowed from core dis- 
ciplines in the social sciences, notably, from psychology 
and economics. The pure science culture tends to treat 
knowledge as inherently good and to disregard (or even dis- 
dain) application. This culture pervades the social sciences, 
in which application of scientific knowledge is largely con- 
sidered outside the academic domain. Such a dichotomy is 
also found in the physical sciences, in which fundamental 
scientific discovery is in conflict with, but is also symbiotic 
with, technological application. For example, applications 
in solid state physics in a physics department may make 
technological advances possible in an engineering depart- 
ment. Similarly, mathematical advances in a mathematics 
department may create the fundamental basis for new com- 
putational technologies in a computer science department. 
Viewed in these terms, marketing departments should per- 
haps be most similar to departments that create more 
applied technologies, such as engineering or computer 
science. 

Nevertheless, the actual evolution of marketing depart- 
ments has resulted in a science-technology dichotomy, 
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often existing within the same department, and certainly 
both points of view coexist within the field of marketing 
itself. This has led to certain misunderstandings, with 
science-oriented researchers looking down on technologi- 
cally oriented researchers, and vice versa. 


Legitimate Purposes for Research 
in Marketing 


The science-technology dichotomy implies two main legiti- 
mate purposes for research in marketing. The first is 
research that advances fundamental knowledge in market- 
ing and furthers scientific thought. Winners of JM's Harold 
H. Maynard Award are sometimes articles of this type (e.g., 
MacInnis and De Mello 2005). Most of the consumer 
behavior research that is published in the field of marketing 
is of this nature. In some journals (e.g., Journal of Con- 
sumer Research), concern for the practical application of 
scientific knowledge has traditionally been deemphasized. 

The second legitimate purpose for research in marketing 
is research that uses existing fundamental scientific knowl- 
edge to advance practical interests (usually managerial 
interests but also consumer interests or public policy inter- 
ests). Many of the quantitative decision models in market- 
ing are of this type. For this type of research, there is (and 
should be) less emphasis on advancing scientific thought 
and more interest in inventing something new that is useful. 
Some examples of this approach are recent winners of JM's 
Marketing Science Institute/H. Paul Root Award (Reinartz, 
Thomas, and Kumar 2005; Rust, Lemon, and Zeithaml 
2004). Such articles often appear in the more quantitatively 
oriented journals, such as Journal of Marketing Research 
and Marketing Science. 

To some degree, JM is a bridge between these two pur- 
poses. Many of its articles advance scientific thought in 
marketing, but there is always attention to how the work is 
meaningful to stakeholders in marketing. 


Homegrown Theory 


How to deal with theory is one of the major issues that the 
marketing field must address as it matures. In the scientific 
paradigm, theory is essential to provide generalizability and 
efficient ways of understanding. Too often, this has been 
understood as requiring “basic discipline" theory, meaning 
that a theory must be fundamental to psychology, econom- 
ics, or some other basic discipline. Often, this has been 
interpreted to mean that research in marketing that does not 
borrow fundamental theory from one of the basic disci- 
plines is ‘“‘atheoretical.” Marketing will only become mature 
as a field when it is realized that it is also possible to have 
theory that is indigenous to marketing—that is, theory that 
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does not necessarily result in a fundamental advance in one 
of the basic disciplines. There are already some good exam- 
ples of this in the literature (e.g., Shugan and Xie 2000; 
Vargo and Lusch 2004). 

Reviewers and editors must learn to be accepting of 
such theory. Too often, I have observed reviewers react to 
new, marketing-related theory by saying that it is “ad hoc.” 
By ad hoc, they really mean that it has been created by the 
author rather than borrowed from a core discipline. In mar- 
keting, there must be more acceptance of homegrown 
theory that is developed on the basis of problems and issues 
that are not only specific to marketing but also generaliz- 
able within marketing. For example, it should be acceptable 
to develop behavioral theory that is relevant to marketing 
but not particularly fundamental to psychology. Likewise, it 
should be acceptable to develop theory in the form of 
mathematical models that are useful to marketing but not 
necessarily of interest to economics. 


The Changing Environment 


The maturation of the marketing discipline also means 
keeping up with the technological and economic environ- 
ment. As technology advances, the economy changes in 
predictable ways that affect how marketing should be prac- 
ticed and studied. Technology—particularly, information 
technology—is making it easier to communicate with cus- 
tomers, store information about customers, and process this 
information to provide better service to customers. The 
result is that forming and maintaining relationships with 
customers on a large-scale basis is easier than it has ever 
been. This results in an expansion of the importance of ser- 
vice in the economy. Because technology development is a 
one-way street—it is always moving forward—this expan- 
sion of the importance of service is certain to continue, and 
the concepts of service marketing and relationship market- 
ing will become increasingly central to the field. 
Technology has also radically changed the way research 
is conducted, and it will continue to do so. It is clear, though 
perhaps easy to forget, how much more computational 
research in marketing became in the 1960s and 1970s when 
computers first became widely usable for research. Today, 
almost every doctoral student takes for granted the ability to 
use high-speed computing capability for data analysis. This 
trend will continue, and it is safe to predict that future 
research in marketing will become even more computation- 


ally focused, with likely advances coming from approaches 
that are highly computationally intensive. This has already 
been observed in the past ten years or so as Bayesian meth- 
ods have become mainstream, and in the next ten years, 
large-scale simulations will likely become widely used as a 
research approach. 

The effect of technology on communication technolo- 
gies also results in globalization of the economy. A gain, this 
is a one-way street, and the economy will become increas- 
ingly global over time. Marketing practice is increasingly 
global, and the same trend is reflected in marketing acade- 
mics. To mature fully as a discipline, marketing must 
become fully global, and the related journals, conferences, 
and research papers must reflect this. As such, JM has tried 
to lead in this effort by expanding the international diversity 
of the editorial board. Another model of internationalization 
is the 2006 AMA Frontiers in Service Conference, which 
will be held in Brisbane, Australia, and will feature presen- 
ters from a record 29 countries. To further JM's interna- 
tional agenda, as editor, I have made it a point to travel 
internationally whenever possible and to encourage interna- 
tional submissions. 


Summary 


After 70-plus years as an academic discipline, the market- 
ing field has come a long way, but it needs to make further 
progress to be truly mature. We need to recognize our pur- 
pose as an applied discipline and respect and tolerate the 
conflicting value systems inherent in the science and tech- 
nology paradigms. We need to recognize that advancing sci- 
ence is not the only legitimate purpose for an academic arti- 
cle in marketing, nor is advancing the technology of 
marketing by inventing practical new models or devising 
new procedures. Both kinds of advances should be encour- 
aged. Crucially, we need to be confident enough as a disci- 
pline to create and accept indigenous theory that is based on 
marketing issues rather than that of a core discipline. We 
need to modernize and update our field by reflecting the 
changes created by technological advances, which means an 
increasing focus on service and relationships, greater use of 
computational methods, and increasing globalization. The 
mature marketing discipline will be tolerant, confident, 
global, technologically savvy, and unashamed to be pri- 
marily about marketing. 
ROLAND T. Rust 
Editor 
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From the Web Editor: Improvements 
to the Journal of Marketing Web Site 


~~ 


am pleased to highlight several features that have 

been recently added to the Journal of Marketing (JM) 

Web site (http://www.marketingpower.com/jm). These 
improvements aim to increase the salience of JM in readers' 
professional careers, encourage greater interaction between 
JM readers and authors, and share new information about 
the journal. I take this opportunity to thank American Mar- 
keting Association (AMA) colleagues Chris Leporini, 
Francesca Van Gorp Cooley, and James Heckman for their 
help in facilitating these improvements and Charles 
Hofacker for promoting the improvements in the Academic 
Resource Center section of the AMA Web site. Please try 
the following features that are currently available at the JM 
Web site. 


Current Features 


Make JM Your Default Home Page 


Readers are only a click away from making JM a prominent 
part of their professional life. A. "Make JM Your Home 
Page" button on the main Web site (and linked pages) helps 
set JM as the default home page for Internet Explorer users. 
We plan to extend this feature to other Internet browsers. 


Visit the "Book Review” Section 


This section has transitioned to the Web site under the lead- 
ership of the JM Book Review Editor Ramarao Desiraju. 
This process has opened opportunities for new features that 
Ram has planned, such as cross-linking book reviews to 
relevant content at other AM A-related sites and incorporat- 
ing readers' inputs about the content of book reviews. Over- 
all, this section appears exciting. 


Blog on Specific JM Articles 


Blogging combines the immediacy of e-mail with the visi- 
bility and permanence of a public, informed, and ongoing 
online dialogue that is vital for community-building efforts. 
To encourage blogging by JM authors and readers, a URL 
tag line will accompany each article beginning with JM's 
July 2006 issue. The broad goal is to stimulate à continuous 
public discourse on each JM publication. In particular, 
blog-stimulated interactions between marketing practition- 
ers and academics are likely to enhance the relevance of 
published research; they should also contribute new and 
interesting insights that advance the discipline. Authors of 
all JM articles now have the opportunity to prepare an 
executive summary along the following lines: a list of short 
questions addressed in that article, a few sentences that 
answer these questions by showcasing key findings, and, 
finally, a list of short issues that the article offers for further 
research. The executive summary is posted in an article- 
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specific URL and serves as a starting point for open-ended 
discussion about that article through blogs that participants 
contribute. The format of the executive summary intention- 
ally differs from the standard abstract of JM articles to 
encourage participation from and interactions with market- 
ing managers. I urge all authors and readers to participate 
actively in this blogging forum. Related future plans include 
preparing frequently asked questions and search/retrieval 
features that support blogging. 


Check Out the List of “Most-Cited JM Articles” 


The JM Web site showcases a list of its top ten most-cited 
articles; this list is based on information supplied by Google 
Scholar. These rankings are currently anchored to the 2000— 
2005 period and represent a snapshot as of the date indi- 
cated. The rankings are updated each quarter, and Web site 
visitors can access ranking information from previous quar- 
ters. Additional plans for future improvements include the 
identification of “classic” JM articles from each decade 
(from the 1960s through the 1990s) and new metrics to 
assess the popularity of JM articles (e.g., the number of 
downloads of specific JM articles through the "pay-per- 
view" option that AMA provides). 


Get Acquainted with JM Statistics 


The Web site includes pie charts that depict various sub- 
scriber types for both members and nonmembers. In the 
future, we plan to include graphical information on citation 
and impact statistics for the journal. 


Future Features 


I plan to add a data/code repository at, the JM Web site, a 
feature that supports both replication research and graduate 
student learning. For example, researchers interested in 
replicating the work in a JM article may benefit from the 
ability to download the data used therein. Similarly, market- 
ing academics who teach doctoral seminars (or rigorous 
MBA classes) may develop empirically focused student 
assignments that are structured around specific journal arti- 
cles. Authors of published JM articles can help by con- 
tributing a copy of their data sets/program codes to a reposi- 
tory housed within the JM Web site. This approach requires 
sensitivity to protecting the right of participating JM 
authors to use their data 1n their own future publications. 
Thus, JM plans to restrict downloads of repository content 
to researchers who submit to a rigorous registration/authen- 
tication process and who commit to using the extracted 
data/code only for research replications (not research 
extensions). 

I hope you find the current (and planned) features at the 
JM Web site useful. Ultimately, the JM Web site is a 
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resource for the entire JM community, so meaningful 
improvements to the Web site critically hinge on readers’ 
suggestions and inputs. Please comment on the aspects you 
like (and those you do not like), and take a few moments to 
propose new features that will help you. I especially wel- 
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come your thoughts on bow the journal's Web site can bet- 
ter serve you (siva.jmwebsite @ gmail.com). 


SIVA K. BALASUBRAMANIAN 
Web Editor 


Michael Haenlein, Andreas M. Kaplan, & Detlef Schoder 


Valuing the Real Option of 
Abandoning Unprofitable Customers 
When Calculating Customer Lifetime 

Value 


In recent years, several authors have developed models that focus on the allocation of scarce marketing resources 
based on customer lifetime value (CLV). These approaches use CLV to develop a rank order of customers and 
recommend devoting more resources to customers with higher ranks. However, it has been discussed in the 
literature that a simple net present value analysis may not reflect the value of the flexibility to make such decisions. 
Therefore, some authors recommend the use of a real-options analysis in certain situations. Building on this stream 
of research and using the case of the option to abandon unprofitable customers, this article proposes an approach 
that combines real-options analysis and CLV; this approach explicitly values the seller's flexibility to abandon 
unprofitable customers. Using a combination of examples, empirical analysis, and Monte Carlo simulations, the 
authors provide evidence that the divergence between CLV that includes and CLV that excludes option value can 
be substantial and may not be the same for all customers. Therefore, the authors conclude that a distribution of 
customers based on CLV can change when option value is included. Thus, using CLV as a basis for marketing 
decisions but not including the value of the option to make such decisions a priori when calculating CLV can lead 






to an overall blased result. 


defined long-term customer profitability as "the pre- 
sent value of the future profit stream expected over a 
given time horizon of transacting with the customer." Since 
then, customer lifetime value (CLV) has contributed to solv- 
ing various problems, from mailing decisions in the catalog 
sales industry (Bitran and Mondschein 1996) to decisions 
related to mergers and acquisitions (Selden and Colvin 
2003) to questions about company valuation (Gupta, 
Lehmann, and Stuart 2004). In recent years, several authors 
have developed models that focus on the allocation of 
scarce marketing resources based on CLV (e.g., Reinartz, 
Thomas, and Kumar 2005; Rust, Lemon, and Zeithaml 
2004; Venkatesan and Kumar 2004). These approaches use 
CLV to develop a rank order of customers and recommend 
devoting more resources to customers with higher ranks. 
Some authors go so far as to state that customers with low 
ranks (especially when these show negative CLV) should be 
abandoned completely to increase overall profitability of 
the customer base (Zeithaml, Rust, and Lemon 2001). 


fA e 30 years ago, Kotler (1974, p. 24) 
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However, it has been discussed in the literature that the 
simple net present value (NPV) analysis may not reflect the 
value of the flexibility to make such decisions. Therefore, 
some authors recommend the use-of a real-options analysis 
in certain situations (e.g., Amram and Kulatilaka 1999; 
Dixit and Pindyck 1995; Pindyck 1991). To the best of our 
knowledge, real-options analysis has been investigated only 
twice in the context of CLV, namely, in a publication by 
Levett and colleagues (1999) and a working paper by 
Hogan and Hibbard (2002). Levett and colleagues propose 
that the relationship between a buyer and a seller can be 
expressed as a set of options. Because the customer has the 
option to choose between buying and not buying at any 
point in time, the authors determine CLV using a binomial 
option valuation model. Hogan and Hibbard develop a real- 
options-based framework for assessing the value of busi- 
ness relationships. They state that the value of any relation- 
ship is influenced by the core benefit value and the value of 
two options: a switching option (i.e., the flexibility to 
switch to another seller or buyer) and a growth option (i.e., 
the freedom to expand the relationship beyond its current 
scope). Thus, relationships can be categorized on the basis 
of a three-dimensional typology, depending on the magni- 
tude of each of these value components. 

In this article, we propose an alternative approach to 
combining real-options analysis and CLV that explicitly 
values the seller's flexibility to abandon unprofitable cus- 
tomers. Traditional CLV calculations assume that a com- 
pany will continue to deal with unprofitable customers, and 
indeed, a company may do so, in which case regular CLV 
calculations are appropriate. However, when a firm discon- 
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tinues unprofitable customers because it believes that the 
customer relationship cannot become profitable in future 
periods, we propose a modification of standard CLV calcu- 
lations. In this sense, our work differs from that of Levett 
and colleagues (1999) because we focus on modeling the 
seller’s options, whereas they concentrate on the buyer’s 
options and build on the assumption that the seller will 
always sell the product when the buyer is willing to pur- 
chase it. Our article is also different from Hogan and Hib- 
bard’s (2002) work because we develop a specific metric 
(i.e, CLV) to quantify the value of a specific type of 
market-based asset (i.e., customer relationships), whereas 
their approach is conceptual and broader in scope. Because 
our results provide evidence that the divergence between 
CLV that includes and CLV that excludes option value can 
be substantial and may not be the same for all customers, 
we conclude that a distribution of customers based on CLV 
can change when option value is included. Therefore, using 
CLV as a basis for marketing decisions but not including 
the value of the option to make such decisions a priori when 
calculating CLV can lead to an overall biased result. 

Take the mobile phone industry as an example. Because 
of the absence of indicators of customer quality at the time 
of acquisition, mobile phone operators usually attribute the 
same customer acquisition cost (mainly consisting of hand- 
set subsidies) to all customers acquired through the same 
acquisition channel. To prioritize acquisition channels in the 
allocation of a limited total budget, the CLV of an “aver- 
age" customer acquired through any channel is calculated. 
Because customers expect to replace their handset at regular 
time intervals, this CLV includes the expected cost of future 
upgrades. In reality, however, companies upgrade customers 
only at the end of their acquisition contract if the revenues 
generated until that point indicate sufficiently good quality 
to justify a reinvestment in the relationship. At this point, 
revenue that the customer generates is a much better predic- 
tor of future customer quality than simply the information 
about the acquisition channel. Therefore, mobile phone 
operators keep the freedom to exercise their option not to 
upgrade when prior revenues are unfavorable. This usually 
equates to an abandonment decision because a customer 
who does not receive a desired upgrade is likely to leave. 
However, CLV does not account for this flexibility, because 
it assumes a fixed expected cost of future upgrades for all 
customers acquired through the same channel. Thus, the 
value of the flexibility not to upgrade low-value customers 
is not included in the calculation of CLV. This makes the 
traditional CLV measure inconsistent with actual behavior. 
Therefore, using CLV determined this way when allocating 
a limited total customer acquisition budget across acquisi- 
tion channels is likely to lead to a bias. 

The remainder of this article is structured as follows: In 
the next section, we introduce real-options theory by show- 
ing how this valuation technique can overcome some limita- 
tions of simple NPV analysis and how it can be applied in a 
marketing context. We then use two examples to illustrate 
how the value of the option to abandon unprofitable cus- 
tomers can be calculated. We go on to validate our approach 
empirically using data provided by the Direct Marketing 
Educational Foundation (DMEF) to estimate the divergence 
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between CLV that includes and CLV that excludes option 
value. Subsequently, we conduct a Monte Carlo simulation 
to identify factors that influence this divergence. The article 
concludes with a discussion of the theoretical and manage- 
rial implications of our findings, as well as the limitations 
of our approach and areas of further research. 


Real-Options Theory 


Authors in the area of strategic investment decision making 
have noted that simple NPV analysis is associated with sev- 
eral implementation and conceptual problems. Although 
some of these problems can be cured by modifications of 
this approach that have been discussed in the literature (for 
a detailed list and explanation, see Adler 2000), a shortcom- 
ing remains (e.g., Amram and Kulatilaka 1999; Dixit and 
Pindyck 1995): NPV analysis assumes that one specific 
value for some kind of metric (e.g., cash flow) can be 
defined for each period within the time horizon of the 
analysis. Although it is possible to account for future uncer- 
tainty by specifying more than one state of nature per 
period, different values of this metric must be combined 
into one expected value before NPV can be calculated. 
Therefore, simple NPV analysis builds on the assumption 
that a project can be completely planned beforehand and 
ignores the project’s potential to create future options that 
are not foreseeable based on the current stock of knowl- 
edge. Such options could include the freedom to abandon 
the project in case of unfavorable conditions or to extend its 
scope in case of favorable ones. However, it is a characteris- 
tic of options to show an asymmetric behavior in net pay- 
offs because they always represent the right but not the 
obligation to carry out a future action. Thus, because NPV 
analysis ignores these options, it often underestimates the 
true value of an investment (e.g., Benninga and Tolkowsky 
2002). The literature on the strategic management-finance 
interface that has emerged around the valuation of such 
(real) options is extensive, whereas to our knowledge, there 
are only four studies that discuss this approach in marketing 
(Finch, Becherer, and Casavant 1998; Hogan and Hibbard 
2002; Levett et al. 1999; Slater, Reddy, and Zwirlein 1998). 
Nevertheless, the application of a real-options approach to 
customer acquisition/retention analysis using CLV seems to 
be particularly appropriate. These activities usually create 
future options because the decision to acquire or reinvest 1n 
a customer always creates the choice of further investment 
in the relationship in subsequent periods. 

An essential step after identifying the existence of a real 
option is its valuation. For this task, there are three different 
approaches that can be used: Black and Scholes's (1973; 
hereinafter, the Black-Scholes model) model, contingent 
claims valuation (Cox, Ross, and Rubinstein 1979), and 
dynamic programming (for an application to real-options 
valuation, see Copeland and Tufano 2004). The Black- 
Scholes model has the advantage of providing a closed- 
form solution for option value. It has previously been 
applied in the context of real-options analysis (e.g., 
Luehrman 1998), but its drawback is that it is computation- 
ally demanding. Contingent claims valuation is much easier 
to apply than the Black-Scholes model and uses less strin- 


gent assumptions, though it includes the Black-Scholes for- 
mula as a limiting case. It has also been used to value real 
options (e.g., Sachdeva and Vandenberg 1993), but it builds 
on the assumption of arbitrage-free markets and thus 
assumes that the real option is tradable. This assumption 
may be questionable in the context of CLV calculations. In 
addition, as Devinney and Stewart (1988) note, models that 
have their roots in financial markets cannot always be 
applied to product markets, because this may violate key 
assumptions underlying them. Therefore, we do not recom- 
mend either of these two approaches for valuing real 
options in a marketing context; rather, we suggest the use of 
a third method, the dynamic programming approach. 
Dynamic programming is a class of solution methods 
for solving sequential decision problems. It builds on the 
assumption that "the optimum way of getting from a par- 
ticular state to an end point is not a function of the path 
taken to that state" (Elton and Gruber 1971, p. 480). This is 
often also referred to as Bellman's principle of optimality. 
Thus, the optimal decision at any time t can be identified by 
maximizing the sum of the paybacks from following a par- 
ticular policy at time t plus the expected paybacks from fol- 
lowing an optimal policy from time t + 1 onward. In the 
context of real-options analysis, dynamic programming 
helps examine options from more of a decision theory point 
of view. It is not based on the same assumptions underlying 
financial option valuation. Nevertheless, despite this funda- 
mental difference, the contingent claim value of an invest- 
ment can also be identified by the dynamic programming 
approach. However, this is true only after parameter calibra- 
tion (Knudsen, Meister, and Zervos 1999) because the 
dynamic programming method assumes an exogenously 
determined discount rate, whereas contingent claims valua- 
tion determines a risk-adjusted discount rate endogenously. 
In the following section, we use two examples to show 
how dynamic programming can be applied to valuing a 


seller's abandonment option to terminate business with a 
customer who has proved to be unprofitable. We propose a 
real-options-based approach to CLV, which explicitly val- 
ues the option to abandon unprofitable customers. 


Examples 


Imagine a marketing. manager who must choose between 
implementing one of two alternative advertising campaigns 
(A and B). With Campaign A, the company can attract 
“good” customers with a probability of 30% and "bad" ones 
with a probability of 70%. A good client will generate a net 
profit of $1,250 in Period 1, whereas a bad one is assumed 
to result in a net loss of —$470 in the same period. For both 
types of clients, this net contribution can either improve or 
deteriorate by 2096 in Period 2, whereas the likelihood of an 
improvement will be 596 for a good and 1096 for a bad cus- 
tomer. In this context, the term improvement is viewed from 
a company perspective and represents either a profit 
increase or a loss decrease (see Figure 1). With Campaign 
B, only one type of customer, one that generates a constant 
net profit of $46 in Periods 1 and 2, can be acquired. For 
both campaigns, the average cost of recruiting the customer 
and setting up a new account is assumed to be $50, and the 
discount rate is 15%. Using an average lifetime of two peri- 
ods, the CLV of a newly acquired customer based on a sim- 
ple NPV analysis is approximately —$16 for Campaign A 
and $75 for Campaign B. Because the CLV of Campaign B 
is greater than that of Campaign A and greater than the 
acquisition cost, the decision would be in favor of Cam- 


paign B.! 


L Although the example is an artificial one, the key numbers can 
be considered realistic for the special case of the retail banking 
1ndustry. The net contribution of a good and bad customer in Cam- 
paign A ıs determined on the basis of an expected revénue of 
$1,600 and $230 and the expected cost of $350 and $700, respec- 


FIGURE 1 
Expected Net Contribution of Customers Acquired Through Campaign A (Example 1) 


Perlod 0 Period 1 Perlod 2 
Acquisition cost Probability Net profit Probability | Net profit 
Net profit Improvement (+20%) 
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$1,500 


(1.2 x $1,250) 





-~ Net profit deterioration (-20%) 


. . $1,000 
(.8 x $1,250) 


95% | 


Net loss improvement (—20%) 


—$376 
b | (8 x~$470) 
Net loss deterioration (+20%) 
d -$564 
90% ' (1.2 x -$470) 
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However, as we noted previously, NPV analysis does 
not account for the flexibility of the marketing manager 
after the end of Period 1. At that point, the marketing 
manager knows (e.g., on the basis of experience from com- 
parable campaigns already carried out by the company) that 
a customer who proved to be bad in Period 1 will generate 
an expected loss in Period 2. Therefore, the marketing 
manager has the option to behave asymmetrically in Period 
2; that is, in the case of a good customer, the client relation- 
ship will continue, and in the case of a bad one, it will be 
abandoned. This asymmetrical behavior protects the 
manager from the downside risk to realize the loss of a bad 
customer in Period 2. Simple NPV analysis does not 
account for this option because only one value for the 
expected net contribution of a customer in Period 2 needs to 
be determined. We assume that the marketing manager 
decides to terminate the client relationship with a bad cus- 
tomer at the end of Period 1 and that this abandonment does 
not cause additional cost. In this case, the lifetime value of 
a bad customer increases from —$821 to —$409, which 
increases the expected CLV from -$16 to $273. On the 
basis of this increase of $289, which can be interpreted as 
the value of the abandonment option, Campaign A now 
seems far more attractive than Campaign B. 

The previous example builds on a simple situation and 
attempts to explain the basic idea behind our approach in 
more detail. Now, we apply our approach to a more com- 
plex and, thus, more realistic situation. 

Imagine that a marketing manager is faced with the 
decision to acquire a new customer, Mrs. Smith. Again, we 
assume that this acquisition costs $50. In every period after 
acquisition, Mrs. Smith can purchase a high-margin prod- 
uct, a low-margin product, or no product at all. We assume 
that a high-margin product generates a revenue of $200 and 
that a low-margin product generates a revenue of $100. In 
every period, the bank invests marketing costs into the rela- 
tionship with Mrs. Smith. In the first period after acquisi- 
tion, these costs are assumed to be $100. In every following 
period, they depend on Mrs. Smith’s purchase in the previ- 
ous period. If she purchases a high-margin product, market- 
ing costs will be $100; if she purchases a low-margin prod- 
uct, marketing costs will drop to $50; and if she makes no 
purchase, marketing costs will be $25. After every period, 
the marketing manager has the choice to terminate the rela- 
tionship with Mrs. Smith. However, we assume that this 
decision is associated with an abandonment cost of $25. On 
the basis of previous experience with customers similar to 
Mrs. Smith in terms of a set of observable characteristics 
(e.g., acquisition channel, demographics, socioeconomic 
status), the marketing manager expects a purchasing behav- 





tively. This corresponds to the figures given for the most profitable 
6% and the least profitable 14% for customers at First Chicago 
(O’Sullivan 1997, p. 44). The customer acquisition cost of $50 is 
in line with Reichheld and Sasser’s (1990, p. 106) estimations for 
the credit card industry. We also refer to the work of Berger, Wein- 
berg, and Hanna (2003) for a detailed discussion of how these val- 
ues can be estimated using data about comparable campaigns 
already carried out by the company. All CLVs we report in this 
section exclude acquisition cost. 
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ior for the three periods after acquisition (see Figure 2). The 
discount rate assumed for the analysis is 1596. 

Using this information, Mrs. Smith's CLV is $46.51 
when option value is excluded. Because this is less than the 
acquisition cost of $50, the decision would be not to acquire 
Mrs. Smith. However, this CLV does not account for the 
option to behave asymmetrically in future periods. After 
every period, the marketing manager can terminate the rela- 
tionship with Mrs. Smith depending on the future net con- 
tribution streams expected from this relationship. If we 
apply a dynamic programming approach to this example, 
there are three points at which abandonment is economi- 
cally sensible: (1) when Mrs. Smith purchases a high- 
margin product in the period after acquisition, (2) when she 
purchases a low-margin product in the period after acquisi- 
tion and a high-margin product in the next period, and (3) 
when she makes no purchase in the period after acquisition 
and purchases a low-margin product in the next period. 
Including the value of the option to abandon Mrs. Smith at 
these three points in time leads to a CLV of $52.97, which 
is greater than the acquisition cost. Thus, when option value 
is accounted for, the acquisition of Mrs. Smith is a worth- 
while strategy. 

Abandoning Mrs. Smith at these three points makes 
sense because of a combination of two factors: First, there 
is a relatively high probability of no purchase in the third 
period after acquisition, especially when purchases have 
been carried out in previous periods. This can be explained 
either by Mrs. Smith's saturation in terms of product con- 
sumption or by budget constraints she might be facing. For 
example, after Mrs. Smith purchases a high-margin product 
in Periods 1 and 2 after acquisition, the probability that she 
will not purchase in the following period is 8096. Second, 
marketing expenditures depend on previous purchasing pat- 
terns. This makes a no-purchase scenario highly unattrac- 
tive when, for example, it is preceded by a high-margin 
product purchase. There are also other situations in which 
Mrs. Smith generates a loss; however, in such cases, this 
loss is less than the abandonment cost of $25, so the aban- 
donment is not a recommendable strategy (given the time 
frame chosen for analysis). This example shows that in a 
more complex and realistic setting, the value of the aban- 
donment option requires calculations that can no longer be 
easily carried out using simple spreadsheet methods. Thus, 
there is a need to apply more advanced option-pricing 
approaches. 

At this point, it is important to note how real-options 
analysis differs from two other techniques that have been 
discussed in the literature: Bayesian updating and elemen- 
tary decision tree analysis. Bayesian updating is closely 
linked to real-options analysis in that both techniques deal 
with the revision of prior beliefs in light of new informa- 
tion. However, there are some specific characteristics of real 
options, for example, the inherent asymmetry included in 
their analysis or that the value of a real option increases 
with both reduced and increased uncertainty. Thus, it is 
more appropriate to view Bayesian updating as a routine 
component of real-options analysis than to view both tech- 
niques as identical (for a more detailed discussion, see 
Herath and Park 2001; Miller and Park 2005). With respect 
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to the difference between real-options and elementary deci- 
sion tree analysis, our example could give the impression 
that real-options analysis is only the folding back of a deci- 
sion tree. For our example, these two approaches are indeed 
equal because we voted for the use of dynamic program- 
ming as a valuation technique. Nevertheless, this equality is 
not generalizable to all situations, because other researchers 
might want to vote for other valuation techniques. For 
example, the Black-Scholes model, which we apply subse- 
quently in the context of our Monte Carlo simulation, has 
its roots in the solution of heat-transfer equations in physics 
and therefore is not related to decision tree analysis. 


Empirical Validation 


The aforementioned examples, which are built on hypo- 
thetical figures, should be viewed only as an illustration of 
our basic idea. In this section, we focus on empirically vali- 
dating our approach and on obtaining an estimate of the 
divergence between CLV that includes and CLV that 
excludes option value. We use empirical data provided by 
the DMEF (academic data set three), which consists of 12 
years of sanitized purchasing data for more than 100,000 
customers of a specialty catalog company. Our empirical 
validation consists of four steps: First, we estimate the 
future purchasing behavior (i.e., probability of purchase and 
dollar amount expected to be spent) for a set of customer 
segments using the recency, frequency, and monetary value 
(RFM) approach. Second, we determine costs generated per 
customer per year. Third, we use these two inputs to esti- 
mate the expected future revenue and profit contribution for 
each customer segment. Fourth, we calculate CLV that 
includes versus CLV that excludes option value and com- 
pare these values on customer base level to obtain an esti- 
mate of the divergence between them (for computational 
details regarding our approach, see the Appendix). 

The first step, the estimation of the future purchasing 
behavior for a set of customer segments, is carried out with 
the RFM approach. The RFM approach, which Cullinan 
(1978) first presented and Bauer (1988) subsequently 
extended, assumes that a customer’s future purchase behav- 
ior is determined by three purchase history variables: 
recency, frequency, and monetary value. Recency is the 
time elapsed since the previous purchase, frequency is the 
number of purchases made ın a specified time period, and 
monetary value is the dollar sales value of prior purchases 
(Bauer 1988). Although other, more advanced techniques 
have been proposed in the literature to model future cus- 
tomer behavior, such as the Pareto/NBD model (Schmitt- 
lein, Morrison, and Colombo 1987) and the beta-geometric/ 
NBD framework (Fader, Hardie, and Lee 2005a), we 
decided to apply the RFM approach in our context. Because 
the focus of our empirical validation lies in comparing two 
approaches to determining CLV given an assumed future 
purchasing behavior, and not in evaluating the predictive 
accuracy of these methods or in proposing a better method 
to forecast customer activity, the RFM approach, which ıs 
widely known and easy to use, is preferable. 

To use the RFM approach, we first split the 12 years of 
historical purchase data available in our data set into two 
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groups, the first one covering two-thirds (8 years, from t — 
12 to t — 5) and the second group covering one-third (4 
years, from t — 4 to t — 1). We then determine recency 
(operationalized as the number of years since the last order) 
and frequency (measured as the total number of years in 
which at least one purchase occurred) as of tọ = t — 4. This 
leads to eight values for recency and frequency. From the 
total number of 106,284 customers in our data set, 47,617 
could be assigned to one of the resulting 36 distinct 
recency—frequency cells. The remaining 58,667 customers 
did not make any purchases between t — 5 and t — 12 and 
were discarded from the analysis. 

For each of these 36 customer segments, we subse- 
quently calculate conditional purchase frequencies and 
average dollar amounts spent for all possible purchasing 
patterns between years t — 4 and t — 1 (for a similar 
approach, see Berger, Weinberg, and Hanna 2003). Figure 3 
visualizes the purchasing pattern of one segment, namely, 
customers with a recency and frequency of 1 (ie., cus- 
tomers who last purchased in t — 5 and conducted exactly 
one purchase between t — 5 and t — 12). For ease of notation, 
we refer to this segment as "Segment 1" in the following 
discussion. In this graph, a purchase in one year is repre- 
sented by an upward move and a white cell, whereas no 
purchase is represented by a downward move and gray- 
shaded cell. From the total of 8345 customers in this seg- 
ment, 2715 conducted a purchase in t — 4, and 5630 did not. 
Thus, the conditional purchase frequency for a customer to 
carry out a purchase in t — 4 is 32.5396 (2715/8345). 
Because these 2715 customers spent a total of $229,088, the 
associated average dollar amount spent is $84.38 per cus- 
tomer ($229,088/2715). In total, there are 15 conditional 
purchase frequencies and average amounts spent (corre- 
sponding to the 15 white cells) per customer segment that 
can be determined this way. The resulting 15 pairs of values 
serve as a proxy for future purchasing behavior in t to t + 3 
for all consumers who belong to this specific segment. 

The second step consists of the estimation of costs gen- 
erated per customer per year. Specifically, we include two 
types of costs in our model: the cost of goods sold and 
direct marketing expenditures. We assume that the cost of 
goods sold represents a certain percentage of the average 
dollar amount spent per year, and we estimate this percent- 
age to be approximately 5596 on the basis of data available 
in our data set. For direct marketing expenditures, we face 
the problem that the data set does not provide sufficient 
information for their estimation. Therefore, we build on the 
work of Gónül and Shi (1998, p. 1256), who assume an 
average cost per mailing of $2.50. On the basis of the 
assumption that our specialty catalog company might send 
out between zero and four catalogs per year (i.e., at a maxi- 
mum of one catalog per season), we assume direct market- 
ing expenditures to vary between $0 and $10 per customer 
per year. Because there is no way to determine which of 
these values is more likely than the others, we calculate 100 
different scenarios by increasing cost in steps of $.10, 
beginning at $0 and continuing to $10.2 





“We allow marketing expenditures to vary in $.10 increments 
instead of assuming five distinct values ($.00, $2.50, $5.00, $7.50, 


FIGURE 3 
Purchasing Pattern of Customer Segment 1 In Years t — 4 to t — 1 
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In the third step, we use these two inputs to estimate the 
expected future revenue and profit contribution of an aver- 
age customer in each segment. We first calculate expected 
revenue as the average amount spent given a certain pur- 
chasing pattern multiplied by the probability that this pur- 
chasing pattern can be observed. We then calculate profit 
contribution as expected revenue less the cost of goods sold 
less marketing expenditures. Marketing expenditures are 
weighted with the probability of observing last year’s pur- 
chasing pattern because we assume that they are paid out at 
the beginning of each year. Figure 4 shows the resulting 
profit contributions for an average customer in Segment 1, 
determined on the basis of the information provided in Fig- 
ure 3 and the assumption that marketing expenditures are 
equal to $5 per customer per year. 

In the fourth step, we calculate CLV that includes and 
CLV that excludes option value. We discount profit contri- 
butions back to the beginning of year t and sum them up, 
which results in a CLV of an average customer in Segment 
N that excludes option value CLVNggoy. We make the sim- 
plifying assumption that profit contributions are realized in 
the middle of each year because expected revenue can occur 
at any time, whereas we assume that marketing expendi- 
tures are paid out at the beginning of each year. Using a dis- 
count rate of 20% and the example of Segment 1, CLV! oy 
is equal to $12.91. To calculate CLV that includes option 
value CLVNioy, we must first determine the activity status 
of each customer conditional on his or her purchasing pat- 
tern. To do this, we make two assumptions: (1) that a cus- 
tomer should be abandoned in year T, whenever the sum of 
expected future profit contributions from T onward dis- 
counted back to T is less than the cost associated with the 
abandonment, and (2) that an abandoned customer becomes 
inactive for all years after abandonment. Then, it is straight- 
forward to define a variable that describes the activity level 
of each customer conditional on his or her purchasing pat- 
tern, which takes the value of 1 in the case of the customer 
being active and 0 in the case of abandonment. In the spe- 
cific case of Segment 1, this rule results in the decision that 
a customer who did not conduct a purchase during time t 
should be abandoned. Thus, CLVN;oy can be defined as the 
discounted sum of expected profit contributions multiplied 
by the corresponding activity level. On the basis of a dis- 
count rate of 2096 and the assumption that abandonment is 
not associated with any cost, CLV !joy is equal to $13.65 for 
the example of Segment 1. To obtain an estimate of the 
divergence between these two values of CLV on customer 
base level, we multiply CLVNgoy; and CLVNioy by the total 
number of customers in Segment N and sum them over N. 
Although this is not strictly necessary—a comparison could 
also be carried out between CLVNgoy and CLVhjoy 
directly—we carry out this calculation to obtain a weighted 
average of these two values over all segments. 


and $10.00) to allow for cases in which average cost per mailing 
differs from $2.50 and/or more than four mailings are sent out per 
customer per year. We assume that these expenditures are uniform 
across customers and include both fixed and variable costs. How- 
ever, they are limited to retention cost only because we assume 
that no acquisitions take place 1n the analysis period 
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Figure 5 shows the divergence (calculated as CLVjoy/ 
CLVgoy — 1) between these two values as a function of 
marketing expenditures per customer per year. The result 
supports our hypothesis that CLV that excludes option value 
underestimates customer value. For marketing expenditures 
equal to $5 per customer per year, the divergence is ~1.5% 
on average, and for marketing expenditures greater than 
$6.50, it consistently exceeds 596. For any given marketing 
expenditure, the divergence at the customer segment level 
differs greatly from one segment to another. For example, in 
the case of marketing expenditures of $7.50 per customer 
per year, segment-specific divergence varies from 0% to 
70%. This latter point shows that there may be factors other 
than cost that determine the divergence. Therefore, we con- 
duct a Monte Carlo simulation to investigate this point 
further.? 


Monte Carlo Simulation 


The purpose of our Monte Carlo simulation is to gain a bet- 
ter understanding of the influence of factors other than cost 
on the divergence between CLV that includes and CLV that 
excludes option value. Specifically, we focus on the poten- 
tial impact of a change in expected revenues, which we 
modify by changing the standard deviations and average 
values for the probability of purchase and dollar amount 
expected to be spent. The simulation consists of two steps: 
First, we analytically identify which kind of influence these 
factors can be expected to have on the divergence. Second, 
we conduct a set of Monte Carlo simulations to determine 
the order of magnitude of this influence. 

For the first step, the analytical identification of the kind 
of influence to be expected, we build on the equations that 
Black and Scholes (1973) derive to value financial options. 
The Black-Scholes model has the advantage of providing a 
closed-form solution for the value of a (real) option, which 
is not given either for contingent claims valuation or for 
dynamic programming. Although we applied dynamic pro- 
gramming, not the Black-Scholes model, as a valuation 
technique in our empirical validation and the following 
Monte Carlo simulation, we are comfortable with this 
choice in the context of the analytical verification. The 
Black-Scholes model is included as a limiting case in con- 
tingent claims valuation (Cox, Ross, and Rubinstein 1979), 
and contingent claims valuation and dynamic programming 
are equivalent modulo parameter calibrations (Knudsen, 
Meister, and Zervos 1999). 

Black and Scholes (1973) show (not building on 
dynamic programming but on the solution of heat-transfer 
equations in physics) that the value of a put option Vpop 
(i.e., the right to sell a stock) can be calculated using the 
following three equations: 


3We also conduct a similar analysis in which cost of goods sold 
represent 5096 and 6096 of revenue. The results show that an 
increase (decrease) in cost of goods sold simply results in an 
upward (downward) shift of the graph without substantially influ- 
encing variance in segment-specific divergence. 

^We use the equations that describe a put versus a call option 
because the real option of abandonment is more similar to a put 
option 1n financial terms (Sachdeva and Vandenberg 1993). 


FIGURE 4 
Profit Contribution of an Average Customer in Segment 1 
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stock, and t represents the time to expiry. The notation In(a) 
is used to describe the natural logarithm, and N(a) is used to 
represent the cumulative normal density function. In the 
context of real-options analysis, this interpretation changes 
slightly. As Luehrman (1998) discusses, x can then be 
viewed as the present value, v? can be viewed as the riski- 
ness of expected cash flows, and c can be interpreted as the 
present value of fixed costs. As we stated previously, the 
focus of our Monte Carlo simulation is to investigate the 
potential impact of a change in the standard deviations and 
average values assumed for the probabilities of purchase 
and dollar amounts expected to be spent. Because expected 
revenues can be determined as the product of these two 
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FIGURE 5 
Divergence Between Customer Base Values Determined by Including Versus Excluding Option Value 
(Empirical Validation) 
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parameters, this is equal to investigating a change in the 
Standard deviations and average values of expected reve- 
nues. Using the terminology of the Black-Scholes model, 
we focus on the potential impact of changes in v2 and x on 
real-options value. 

To derive the impact of these two variables on option 
value analytically, we calculate the partial derivatives of 
Vro,p With respect to v and x. Keeping in mind that 
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this can be simplified to 
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(8) Trop = ce-r tf (-d,) and 
3V 
(9) — Nd): 


In this simple form, the partial derivative of Vgo p with 
respect to v has a positive sign, and the one with respect to 
x has a negative sign. Thus, Vag p increases with increasing 
v and decreasing x. In other words, the value of the real 
option of abandonment increases with increasing uncer- 
tainty and decreasing present value of expected cash flows. 
Translating this back to the factors we investigated in the 
context of the Monte Carlo simulation, this implies that the 
difference between CLV joy and CLV gov can be expected to 
increase with increasing standard deviations and decreasing 
average values of future purchase probabilities and dollar 
amounts expected to be spent. 

In the second step, we then carry out a set of Monte 
Carlo simulations to get an idea of the order of magnitude 
of these influences. For this, we build on the always-a-share 
CLV example that Dwyer (1989) proposes. Dwyer’s 
approach is essentially equivalent to the one we used in the 
empirical validation; it is characterized by the following 
two assumptions: First, purchase probabilities, dollar 
amounts expected to be spent, and marketing expenditures 


are assumed to depend on the recency cell of each customer 
(see Table 1). Second, cost of goods sold is assumed to rep- 
resent 60% of revenue. Dwyer’s example is more realistic 
than our empirical validation in some dimensions (i.e., if we 
assume that marketing expenditures depend on each cus- 
tomer’s recency rather than that they are constant, as in our 
case) and less in others (i.e., if we assume that purchase 
probabilities and dollar amounts expected to be spent 
depend only on recency rather than that purchase probabili- 
ties and dollar amounts expected to be spent depend on 
recency and frequency, as in our analysis). However, 
because the focus of our Monte Carlo simulation is to inves- 
tigate how the divergence between CLV that includes and 
CLV that excludes option value changes with a variation in 
model parameters, it is not essential that the simulation is 
completely identical to our empirical validation. It is neces- 
sary only to ensure that the assumptions underlying it 
change in a controlled manner among different scenarios. 
However, it should be kept in mind that the absolute values 
of divergence are not comparable. In total, we carried out 
1.8 million simulations, split into nine scenarios with 
200,000 runs each, to investigate how the divergence 
changes when we change (1) the standard variation in future 
purchase probabilities and dollar amounts expected to be 
spent and (2) the average values assumed for these two 
parameters. 

To address the first objective, the investigation of the 
change in the divergence associated with a change in stan- 
dard deviations, we assume that the eight values for pur- 
chase probabilities and dollar amounts expected to be spent 
(one probability and one dollar amount for each of the four 
recency cells) behave according to a normal distribution 
with average values as assumed by Dwyer (1989; see Table 
1) and standard deviations that represent 5% of these aver- 
age values. For modeling purchase probabilities, we used a 
truncated normal distribution to avoid probabilities that 
were inferior to 0 and larger than 1. We then conducted the 
following three-step procedure: First, we simulated 20,000 
customers by calculating probabilities and dollar amounts 
expected to be spent on the basis on these distributional 
assumptions. Second, we determined the resulting diver- 
gence between CLV that includes and CLV that excludes 
option value for each of them. We excluded all customers 
for which CLVgoy < 0 because this resulted in CLVyjoy = 0 
and a divergence of —100%. Third, we calculated an overall 
average divergence for all 20,000 customers as the median 
of all customer-level divergences. We decided on the 
median rather than the arithmetic mean to obtain a con- 
servative estimate and to avoid biases caused by outliers as 


far as possible. We subsequently increased the standard 
deviations in 596 increments up to 5096 of the average val- 
ues and followed the same three-step procedure described 
previously. The results of these 200,000 runs (10 different 
standard deviations with 20,000 runs reach) appear in Fig- 
ure 6, in which we plotted the resulting overall average 
divergence as a function of the standard deviation. 

Figure 6 shows two interesting properties: First, there 
appears to be a quadratic relationship between the average 
divergence and the standard deviation. This is also con- 
firmed by an associated analysis we conducted using SPSS. 
Regressing overall average divergence on standard devia- 
tion results in an adjusted R-square of .735, an F value of 
26.01, and a t-value of the dependent variable of 5.10, 
whereas regressing it on the squared standard deviation 
(i.e., variance) results in an adjusted R-square of .909, an F 
value of 91.35, and a t-value of 9.56. This relationship is 
intuitive because increasing variance is associated with 
increasing probabilities for unprofitable customer relation- 
ships and, thus, increasing occurrence (and value) of aban- 
donment decisions. Second, the absolute impact of standard 
deviation on average divergence seems to be marginal. 
Although standard deviation increases tenfold from 5% to 
5096, the average divergence increases only slightly (from 
85% to 1.21%). In summary, this analysis confirms our 
theoretical finding that average divergence increases with 
increasing standard deviation. Nevertheless, this increase 
seems to be of limited size.° 

With respect to the second objective, the analysis of a 
potential influence of a change in average values on a 
change in overall average divergence, we again use Dwyer's 
(1989) assumptions as a starting point. We subsequently 
modify the average values for purchase probabilities and 
dollar amounts spent by either doubling or halving the val- 
ues. This leads to seven other scenarios; for each scenario, 
we conducted the same analysis described previously (i.e., 
increasing the standard deviation in 596 increments from 
596 to 5096 and determining an overall average divergence), 
the results of which appear in Table 2. The overall average 
divergence increases with decreasing purchase probabilities 
and dollar amounts spent, which is in line with our theoreti- 
cal expectations. For the range of values we investigated in 
our simulation, this effect seems to be substantially larger 
than the impact of standard deviation on average diver- 


5This result was also confirmed in the seven other scenarios we 
calculated. Because of space constraints, we show only one graph, 
but this additional information can be obtained from the first 
author on request. 





TABLE 1 
Assumptions of Always-a-Share CLV Example as Proposed by Dwyer (1989) 

Probabllity of Amount Expected Marketing 
Hecency Purchase In to Be Spent in Expenditures In 
Cell Description Period t (96) Period t ($) Period t ($) 
1 Customer last purchased in t — 1 30 100 3.60 
2 Customer last purchased in t — 2 20 80 3.10 
3 Customer last purchased In t — 3 15 60 1.80 
4 Customer last purchased in t — 4 5 40 1.30 
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FIGURE 6 
Divergence Between Customer Base Values Determined by Including Versus Excluding Option Value 
(Simulation, Average Values as Noted In Table 1) 
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TABLE 2 
Divergence Between Customer Base Values Determined by Including Versus Excluding Option Value 

(Simulation) 

a es UO 
Dollar Amount Expected to Be Spent 

Purchase Probabllity Halved Unchanged Doubled 
Halved 11.52% 1.58% 
Unchanged 5.99% 94% .0396 
Doubled .2596 .00% .00% 


Notes. “Halved,” “unchanged,” and “doubled” refer to the average values In Table 1. We did not calculate an overall average divergence for the 
scenano in which both purchase probabilities and dollar amounts spent were halved, because In this case, for all standard deviations, 


at least 80% of all simulated customers resulted In CL Vgoy < 0. 


gence. In addition, a change in average purchase probabili- 
ties has a greater impact than a change in average dollar 
amounts spent. 


Theoretical Implications and 
Managerial Relevance 


In summary, several authors have used CLV to develop 
models that focus on the allocation of scarce marketing 
resources (e.g., Reinartz, Thomas, and Kumar 2005; Rust, 
Lemon, and Zeithaml] 2004; Venkatesan and Kumar 2004). 
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However, it has been discussed in the literature that simple 
NPV analysis may not reflect the value of the flexibility to 
make such decisions. Therefore, some authors recommend 
the use of real-options analysis in certain situations (e.g., 
Amram and Kulatilaka 1999; Dixit and Pindyck 1995; 
Pindyck 1991). Building on this stream of research, we pro- 
posed a real-options approach for customer valuation. 
Using the case of the option to abandon unprofitable cus- 
tomers, we first illustrated through example the basic idea 
behind real-options analysis and its application to determine 
CLV. We then used data provided by the DMEF to validate 


our approach empirically and to obtain an estimate of the 
divergence between CLV that includes and CLV that 
excludes option value. Finally, we conducted a set of Monte 
Carlo simulations to obtain a better understanding of factors 
that influence this divergence. We showed both analytically 
and through our simulation that the divergence increases 
with (1) decreasing future purchase probabilities, (2) 
decreasing dollar amounts expected to be spent in future 
purchases, and (2) increasing variance in these two param- 
eters. Regarding the order of magnitude of these influences, 
our simulations provide evidence that the impact of a 
change in variance is marginal compared with a change in 
average values. In addition, a change in average purchase 
probabilities has a larger impact than a change in average 
dollar amounts spent. 

From a theoretical perspective, our approach contributes 
to research in the area of CLV by proposing a methodology 
that can be used to incorporate the value of the seller’s 
flexibility into CLV. In this sense, it adds to the literature in 
three ways: First, it complements Levett and colleagues’ 
(1999) publication and adds to Rust, Lemon, and Zeit- 
haml’s (2004) work, both of which explicitly model the cus- 
tomer’s flexibility when determining CLV. Second, it 
addresses two research directions that Hogan and Hibbard 
(2002) propose; specifically, by using the example of the 
option to abandon unprofitable customers, we developed a 
metric to quantify the value of business relationships using 
the real-options-based framework by explicitly modeling 
the decision to “fire a customer.” Finally, the current study 
can be viewed in the context of Hogan, Lemon, and Libai's 
(2003) article because we provide evidence that customers 
who appear to be unprofitable from an NPV-based CLV 
perspective may not be as unattractive as they seem. 

Therefore, on the basis of our findings, we recommend 
including option value in the calculation of CLV. Our 
results show that neglecting option value can lead to a sub- 
stantial underestimation of true customer value. Because 
this underestimation is not uniform across the whole cus- 
tomer base, it is likely to affect decisions made on the basis 
of a rank order of customers developed using CLV. The 
results of our Monte Carlo simulation show that customers 
with low probabilities of future purchase and low dollar 
amount expected to be spent tend to be underestimated 
more often than customers with more favorable values 
along these two dimensions. Thus, a rank order of cus- 
tomers based on CLV may be different when CLV is calcu- 
lated including option value than when it is calculated 
excluding option value. Most methods for the allocation of 
scarce marketing resources recommended in the literature 
(e.g., Reinartz, Thomas, and Kumar 2005; Rust, Lemon, 
and Zeithaml 2004; Venkatesan and Kumar 2004) build on 
such a rank order of customers and recommend the alloca- 
tion of more resources to customers with higher ranks. 
However, our results lead to the hypothesis that this alloca- 
tion could be substantially different if option value were 
included. 

With respect to the implementability of our approach, 
we showed in our example and our empirical validation that 
the calculation of CLV that includes option value does not 


require additional data compared with the determination of 
CLV that excludes option value. Many companies already 
classify customers into a set of distinct RFM cells. Thus, 
the basic information to determine option value is most 
likely already available in any given company. If it is not 
available, it can easily be generated if information about the 
purchase history of each individual customer is known. 
Building on this allocation of customers into distinct RFM 
cells, it is possible to determine option value using standard 
spreadsheet software. We conducted all calculations in this 
article using Excel 2002. 

Regarding the practicability of abandoning unprofitable 
customers, approaches have been discussed in the literature 
for a long time under the label “selective demarketing” 
(Kotler and Levy 1971). With the increasing importance of 
CLV, the topic has recently received the attention of a wider 
range of researchers (e.g., Rosenblum, Tomlinson, and 
Scott 2003; Selden and Colvin 2003; Zeithaml, Rust, and 
Lemon 2001). Techniques for dealing with unprofitable 
customers that have been proposed in this context can be 
grouped into “hard” and “soft” approaches. Hard 
approaches imply telling the customer directly that he or 
she is abandoned. For example, a recent article in The Seat- 
tle Times (Freed 2004) speaks about the Massachusetts- 
based fashion retailer Filene’s Basement. This company 
banned two sisters, who claimed to have been loyal cus- 
tomers for years, from all 21 of its stores after deciding that 
they returned too many items after wearing them for an 
evening and complained too often about service. In contrast 
to this strategy, soft approaches summarize all actions that 
can be taken to make the customer leave on his or her own. 
Among others, this includes not sending any promotional 
materials to such customers, introducing additional fees 
(e.g., chargeable customer service calls), and stopping cer- 
tain services (Selden and Colvin 2003). 


Limitations and Areas of Further 
Research 


Our study has two limitations that further research could 
address: First, our findings apply only to the specific type of 
option we analyzed, namely, the abandonment option. Thus, 
the degree of divergence we determined during our empiri- 
cal validation and the factors that influence it should not be 
generalized to other types of real options. We chose this 
specific option type as an illustration because some authors 
argue that the flexibility that lies at the core of the real- 
options approach requires the possibility of abandoning 
investment initiatives. This perspective makes abandonment 
essential for limiting downside risk (Adner and Levinthal 
2004). It also differentiates abandonment options from 
other option types that have been discussed in the literature 
(i.e., defer, learning, growth, and switching options). How- 
ever, our approach can be modified easily to reflect these 
types of options. Nevertheless, further analysis is needed to 
obtain a better understanding of this type and other types of 
options. 

Second, our empirical analysis and Monte Carlo simula- 
tion both assume that abandoning a customer is not associ- 
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ated with any cost. However, this is unlikely to be the case. 
Although it may be possible to cease business with an 
unprofitable customer without creating additional direct 
cost, abandonment carries the risk of creating an image of 
unfairness and discrimination in the market, which could 
lead to substantial,indirect cost (Cullwick 1975). For exam- 
ple, previous research has shown that companies that 
behave unfairly may be punished in the long run, even when 
this results in cost to their customers (Kahneman, Knetsch, 
and Thaler 1986). One such punishment could be the spread 
of negative word of mouth, the effects of which can be sub- 
stantial if the number of customers who experience dissatis- 
faction is high enough (Richins 1983). Such indirect aban- 
donment costs should be considered, perhaps by an 
approach similar to the one that Hogan, Lemon, and Libai 
(2004) use to quantify positive word of mouth. 

In addition to these two areas of further research, we 
believe that studies calculating real-options value by mak- 
ing use of more sophisticated approaches to model future 
customer behavior, such as the Pareto/NBD (Schmittlein, 
Morrison, and Colombo 1987) or the beta-geometric/NBD 
(Fader, Hardie, and Lee 2005a) framework, would be 
worthwhile. Such methods could help account for hetero- 
geneity among different customers more effectively than 
the RFM approach we used in our empirical validation, 
which could contribute to a better understanding of 
customer-specific factors that influence option value. Fur- 
thermore, this line of research would be an initial step in 
marrying our approach with the work done regarding the 
calculation of CLV based on these approaches (e.g., Fader, 
Hardie, and Lee 2005b; Reinartz and Kumar 2000, 2003). 
Finally, we believe that combining our work with publica- 
tions focused on the integration of the buyer's flexibility 
into CLV (e.g., Levett et al. 1999; Rust, Lemon, and Zeit- 
haml 2004) to come up with an integrated model that simul- 
taneously values the buyer's and seller's options would be a 
promising area of further research. Such a model could be 
helpful in two ways: First, 1t could further improve the 
understanding of the "true value of a lost customer" 
(Hogan, Lemon, and Libai 2003) and, consequently, the 
quality of decisions regarding customer selectivity by better 
distinguishing between good and bad customers. Second, it 
could contribute to firms making better recommendations 
about the allocation of scarce marketing resources based on 
a more profound understanding of "return on marketing" 
(Rust, Lemon, and Zeithaml 2004). 


Appendix 

In the following notation, let XN;,4 describe a variable, X, 
which applies to an average customer of Segment N (N - 1, 

., 36) with purchasing pattern ijkl. Here, i(j, k, 1) takes 
the value of 1 in the case of a purchase having taken place 
in year t (t+ 1, t + 2, t + 3) and 0 otherwise. Furthermore, 
let COGS represent cost of goods sold as a percentage of 
sales and ME stand for fixed marketing expenditures per 
customer per year that are assumed to be paid at the begin- 
ning of each year. Finally, let fa and AAT. describe the 
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purchase frequencies and average amounts spent deter- 
mined on the basis of the RFM approach. 

Using these notations, we can ERE the expected 
revenue, ERN p and profit contribution, PCN p as follows: 


(Al) PCN = ERN x (1- COGS) - ME, with 
ERN = fN x AANE; 
(A2) PCH = ERN x(1- COGS) - fN x ME, with 


ERN = fN x f x AAN; 


(A3) PC), = ERJ X(1- COGS) - fN x fN x ME, with 
ERN = £8 x fh x ff x AAT; and 
(A4) PCI, = ER, X (1- COGS) - fN x fN x fÑ x ME, with 


ERR pef x fh x fi x fax AAR. 


On the basis of Equations A1—AA, it is now straightfor- 
ward to calculate CLV that excludes option value. If we 
take a discount rate of d% per year and, for the sake of sim- 
plicity, assume that the profit contribution of each customer 
is paid out in the middle of each year (expected revenue can 
occur at any time, whereas marketing expenditures are 
assumed to be paid out at the beginning of each year), we 
can calculate the CLV for an average customer in Segment 
N based on a lifetime of four years (t to t+ 3), CLVNgoy as 


follows: 
N N 
Zeh Lee Yer 2 
(A5) EOV (1-d35 (1-d25 (14dy5 (14 d)5 


To determine CLV that includes option value, it is first 
necessary to define a decision rule that provides informa- 
tion about when a customer should be abandoned. Let 
DVN./T, be the value of an average customer of Segment N 
at the beginning of year T (T =t + 1, t + 2, t + 3), which we 
can calculate as the sum of expected future profit contribu- 
tions from T onward, discounted back to T, according to the 
following three formulas: 


dP PC 


A pyNttt — ee ces 
9) ! d uS * üdys is 
N 
PCN, — 
(AD) DVvNt+2 =k 4i and 
y (4d)5 (1+d)5 


PCN 
a +d)5° 





A customer should be abandoned at the beginning of 
year T whenever DVN.T x is less than the cost associated 
with this abandonment AC. Let ABNT xı be a variable that 
takes the value of 1 when the customer should be aban- 
doned at the beginning of year T and 0 otherwise. Using 
Equations A5—A8, we can determine ABN.T;,,, as follows: 


(A9) Apne =}! if CLVNpy < AC 


, 


O ifotherwise 


(A10  ABNtt! = 


, 


0O if otherwise 


1 if DVN +2 < AC 


(Al) ABNtt+2— , and 


( if DVN +1 < AC 


0 if otherwise 


(A12)  ABN'*?- l i MS 


O ifotherwise 


Taking into account that a customer, after he or she is 
abandoned, turns from an active to an inactive status for all 
future years, we can define a variable, SN/T,, which 
defines the activity status of a customer at year T and takes 
either the value of 1 when the customer is active or the 
value of O when the customer is inactive, as follows: 


(A13) SN. = ] — ABN.t, 
(Ald) | SN! *! Z1 min(SNt + ABE * E 1), 
(A15) Si *^-1-min(S?" * t+ ABM * ^; 1), and 
(A16 — Si *^-1-min(S;" +? + AB * 5 D. 
Finally, building on Equations A5 and A13—A16, we 


can now easily calculate CLV that includes option value as 


follows: 
N,t + 3p-N N,t+ 2pon | 
) Sik PC kı ] Sy PC) 


N L dk y 
GU ONO a S 
N, 
5 S PO ooN 
SN.PCI 
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Christian Homburg, Nicole Koschate, & Wayne D. Hoyer 


The Role of Cognition and Affect in 
the Formation of Customer 
Satisfaction: A Dynamic Perspective 


Despite the strong recognition that customer satisfaction should be viewed from a dynamic perspective, little is 
known about how the satisfaction judgment develops over time. Therefore, this study provides a dynamic analysis 
of the simultaneous influence of cognition and affect in the satisfaction formation process. The results of an 
experimental study based on a real consumption experience indicate that the impact of cognition on the satisfaction 
evaluation increases and the influence of affect decreases over time. Moreover, these effects are attenuated with 
inconsistent performance experiences. Finally, the study shows that the variance in customer satisfaction jointly 
explained by cognition and affect increases as experience accumulates. 


e topic of customer satisfaction/dissatisfaction has (1997, p. 319) states that affect “coexists alongside various 
been of great interest to marketing and consumer cognitive judgments in producing satisfaction” and that it is 
researchers for many years. This interest is driven in central to understanding customers’ consumption experi- 

part by the notion that customer satisfaction can have long- ences. In addition, a key conclusion of Szymanski and 


term benefits, including customer loyalty and increased Henard’s (2001) meta-analysis is that disconfirmation and, 
profitability (e.g., Anderson, Fornell, and Lehmann 1994; to a lesser extent, affect are strongly related to satisfaction. 
Anderson, Fornell, and Rust 1997; Anderson and Sullivan However, only a few studies have investigated cognitive 
1993; Bearden and Teel 1983; Bolton and Drew 1991; For- and affective antecedents of customer satisfaction simulta- 
nell 1992; LaBarbera and Mazursky 1983; Oliver 1980; neously. For example, in studying affects that are provoked 
Oliver and Swan 1989a, b; Rust, Moorman, and Dickson by service failures, Smith and Bolton (2002) find that feel- 
2002; Rust and Zahorik 1993). Against this background, it ings predict satisfaction levels, after accounting for cogni- 


is important to understand the mechanism that leads to cus- tive factors. Kempf’s (1999) study finds that feelings (e.g., 
tomer satisfaction from both an academic and an applied arousal, pleasure) are particularly important antecedents of 
perspective. product evaluations of hedonic products, whereas brand 


Previous research has recognized that both cognition cognitions are not. The opposite is true for functional prod- 
and affect significantly predict satisfaction judgments. Cog- ucts. Oliver (1993) examines the role of disconfirmation 
nition has been studied mainly in terms of the disconfirma- and affective experiences and finds that both are critical to 
tion paradigm, which predicts satisfaction to be a function the modeling of satisfaction judgments. 
of a comparison between expectations and performance Although previous studies identify the key roles of cog- 
(e.g., Bearden and Teel 1983; LaBarbera and Mazursky nition and affect in influencing satisfaction, it is important 
1983; Oliver 1980; Oliver and DeSarbo 1988). Other stud- to acknowledge that previous work has largely been static 


ies have recognized that the affect experienced during the (i.e., cross-sectional) in nature. This represents a significant 
acquisition and consumption of the product or service (e.g., research gap because it is well established that customer 
joy, happiness, disgust) can also have a significant influence satisfaction is a dynamic phenomenon. Even the classic 


on satisfaction judgments (e.g., Mano and Oliver 1993; confirmation/disconfirmation paradigm, which has guided 
Westbrook 1987; Westbrook and Oliver 1991). Oliver much of the research in the area, recognizes that satisfaction 

judgments can have an important influence on customer 
— expectations for subsequent consumption occasions (Oliver 
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Mittal, Kumar, and Tsiros (1999) and Mittal, Katrichis, and 
Kumar (2001) find that the weights given to an attribute in 
determining satisfaction shift over time on the basis of their 
salience. Slotegraaf and Inman (2004) show how satisfac- 
tion with product attributes can decline over time (particu- 
larly those that can be remedied and especially as customers 
approach the end of their warranty period). Other dynamic 
studies in this area focus on carryover effects of previous 
satisfaction judgments to subsequent satisfaction evalua- 
tions (Bolton and Drew 1991; Boulding et al. 1993; Mittal, 
Kumar, and Tsiros 1999). Bolton (1998) proposes a 
dynamic model in which the duration of customer relation- 
ship depends on an anchoring and adjustment process in 
which cumulative satisfaction serves as the anchor for sub- 
sequent experiences. Nevertheless, to the best of our knowl- 
edge, there is no study that investigates the effects of cogni- 
tion and affect simultaneously over time. 

Against this background, our study investigates cogni- 
tion and affect simultaneously as drivers of customer satis- 
faction in a dynamic context. The fundamental proposition 
of our study is that the role of cognition and affect may 
change over time. More specifically, we argue that affect 
plays its strongest role at early stages of satisfaction devel- 
opment because of a lack of knowledge about the product 
or service. As experience accumulates, however, the impact 
of cognition should increase. To study this proposition, we 
examine customer satisfaction formation, beginning from a 
point at which customers have no prior experience with the 
product at all. In this respect, our study differs from previ- 
ous work in this area, which essentially considers situations 
in which customers have some prior experience with the 
product or service. In addition, we examine this process in 
the context of moderately complex, utilitarian products. 

In addition, from a dynamic perspective, the evaluation 
process that occurs during consumption serves as an input 
into subsequent decision-making situations. In other words, 
the nature of the feedback received has a strong influence 
on the satisfaction judgment, and this affects future deci- 
sions. Therefore, we build a case that the phenomenon 
under study also depends on the level of consistency of the 
consumption experience (re., Is it consistently positive, 
consistently negative, or inconsistent?). The importance of 
the concept of feedback consistency has been established in 
related areas, such as job performance (Alvero, Bucklin, 
and Austin 2001; Stone and Stone 1985). To the best of our 
knowledge, however, this concept has not been studied in 
the customer satisfaction literature. 

Furthermore, previous research in social psychology 
and marketing has indicated that attitudes and judgments 
become more stable over time after multiple experiences 
with the attitude object (Fazio 1986; Petty, Unnava, and 
Strathman 1991; Smith and Swinyard 1983). On the basis 
of this literature, we argue that over time and with repeated 
consumption experiences, satisfaction judgments should 
also become more stable and that the ability of both cogni- 
tive and affective factors to predict customer satisfaction 
should increase. 

Thus, our research is in line with the frequent call for 
longitudinal studies in marketing (e.g., Bolton 1998; 
Fournier and Mick 1999; Szymanski and Henard 2001). In 
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addition to the theoretical interest of this topic, it has practi- 
cal relevance because it aids managers in understanding the 
development of customer satisfaction and what types of fac- 
tors are most critical at different stages in the process. 


Development of Hypotheses 


According to Oliver (1997), a satisfaction evaluation is a 
specific type of attitude (i.e., postconsumption evaluation). 
Therefore, to develop our hypotheses on the customer satis- 
faction formation process, we draw on research on attitudes. 
Research in this area has found that newly formed attitudes 
are unstable and held with less certainty and therefore are 
more difficult to predict (Eagly and Chaiken 1993). Cer- 
tainty has been shown to increase as the quantity of infor- 
mation about the product increases (Berscheid et al. 1976; 
Dover and Olson 1977; Farley, Katz, and Lehmann 1978). 
Over repeated consumption experiences, customers gain 
more information about the product, and therefore their 
judgment certainty should increase (Fazio 1986; Smith and 
Swinyard 1983). Thus, with repeated experience, the cogni- 
tive and affective judgments underlying these attitudes 
become associated with higher certainty (i.e., become 
stronger and more stable) and should be better able to pre- 
dict the satisfaction evaluation (Chandrashekaran et al. 
2000). We hypothesize the following: 


Hı: The variance in customer satisfaction jointly explained by 
cognitive and affective factors increases as experience 
accumulates. 


As we mentioned previously, a key contribution of this 
article is its examination of whether the impact of cognition 
and affect on the satisfaction judgment changes over time. 
In particular, we are interested in the early stages of the sat- 
isfaction formation process (i.e., the first consumption 
experiences with the product or service). Whenever cus- 
tomers encounter a new product or service, there is typi- 
cally little information to draw on from memory (i.e., no 
affect- or cognition-based evaluations). Therefore, cus- 
tomers are more likely to rely on their feelings because feel- 
ings are often elicited immediately on exposure to a new 
stimulus (Pham et al. 2001). In other words, affective 
responses to stimuli are evoked much quicker than cogni- 
tive responses. Consistent with this view, several studies 
have examined a feelings-based inference called the "how- 
do-I-feel-about-it" heuristic, whereby customers can make 
satisfaction or dissatisfaction judgments based on the 
valence of their feelings (Gorn, Goldberg, and Basu 1993; 
Pham 1998; Schwartz and Clore 1988). 

This notion is also related to strategies outlined in the 
affect infusion model (AIM; Forgas 1994, 1995), in which 
affect infusion is highly likely to occur. Affect infusion 
refers to the process in which affectively loaded information 
influences and becomes part of the judgmental process, 
entering the judge’s constructive thought process and even- 
tually coloring the judgmental outcome (Forgas 1995, p. 
39). The AIM is a cognitively oriented approach for 
describing the role of affect in judgment processes and thus 
fits well with the decision context of the current study. The 
AIM suggests that it 1s the constructive, generative nature of 


most judgments that is critical for affect infusion to occur 
(Fiedler 1991). Constructive processing involves the devel- 
opment of a judgment or decision rule on the spot rather 
than the reliance on a well-formed rule or ene 
(Bettman, Luce, and Payne 1998). 

Applied to the current context, for the first sonan 
experience, customers should have little information stored 
in their memory on which to base a judgment, and therefore 
they should rely on affective inputs. As the number of con- 
' sumption experiences increases, however, customers 
acquire increasing amounts of information about the prod- 
uct or service (i.e., through trial and other sources). When 
making a satisfaction judgment on subsequent occasions, 
customers can recall this prior information to help them 
make a satisfacton judgment. In support of this notion, 
studies have found that previous satisfaction judgments 
affect subsequent satisfaction evaluations (Bolton 1998; 
Bolton and Drew 1991; Boulding et al. 1993; Mittal, 
Kumar, and Tsiros 1999). This notion is consistent with the 
direct access strategy of the AIM model, which involves the 
strongly cued retrieval of a crystallized judgment from 
memory, and is a robust process more resistant to affect 
infusion. 

Thus, we expect that affect has a greater impact on the 
satisfaction judgment in the earlier stages of the satisfaction 
development process when judgments require a higher 
degree of constructive processing. In support of this notion, 
Smith and Bolton (2002) find that affect is more likely to 
predict transaction-specific customer satisfaction than 
cumulative satisfaction (which would be based on several 
consumption experiences). Thus, we offer the following 
hypothesis: 

H5: As experience accumulates, (a) the impact of affective 
factors on customer satisfaction decreases, and (b) the 
impact of cognitive factors on customer satisfaction 
increases. 

Further refining our theoretical reasoning, we propose 
that the previously mentioned pattern of processing is con- 
tingent on the consistency of the consumption experience 
over time. Thus, we investigate the impact of consistent ver- 
sus inconsistent performance experiences on the proposed 
relationships in H4. A consistent experience would result 
from performance feedback that is in the same direction for 
each consumption occasion (i.e., all consumption experi- 
ences are positive, or all are negative). This leads to the 
building of a more reliable and diagnostic knowledge base 
that can be used as a basis for evaluation on subsequent 
occasions. In support of this notion, studies have found that 
as consistency of information increases, attitude certainty 
also increases (Heslin, Blake, and Rotton 1972; Kahneman 
and Tversky 1973). Therefore, we expect that customers 
switch in early stages from a more constructive processing 
strategy to a direct access strategy. 

Conversely, inconsistent experiences would generate 
conflicting information, which would result in less reliable 
and less diagnostic information stored in memory. There- 
fore, customers would need to continue engaging in con- 
structive information processing, which, as we mentioned 
previously, involves the significant role of affective factors. 


Thus, we assume that the relative impact of affective factors 
does not decrease as heavily as in the case of consistent 
experiences. 


H4: The suggested pattern (1e, the impact of cognition 
increases, whereas the impact of affect decreases) is more 
pronounced in the case of consistent experiences than in 
the case of inconsistent experiences 


A Longitudinal Experimental Study 


Overvlew 


To enhance the realism of the study, we placed the "cus- 
tomers" in an actual consumption situation. Specifically, we 
asked them to evaluate a newly created CD-ROM tutorial, 
which could be used and purchased to provide academic 
assistance in a difficult pricing class. We gave them three 
sample chapters (trials) of the CD-ROM tutorial over time 
in a computer-based format. After each trial, we asked them 
to solve a pricing problem related to the sample chapter, 
and then we gave them feedback on their performance. We 
manipulated performance by varying the quality of the tuto- 
rial and the feedback. After we provided the feedback, we 
elicited responses to key measures. 


Sample 


Our sample consisted of 157 marketing students enrolled in 
a graduate-level pricing class at a large German university. 
Because students represent the target market for the CD- 
ROM study guide, this sample is particularly relevant and 
appropriate. Participants were aware that previous students 
had experienced difficulties in this pricing class. 


Research Design 


The research design consisted of an 8 (levels of perfor- 
mance) x 3 (trial) full factorial design. Performance was a 
between-subjects factor, and trial was a within-subjects fac- 
tor To manipulate performance, we gave participants a 
sample chapter and asked them to solve a related pricing 
problem. In the high-performance condition, the content of 
the CD-ROM sample chapter made it easy to understand 
and to solve the pricing problem. This was accomplished by 
providing the different steps necessary to complete the pric- 
ing task successfully in a logical order. Furthermore, par- 
ticipants in this condition received positive feedback that 
the CD-ROM helped them solve the problem. Note that this 
feedback was designed to ensure that participants focused 
on the performance of the CD-ROM rather than on their 
own performance. 

In the Jow-performance condition, the content of the 
CD-ROM chapter was not in logical order and therefore 
was difficult to read. It also contained little information 
related to the pricing problem. We also provided partici- 
pants with negative feedback that the CD-ROM did not help 
them solve the pricing problem. Again, the goal was to 
focus participants on the performance with the CD-ROM 
rather than on their own performance. 

To examine the proposed dynamic relationships, we 
manipulated performance across three different trials. In 
each trial, we presented participants with a different chapter 
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from the CD-ROM tutorial and then asked them to solve a 
related pricing problem. Note that the three sample chapters 
covered completely different pricing issues and did not 
build on one another. Thus, learning effects, in the sense 
that knowledge gained in previous chapters could be helpful 
for understanding subsequent chapters and pricing tasks, 
were not present. In the high-performance condition, par- 
ticipants received the high-performance manipulation on all 
three trials. In the low-performance condition, participants 
received the low-performance manipulation on all three tri- 
als. These two conditions represent consistent performance 
experiences. We then manipulated intermediate levels of 
performance through high- and low-performance combina- 
tions across the three trials. To represent inconsistent expe- 
riences, we constructed two conditions with a performance 
experience on each subsequent trial that was different from 
the most recent one (+ — + and — + —). We outline the differ- 
ent conditions in Table 1. 


Experimental Procedure 


Before the first trial, we gave participants the introductory 
section, which described the purpose of the CD-ROM and 
content. We told them that a CD-ROM study guide had 
been developed to assist participants in solving difficult 
pricing problems in the course. Furthermore, we informed 
them that they would have the chance to test the CD-ROM 
tutorial before deciding if they wanted to purchase it. We 
then set up expectations about the CD-ROM tutorial and 
held them constant across the experimental conditions. We 
told participants that the CD-ROM tutorial contained 73 
chapters, which would be similar to the ones they received 
in the testing phase but would cover different pricing topics. 
The introductory section of the CD-ROM tutorial informed 
participants that the purpose of the study guide was to help 
course participants understand difficult material in the class, 
and it provided an overview of the content. 

Participants then received one sample chapter of the 
CD-ROM study guide (first trial), after which they solved a 
problem related to the material. We manipulated perfor- 
mance in the manner described previously. The measure- 
ment of cognition and affect occurred immediately after 
participants received the feedback from the performance 
manipulation. We then asked intervening questions to dis- 





TABLE 1 
Overview of Experimental Conditions 

Experimental Ui 

Condition 1 2 3 
1 + + + 
2 + + — 
3 + — + 
4 + — — 
5 — + + 
6 = + = 
7 — — + 
8 = zs = 


Notes: + = high performance, and — = low performance. 
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tract participants from the evaluation situation, after which 
we measured customer satisfaction. After the first trial, par- 
ticipants completed an intervening task. This involved read- 
ing a newspaper article about a recent pricing problem in 
practice and answering some open-ended questions about 
the content of the article. 

We conducted the second and third trials in a similar 
manner. We told participants that they had the chance to get 
more experience with the CD-ROM study guide. They then 
received another sample chapter of the CD-ROM, solved a 
pricing task, and received feedback. After we measured the 
key variables, we distracted participants between the second 
and the third trial with an intervening task similar to the one 
described previously. After the third trial, participants com- 
pleted a final set of general questions and provided demo- 
graphic and classification information. 


Measurement of Variables 


The key variables of interest were cognition and affect. We 
measured cognition in terms of disconfirmation, which is 
the key cognitive component in the confirmation/disconfir- 
mation paradigm (Oliver 1980). We measured disconfirma- 
tion with the following item, which parallels previous 
approaches in measuring disconfirmation (Fornell et al. 
1996): “The performance of the CD-ROM meets my expec- 
tations.” We evaluated the item on a seven-point Likert-type 
scale anchored by “I totally disagree" (1) and “I totally 
agree” (7). 

We measured affect with three items. These items 
included elation, delight, and joy, which have also been 
measured in other studies examining affect (e.g., Holbrook 
and Batra 1987; Oliver 1993; Westbrook 1987; Westbrook 
and Oliver 1991). We asked participants how intensively 
they experienced these types of affect on a seven-point 
Likert-type scale. 

In addition, we assessed affect and cognition with an 
open-ended, verbal protocol measures. We asked partici- 
pants to describe any feelings and/or thoughts they had 
while using the tutorial. Responses were then coded into 
affective and cognitive categories. Responses were coded as 
affective if the statements related to any type of affect or 
feelings; they were coded as cognitive if they related to the 
content and functional performance of the tutorial. 
Research assistants carried out the codings. In terms of 
coder reliability, the coders agreed in 96% of the cases, and 
they resolved disagreements through discussions. 

The key dependent variable was customer satisfaction, 
which we measured with the following three items: “All in 
all, I am satisfied with the CD-ROM tutorial"; “The CD- 
ROM tutorial compares to an ideal CD-ROM tutorial”; and 
“Overall, how satisfied are you with the CD-ROM tuto- 
rial?” We measured the items on an 11-point Likert-type 
scale. For the first two items, the scale ranged from 
“strongly agree” to “strongly disagree,” and for the third 
item, it ranged from “very satisfied” to “very dissatisfied.” 
The internal consistency of the satisfaction scale was excel- 
lent across the three trials (first trial: Cronbach's a = .95; 
second trial: Cronbach's a = .97; and third trial: Cronbach's 


& = .96). Thus, for further analyses, we calculated the satis- 
faction scores as the means of the satisfaction scale items. 

Finally, we collected several variables as possible 
covariates. These included participants” age, gender, 
income, degree to which the budget for studying material is 
exhausted, perceived pressure to buy the CD-ROM tutorial, 
price consciousness, value consciousness, and self- 
confidence. Analyses indicate that the fixed effects of these 
covariates are not significant.! Thus, we dropped them from 
further analysis. 


Results 


H, predicted that the variance in customer satisfaction 
jointly explained by cognitive and affective factors 
increases as experience accumulates. As participants gain 
more experience with the CD-ROM tutorial across the three 
trials, the satisfaction judgment becomes more cumulative. 
To test H,, we estimated the following regression model for 
each of the three trials: 


(1) CS = bg +b, x CO b; x AF +r, 


where CS is customer satisfaction, CO is the cognitive pre- 
dictor, AF is the affective predictor, and r is the random 
error. 

We estimated the three regression models with the 
maximum likelihood method using the procedure MIXED 
in SAS 8.02.2 The results appear in Table 2. First, in each of 
the three trials, there is a positive and statistically signifi- 
cant relationship between the cognitive predictor and satis- 
faction, and there is a positive and statistically significant 
relation between the affective predictor and satisfaction. 
Second, the results indicate that the R-square values sub- 
stantially increase from the first (.645) to the third (.731) 
trial. These results support H}. 

Our second analysis pertained to the changing role of 
cognitive and affective factors over time. H5, predicted that 
the impact of affective factors decreases as experience accu- 
mulates, and H», predicted that the role of cognitive factors 
increases as experience accumulates. To conduct a test of 
these hypotheses, we estimated the following regression 
model with interaction effects: 


(2) CS, = bg + b; X CO, + b; X AF, + b, x time, 
+ b4 X CO, x time, + bs x AF, x time, + r, 


where CS, is customer satisfaction, CO, is cognition, and 
AF; is affect at time,. The different points in time are repre- 


lThe individual significance levels were as follows: age: p = 
.485; gender: p = .140; income: p = .825; degree to which the bud- 
get for studying material is exhausted: p — .354; perceived pres- 
sure to buy the CD-ROM tutorial: p — .563; price consciousness: 
p = .223; value consciousness: p = .295; and self-confidence: p = 
.103. 

2We used full maximum likelihood estimation because the focus 
of the analysis is on deviance tests and not on the random part 
parameters, for which the restricted maximum likelihood method 
is preferable (Singer and Willett 2003; Snijders and Bosker 1999). 


sented by the different trials (e.g., trial 1 = time 1). The ran- 
dom error associated with time, is represented by r,. 

In the testing of H» the coefficients of the two interac- 
tion terms, b4 and bs, are of key interest. These interaction 
terms are similar to slope-shift parameters (Mittal, 
Katrichis, and Kumar 2001), which enable us to estimate 
the differential slopes over time as experience accumulates. 
A significant, positive coefficient for tbe interaction term 
implies that the slope is higher for those who have more 
experience with the CD-ROM. In other words, the corre- 
sponding factor has a larger weight for customers who have 
more experience with the CD-ROM than for those who 
have less experience with it. On the basis of our theoretical 
reasoning, we expect the coefficient b, to be positive. 

A negative coefficient for the interaction term implies 
that the slope is lower for those who have more experience 
with the CD-ROM. The corresponding factor has a smaller 
weight for customers who have more experience with the 
CD-ROM than for those who have less experience with it. 
On the basis of our theoretical reasoning, we expect the 
coefficient bs to be negative. Furthermore, statistically non- 
significant coefficients imply that the weight of the corre- 
sponding factor is invariant to the experience a customer 
has with the CD-ROM. We estimated the model with the 
maximum likelihood method using the procedure MIXED 
in SAS 8.02. The estimation results appear in Table 3. 

The results indicate that there is an overall statistically 
significant, positive relationship between cognition and sat- 
isfaction (b; = 22.380, p < .001) and between affect and sat- 
isfaction (bz = 8.490, p < .001). In addition, there is a sig- 
nificant, positive interaction between cognition and time 
(b4 = 1.960, p « .05). This means that the impact of cogni- 
tive factors on the satisfaction judgment increases over 
time. Thus, there is support for H,,. 

Furthermore, there is a significant, negative interaction 
between affect and time (bs = —2.330, p < .05), supporting 
the notion that the impact of affective factors on customer 
satisfaction decreases over time. Thus, there is also support 
for Ha. Taking these two findings together, the impact of 
cognitive factors on the satisfaction judgment increases 
over time, whereas the impact of affective factors decreases. 

Further support for these hypotheses is provided by 
examining the results for the verbal protocol measures. As 
we mentioned previously, open-ended responses were 
coded into affective and cognitive categories. Overall, there 
were more cognitive statements than affective ones. We 
expected this because the product category is cognitive in 
nature. More important, however, the pattern of responses 
over the trials was consistent with the hypotheses. Specifi- 
cally, the largest number of emotional statements was made 
in the first trial (23) and then decreased (14 in both the sec- 
ond and the third trials). For cognitive statements related to 
the content and functional performance, participants exhib- 
ited an increasing trend. On the first trial, 74 such state- 
ments were made, followed by 89 in the second trial and 90 
in the third. 

H4 proposed that an inconsistent performance expen- 
ence would attenuate the increasing impact of cognitive fac- 
tors and the decreasing impact of affective factors over time 
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(compared with a consistent performance experience). To 
test this hypothesis, we created two subgroups from the data 
set. The first group had consistent satisfaction/performance 
experiences (+ + + or — — —), whereas the second group had 
completely inconsistent performance experiences (+ — + or 
—+-), 

We then ran the regression model with interaction 
effects described previously for each group separately. As 
we show in Table 4, for the consistent performance experi- 
ences, the impact of cognition increases slightly, and the 
impact of affect decreases over time, as is evidenced by the 
results for the interaction effects (by = .119, p < .06; bs = 
—.156, p < .05). In the case of inconsistent performance 
experiences, however, neither of these interaction terms is 
significant (see Table 4). In other words, in the case of 
inconsistent performance experiences, there is neither a sig- 
nificant increase of cognition over time nor a significant 


decrease of affect over time in predicting customer satisfac- 
tion. Thus, inconsistent performance experiences attenuate 
these effects. These results support H3. 


Discussion 


The goal of this study was to examine the joint effects of 
cognitive and affective factors on satisfaction judgments in 
a dynamic setting. The results of our study show that 
together, cognitive and affective factors explain the variance 
in satisfaction judgments well and that the strength of asso- 
ciation increases over time. More important, we found that 
as the number of experiences increases over time, the influ- 
ence of cognitive factors increases, whereas the influence of 
affective factors decreases. However, this effect attenuates 
with inconsistent consumption experiences. These findings 
have several theoretical and managerial implications. 











TABLE 3 
Regression Results with Interaction Effects 
—2 log-likellhood 1566.9 
Solution for Fixed Effects 
Parameter Effect Estimate t-Value p 
Do Intercept 6.072 103.200 .00 
bı Cognition 173 22.380 .00 
De Affect 275 8.490 .00 
D4 Time —.248 —3.440 .00 
Da Cognition x time .083 1.960 .05 
bs Affect x time —.091 —2.330 .02 
Solution for Residual Effect 
Parameter Estimate Z Value p 
o? 1.630 15.350 .00 
TABLE 4 
Regression Results for Consistent and Inconsistent Performance Experience 
Consistent Performance Inconsistent Performance 
Experience Experlence 
—2 log-likelihood 449.5 422.8 
Solution for Fixed Effects Solution for Fixed Effects 
Parameter Effect Estimate t-Value p Estimate t-Value p 
bg Intercept '6.095 59.660 .00 6.268 48.320 .00 
bi Cognitlon 742 14.610 .00 833 10.450 .00 
Do Affect 335 6.600 .00 .249 8.490 .00 
bs Time —.197 —1.590 x ~.119 3.220 45 
b4 Cognition x time .119 1.870 .06 .048 —.750 62 
Ds Affect x time —.156 —2.500 .01 .111 1.200 23 
Solution for Residual Effect Solution for Residual Effect 

Parameter Estimate Z Value p Estimate Z Value p 
o? 1.419 15.350 .00 1.9844 15.350 .00 
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Implications 


Several implications follow from our study. Most important, 
our study contributes to the existing literature on customer 
satisfaction. First, we illustrate the importance of examining 
customer satisfaction from a dynamic perspective. We show 
that the antecedents leading to customer satisfaction vary 
over time. As we mentioned previously, it is well recog- 
nized that both cognition and affect have important influ- 
ences on the satisfaction judgment. However, the key find- 
ing of this study points to the differential roles of these 
factors as experience accumulates. To the best of our 
knowledge, this is the first study to examine the role of cog- 
nition and affect simultaneously in a dynamic setting. Thus, 
we provide a deeper integrative understanding of how cog- 
nition and affect combine to form the customer satisfaction 
judgment. Our findings show that these judgments are 
based on different factors, depending on the amount of 
experience that has been accumulated over time. Thus, cap- 
turing satisfaction judgments at only one time may not pro- 
vide a full picture of the underlying process. 

Second, most studies in this area examine in a cross- 
sectional manner the satisfaction judgment process after 
customers already have some experience or knowledge with 
the product or service. In the current study, we investigated 
the customer satisfaction formation process beginning with 
the very first consumption experience. This provides a 
deeper insight into how the satisfaction judgment crystal- 
lizes over time. In this context, we found that affective fac- 
tors have their strongest role in the early stages. The cogni- 
tive component maintains its importance and increases its 
role over time. 

Third, we examine the role of cognition and affect in 
influencing satisfaction judgment processes. Our findings 
suggest that affect is particularly important in the early 
stages of the judgment formation process in which cus- 
tomers have little knowledge or experience related to the 
product. In other words, when a stimulus is new and unfa- 
miliar, feelings appear to play a critical role in the construc- 
tion of satisfaction judgments, consistent with the how-do- 
I-feel-about-it heuristic (Gorn, Goldberg, and Basu 1993; 
Pham 1998; Schwartz and Clore 1988). As experience accu- 
mulates, however, the role of cognition becomes more 
prominent. That is, a direct access strategy (Forgas 1994, 
1995) can be employed in which evaluations from previous 
trials (based on a more detailed evaluation of product attrib- 
utes) can be recalled from memory. 

Fourth, we provide insights into how the inconsistency 
versus consistency of performance feedback can also affect 
the customer satisfaction formation process. Specifically, 
when experiences are inconsistent, the related judgments 
are less stable and predictable. In this case, the changing 
patterns of affect and cognition do not emerge. An implica- 
tion of our findings is that affective factors are particularly 
important and relevant when the judgments are based on lit- 
tle information (i.e., they are newly formed) or when infor- 
mation is inconsistent. In these cases, the cognitive infor- 
mation has unstable predictive validity, so customers are 
less likely to rely on this information. This finding empha- 
sizes the importance of identifying how the pattern or 


. 
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nature of experiences can influence satisfaction judgments 
as well as the importance of managing customer experi- 
ences. If customers have inconsistent experiences, their 
evaluations of the product/service will be less crystallized 
and less stable. Thus, in an effort to improve customer satis- 
faction and loyalty, it is critical to ensure that customer 
experiences are positive and consistent. 

Fifth, the findings provide further support for research 
on attitude strength in an alternative context (Jaccard et al. 
1995; Jacoby et al. 2002; Petty and Krosnick 1995). That is, 
cognitive and affective factors were better able to predict 
satisfaction evaluations as the number of trials increased, 
particularly in the case of consistent experiences. This is in 
line with the notion that newly formed attitudes are less sta- 
ble and, therefore, less predictive of information processing 
and behavior (Eagly and Chaiken 1993). As knowledge and 
experience accumulates, the relationship between the cogni- 
tive and affective factors and satisfaction is strengthened. 

Sixth, our study also makes a methodological contribu- 
tion by developing a procedure for studying these dynamic 
effects in a tightly controlled, yet realistic setting. This 
methodology could be employed to examine judgment for- 
mation processes in various other product/service contexts 
as well. 

Finally, on a general level, our study suggests that it is 
insightful to investigate marketing phenomena from a 
dynamic perspective. As our study shows, judgment pat- 
terns change over time. 


Limitations and Avenues for Further Research 


Several limitations of this study are worth mentioning. In 
turn, these provide avenues for further research. First, the 
study was based on a highly utilitarian product (i.e., a CD- 
ROM tutorial). In general, the consumption of utilitarian 
goods is highly cognitively driven and accomplishes a func- 
tional or practical task (Strahilevitz and Myers 1998). This 
may partially explain why the role of cognition (affect) 
became stronger (weaker) over trials. In other words, as 
customers gained experience in evaluating the utilitarian 
benefits of the product, these cognitive factors gained 
importance in influencing satisfaction. However, it is not 
clear whether the same pattern would occur for highly 
hedonic products/services. The consumption of hedonic 
products/services is characterized by highly affective and 
sensory experiences of sensual and/or aesthetic pleasure 
(Dhar and Wertenbroch 2000; Hirschman and Holbrook 
1982; Okada 2005). Thus, in such cases, the impact of 
affect might not decline over time. It is possible that a direct 
access strategy will still be employed as experience accu- 
mulates (i.e., a recall of a previous crystallized evaluation 
from memory), but this prior evaluation would be based 
more on affective than on cognitive factors. In support of 
this view, Kempf (1999) finds that for hedonic products in 
particular, affective responses to the trial are powerful 
antecedents of consumers' evaluations of a trial experience 
and, subsequently, brand attitude. In addition, it might be 
the case that the relative importance of cognitive versus 
affective drivers of customer satisfaction varies across prod- 
uct categories. It is plausible to assume that the relative 
importance of cognitive factors is higher for utilitarian 


products than for hedonic ones. These speculations, which 
are beyond the scope of this article, show that a more 
detailed investigation of affective and cognitive drivers of 
customer satisfaction that accounts for different product 
categories is a fruitful avenue for further research. 

Second, in addition to the product category type, cus- 
tomers’ product involvement could also play an important 
role in this context. Product involvement can be defined as 
the perceived personal relevance of a product and a cus- 
tomer’s inherent interest in a product (Wilkie 1994; 
Zaichkowsky 1985). This study investigated the proposed 
relationships in a high-involvement context. In these situa- 
tions, customers usually employ a more analytical 
information-processing strategy and exhibit increased cog- 
nitive elaborations (Petty, Cacioppo, and Schumann 1983; 
Wright 1973). Thus, they are likely to base their decisions 
more on cognitive than on affective factors. Conversely, in 
low-involvement situations, information processing is usu- 
ally more holistic, and thus decisions might be influenced 
more by affective factors. Apart from these effects, it might 
be interesting to investigate whether and how these poten- 
tial effects of customers’ product involvement interact with 
the product category type (hedonic and utilitarian). Thus, 
studying the relevance of involvement in the context of 
affective and cognitive drivers of customer satisfaction is 
another interesting area for further research. 

Third, in our empirical study, we measured only the 
prevalence of positive affect and did not consider negative 
affect. Because it has been shown that positive and negative 
affect can have asymmetric effects on customers’ judgments 
and choices (Mittal, Ross, and Baldasare 1998), a promis- 
ing avenue for further research might be to investigate posi- 
tive and negative affect simultaneously. Prior research also 
demonstrates that different types of (negative) emotions can 
have opposing moderating effects (Garg, Inman, and Mittal 
2005). Thus, additional research on the role of cognition 


and affect in the formation of customer satisfaction might 
further differentiate between different types of emotions. 


implications for Marketing Practice 


Our findings also have several important implications for 
marketing managers. Our study helps managers understand 
customer satisfaction in a more thorough way. It sheds light 
on the formation process of customer satisfaction, and man- 
agers learn that customer satisfaction has a more stochastic 
character in the early stages. This gives managers the 
Opportunity to influence the satisfaction judgment to a 
greater extent in the early stages because the satisfaction 
has not been crystallized. 

Furthermore, it is common in practice for managers to 
consider customer satisfaction in a logical, rational manner 
(i.e., if the product or service performs well, satisfaction 
will be higher). The results of our study point out that affec- 
tive factors can play a critical role as well, particularly in 
the early stages of the satisfaction formation process. Thus, 
achieving customer satisfaction involves not merely discon- 
firmation judgments but a subjective, affective component 
as well. This point is particularly important when compa- 
nies are in the early stages of establishing a relationship 
with a customer. Thus, for new relationships or new prod- 
ucts, managers must pay close attention to affective aspects 
and be careful to manage them effectively. 

Moreover, our study provides insights into managing 
customer satisfaction. For example, it shows how important 
itis to manage affect when customers have inconsistent 
experiences over time. Thus, if companies are trying to 
recover from a situation in which their products/services 
have suffered from inconsistent performance, it appears that 
it is important to manage the affective factors surrounding 
the satisfaction in addition to product/service quality con- 
siderations. In addition, providing customers with consis- 
tent, positive experiences is critical to the development of 
stable satisfaction judgments. 
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Anick Bosmans 


Scents and Sensibility: When Do 
(In)Congruent Ambient Scents 
Influence Product Evaluations? 


This research examines when and how ambient scents (i.e., scents that are present in the environment and do not 
emanate from a specific product) affect product evaluations. Consistent with theory in environmental psychology, 
the author predicts that such ambient scents can elicit emotional responses that can influence subsequent 
judgments, but only when specific conditions are met. Three factors are hypothesized to moderate the effects of 
pleasant ambient scents on evaluations: (1) the congruence of the scent with the product, (2) the salience of the 
scent, and (3) consumers' motivation to correct for extraneous influences. The findings of three experiments 
suggest that as long as ambient scents are congruent with the product, they continue to affect consumers' 
evaluations, even when their influence becomes salient or when consumers are sufficiently motivated to correct for 
extraneous influences. In addition, as long as the scent is not completely incongruent with the product, 
noncongruent scents are corrected for only when their influence becomes salient or when consumers are 
sufficiently motivated. These findings indicate that the impact of pleasant ambient scents on product evaluations 


may be stronger than that of other affective environmental cues. 


for a long time to influence consumers' buying 

behavior and satisfaction. The air from an in-store 
bakery is often blown back into the store to increase con- 
sumers' interest in that department and to signal freshness 
and quality. In the past, retailers were limited to using 
product-specific scents emanating from the products them- 
selves, but recent technologies now enable retailers to apply 
a diverse selection of (artificial) ambient scents into the 
retail environment. For example, British Airways distributes 
a fragrance called Meadow Grass into the air of its business 
lounges to make customers feel comfortable. Similarly, 
Thomas Pink, an upscale British shirt retailer, uses sensors 
that emit the smell of freshly laundered cotton (Ellison and 
White 2000). 

Contrary to product-specific scents, ambient scents do 
not emanate from a specific product and therefore can be 
considered extraneous environmental cues. Ambient scents 
are not necessarily related to specific products, and their 
possible influence on consumer behavior goes beyond that 
of signalling mere product quality. Whereas the effects of 
product-specific scents are well understood (see, e.g., Bone 
and Jantrania 1992), much less is known about how ambient 
scents influence product evaluations and whether con- 
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sumers can control for their extraneous influence. For 
example, consider a supermarket that distributes the famous 
fragrance Channel No. 5 to create a pleasant store atmos- 
phere. Imagine this fragrance reaching the fresh vegetables 
and fruit department. How would this ambient scent affect 
consumers' product evaluations? 

Despite marketing practitioners’ increased interest 1n the 
use of ambient scents and despite the technological 
progress in scent applications, research investigating the 
impact of ambient scents on product evaluations is limited. 
In addition, the few studies that have examined the impact 
of ambient scents on consumers’ judgments have yielded 
mixed results (Bone and Ellen 1999): Although pleasant 
scents sometimes increase evaluations (Spangenberg, 
Crowley, and Henderson 1996), they do not always seem to 
do so (Morrin and Ratneshwar 2003). It is unclear whether 
and to what extent ambient scents influence consumers’ 
product evaluations. 

This research examines the impact of ambient scents on 
product evaluations in more detail. Consistent with theory 
in environmental psychology, it is assumed that as is the 
case with other environmental stimuli, such as in-store 
music or colors, ambient scents elicit emotional responses 
that can influence product and store evaluations (Bitner 
1992; Gorn, Pham, and Sin 2001; Zeithaml and Bitner 
2000). In addition, because ambient scents are extraneous 
environmental cues, consumers are likely to correct for their 
possible erroneous influences. More specifically, it is pro- 
posed that correcting for the extraneous influence of ambi- 
ent scents depends on at least three conditions: (1) how con- 
gruent the scent is with the product category, (2) how 
salient the extraneous influence is, and (3) consumers' moti- 
vation to correct for possible extraneous influences. The 
results of three experiments show that, indeed, consumers 
seem to correct for the extraneous influence of ambient 
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scents only when they perceive the scent as incongruent 
with the product category. In addition, discounting the 
influence of incongruent ambient scents occurs only when 
the influence of the ambient scent is made salient to respon- 
dents (Experiment 1) or when consumers have sufficient 
processing motivation to correct for the extraneous influ- 
ence (Experiments 2 and 3). Consumers are able to correct 
regardless of their processing motivation only when ambi- 
ent scents are highly incongruent with the product category 
(Experiment 3). 


Ambient Scents and Product 
Evaluations 


Ambient scents can be considered affective cues that guide 
consumers’ subsequent product evaluations. This assump- 
tion is consistent with the observation that people’s primary 
reaction to scents is one of like or dislike. Indeed, the most 
salient attribute of a scent seems to be its pleasantness or 
unpleasantness (Engen 1982). Moreover, smell is directly 
linked to the experience of emotions (Herz and Engen 
1996): The primary olfactory cortex forms a direct, 
anatomical link with the amygdala-hippocampal complex 
of the limbic system, and only two synapses separate the 
olfactory nerve from the amygdala, which is critical for the 
experience of an emotion (Aggleton and Mishkin 1986). No 
other sensory system makes this kind of direct and intense 
contact with the neural substrates of an emotion. As a result, 
olfaction is often considered the emotional sense (Ehrlich- 
man and Halpern 1988; Engen 1982). Consistent with this 
notion, various streams of research have suggested that 
scents influence human behavior through the affect they 
elicit (Baron 1990; Baron and Bronfen, 1994; Baron and 
Thomley 1994; Knasko 1995; Rotton et al. 1978). For 
example, both task performance and helping behavior 
increase when a pleasant scent is present in the environ- 
ment, and this effect seems to be mediated by people’s 
affective reactions (Baron 1990; Baron and Thomley 1994). 

Despite the ample evidence that pleasant ambient scents 
increase approach behavior, such as increased in-store lin- 
gering (Knasko 1989), increased time spent looking at 
advertisements (Knasko 1995), and increased amounts of 
money spent in slot machines (Hirsch 1995), little is known 
about how ambient scents influence product. evaluations. 
Few studies have investigated the impact of ambient scents 
on evaluations of products, environments, or people. In 
addition, these studies have yielded mixed results (Bone 
and Ellen 1999). Although scents sometimes appear to 
affect evaluations (e.g., Rotton et al. 1978; Spangenberg, 
Crowley, and Henderson 1996), they do not always do so 
(Cann and Ross 1989; Morrin and Ratneshwar 2003). The 
purpose of the current research is to investigate the impact 
of pleasant ambient scents on product evaluations in further 
detail and, in particular, to test conditions in which scents 
affect these evaluations. 

Consistent with the affect-as-information literature 
(Pham 1998; Schwarz and Clore 1983), it is argued that as 
is the case with other affective environmental stimuli, such 
as music or the weather, ambient scents can affect subse- 
quent evaluations because people often misconsider their 


reactions toward the extraneous stimulus as a reaction 
toward the product under evaluation. Indeed, when con- 
sumers evaluate products, they often inspect their momen- 
tary feelings (i.e., “How do I feel about it?") and consider 
them informative for their evaluations (Pham 1998), even 
when these feelings originate from irrelevant sources, such 
as in-store music, the weather, or, in the current context, 
ambient scents. Thus, given the strong emotional potency of 
ambient scents, it is logical to assume that they can provide 
affective information (i.e., “I feel good about it, so the prod- 
uct must be good”), even when these scents do not arise 
from the product under evaluation. 


Controlling for the Influence of 
Ambient Scents 


In general, relying on extraneous affect when making judg- 
ments seems to be a spontaneous process, in that people 
appear to automatically consider their affect representative 
for the target (Martin 1986; Schwarz 1990). However, 
because most people like to render judgments that are free 
of extraneous influences, they may subsequently assess the 
appropriateness of their initial reaction (Wegener and Petty 
2001). 

This article examines how ambient scents affect subse- 
quent product evaluations and when consumers control for 
these extraneous influences. On the basis of previous theo- 
rizations in the domain of contextual correction (Martin 
1986; Schwarz and Bless 1992; Wegener and Petty 2001), it 
is argued that whether people control for the extraneous 
influence of an ambient scent depends on at least three con- 
ditions: (1) the scent’s congruence with the product cate- 
gory, (2) the salience of the ambient scent, and (3) con- 
sumers’ motivation to correct for possible extraneous 
influences. 


Congruence of the Ambient Scent with the 
Product Category 


It seems reasonable to assume that correction for the erro- 
neous influences of ambient scents is more likely when 
these scents are perceived as inappropriate for the product 
evaluation. In addition to their level of pleasantness, ambi- 
ent scents provide semantic information (Bone and Jantra- 
nia 1992; Mitchell, Kahn, and Knasko 1995). For example, 
the pleasant smell of lavender not only elicits pleasant feel- 
ings but also may activate memory concepts of a holiday in 
the French Provence. Similarly, the smell of roses may 
remind a person of baby care products and may activate 
baby-related memory concepts. As a result of these seman- 
tic connections, ambient scents can vary in their level of 
congruence with the product under evaluation. 

Given these strong semantic connections, it can be 
expected that the effect of ambient scents on product 
evaluations depends on how congruent they are with the 
product. According to the contextual correction literature 
(Martin 1986; Schwarz and Bless 1992; Wegener and Petty 
2001), the level of congruence between an extraneous cue 
and the target is indeed a determinant of whether correction 
will occur. When overlap is high, separating a person’s 
reaction toward the context from his or her reaction toward 
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the target is difficult because both the context and the target 
jointly activate similar memory concepts. As a result, cor- 
rection is less likely when overlap is high. In addition, 
extraneous cues that are incongruent with the target are con- 
sidered “inappropriate” and alert people to their potentially 
biasing influence (Campbell and Kirmani 2000; Wegener 
and Petty 2001). In contrast, cues that are congruent with 
the target are considered appropriate and therefore are less 
likely to promote correction processes. 

' Thus, it is hypothesized that when the semantic proper- 
ties of an ambient scent are highly congruent with those of 
the product, the similarity between the product and the 
scent makes it less likely for consumers to correct for the 
erroneous influence of the extraneous scent. This hypothe- 
sis is consistent with recent findings from the affect-as- 
information literature that suggest that extraneous affective 
information that bears some domain resemblance with the 
target is more likely to influence judgments than affective 
information that is unrelated to the target (Bosmans and 
Baumgartner 2005; Raghunathan, Pham, and Corfman 
2006). Thus, consumers are more likely to control for the 
influence of ambient scents when these scents are perceived 
as incongruent with the product. 

Although this expectation makes sense from a theoreti- 
cal perspective and is consistent with the existing literature 
on contextual correction (Martin 1986; Schwarz and Bless 
1992; Wegener and Petty 2001), the ambient scent literature 
has been less supportive for the argument that congruent 
scents are more likely to affect product evaluations than 
incongruent scents (Knasko 1995; Morrin and Ratneshwar 
2003). For example, people who were exposed to a choco- 
Jate scent evaluated a set of chocolate slides equally positive 
compared with people who were exposed to a baby powder 
scent (Knasko 1995). However, these studies appear to have 
investigated congruency effects in isolation and have not 
examined factors that may moderate the effects of congru- 
ent versus incongruent scents on product evaluations. 

An incongruent scent does not necessarily mean that 
consumers automatically consider its influence extraneous 
to the judgmental process or that consumers correct for its 
erroneous influence. On the basis of several streams of liter- 
ature (Martin, Seta, and Crelia 1990; Petty et al. 1993; 
Schwarz and Clore 1983), it is proposed that whether con- 
sumers control for the inappropriate influence of ambient 
scents depends on (1) how salient the ambient scent is and 
(2) how motivated consumers are to correct for erroneous 
influences. It is believed that both conditions make people 
more aware of the erroneous influence of the ambient scent. 


Salience of Ambient Scents 


In most cases, people consider their initial reaction a genu- 
ine affective response to the target (Pham 1998; Schwarz 
and Clore 1983). However, as the salience of the extraneous 
source increases, people become more aware that a source 
other than the target can be responsible for their initial reac- 
tion. As a result, its effect is usually discredited. For exam- 
ple, reminding people that the weather can influence their 
mood state eliminates the effect of the weather on judg- 
ments of life satisfaction (Schwarz and Clore 1983). Indeed, 
merely making the external source salient (e.g., by asking 
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participants, *By the way, how is the weather out there?") 
seems to eliminate the effect of the extraneous cues on 
judgments (Schwarz and Clore 1983). 

Given that, by definition, ambient scents are extraneous 
to the product, it can be expected that when they are made 
salient, their effect on product evaluations dissipates, espe- 
cially when there is little or no semantic overlap with the 
product. However, when semantic overlap is high, it is 
expected that ambient scents will continue to affect evalua- 
tions, regardless of how salient they are. This reasoning 
leads to the following hypotheses: 


H,: When the ambient scent is not salient, pleasant ambient 
scents increase product evaluations, regardless of their 
congruence with the product category. 

H5: When the ambient scent 1s salient, pleasant ambient scents 
increase product evaluations only when they are congru- 
ent with the product category. 


These hypotheses are the focus of Experiment 1. 


Motivation to Correct 


People usually assess the correctness of their initial reaction 
by elaborating on the differences and similarities between 
the extraneous cue and the target (Meyers-Levy and Stern- 
thal 1993). This implies that consumers should be suffi- 
ciently motivated to elaborate on differences and similari- 
ties between the contextual cue and the target (Petty et al. 
1993). In addition, correcting for erroneous influences is 
resource demanding and depends largely on the cognitive 
resources that a person is able or willing to devote to the 
judgmental task (Clore, Schwarz, and Conway 1994; Mar- 
tin, Seta, and Crelia 1990). Thus, correcting for extraneous 
influences is more likely under conditions of high process- 
ing motivation. Formally, 


H4: When processing motivation is low, pleasant ambient 
scents increase product evaluations, regardless of their 
congruence with the product category. 

H,: When processing motivation is high, pleasant ambient 
scents increase product evaluations only when they are 
congruent with the product category. 


These hypotheses are tested in Experiments 2 and 3. To 
increase the generalizability of the results, Experiment 2 
investigates the moderating role of processing motivation 
by examining individual differences in consumers’ need for 
cognition (NFC; Cacioppo and Petty 1982), and Experi- 
ment 3 focuses on differences in personal involvement with 
the evaluation task. 


Experiment 1: Salience of Ambient 
Scents 


Experiment 1 investigates whether consumers correct for 
the influence of pleasant ambient scents when they are 
made aware of their presence. As argued previously, it is 
expected that people are more likely to control for the influ- 
ence of ambient scents when there is little or no semantic 
overlap between the scent and the product and when they 
are aware of the presence of an extraneous source (i.e., the 
ambient scent). As a result, when the ambient scent is made 
salient, consumers are more likely to correct for its influ- 


ence when the scent is incongruent with the product cate- 
gory. When the extraneous scent is not salient, both congru- 
ent and incongruent pleasant scents are expected to increase 
product evaluations. 


Design and Measures 


The sample consisted of 80 undergraduate students, who 
received monetary compensation of $5 for their participa- 
tion. A 2 (salience of ambient scent: salient versus not 
salient) x 2 (congruence of ambient scent: congruent versus 
not congruent) between-subjects design was used. In addi- 
tion, a control condition was included in which no ambient 
scent was present. Participants were randomly assigned to 
one of the experimental conditions. 

This experiment was the first in a battery of three (unre- 
lated) consumer behavior experiments. All three experi- 
ments were conducted in different rooms (with a different 
experimenter in each room). Participants were received in 
groups of up to two and were asked to evaluate a new (ficti- 
tious) brand of orange juice (Sano), which was displayed in 
a color print advertisement that pictured the new brand, 
together with some oranges, and claimed, “Sano ... a natu- 
ral source of vitamins.” Attitudes toward the brand were 
assessed with two general evaluation items on nine-point 
scales (negative/positive and I dislike it/I like it; r = .69). 

In the salient condition, consumers were told up front 
that an ambient scent was present in the environment (with- 
out mentioning the nature of the scent) and that the experi- 
ment was intended to investigate the extent to which envi- 
ronmental scents affect consumer behavior. In the 
nonsalient condition, there was no reference to the presence 
of the ambient scent. 

In the scented conditions, the ambient scents were emit- 
ted by two pieces of scented papers (each containing three 
drops of essential oil) underneath the participants’ desk. The 
scented papers were replaced before participants entered the 
room. The experiment was run in two identical rooms (i.e., 
similar furniture and same size), and only one scent per 
room per day was used to avoid mixing odors (the different 
experimental conditions were counterbalanced between 
rooms). The rooms were fully ventilated overnight. Each 
morning, the rooms were checked by at least two experi- 
menters to exclude the possibility that scents remained ın 
the room. In the congruent condition, a citrus scent was 
emitted, and in the noncongruent condition, a forest scent 
was used. A pretest ensured that both scents differed with 
respect to the congruency with the advertised product but 
did not differ with respect to their hedonic properties. 


Pretest 


Small glass bottles that contained a cotton ball with three to 
four drops of the essential oil were presented to 30 volun- 
teer undergraduate students (adapted from Spangenberg, 
Crowley, and Henderson 1996). To avoid possible effects of 
color, opaque bottles were used. The bottles were labeled 
with random numbers and did not refer to the specific 
nature of the scents. Participants were asked to sniff the bot- 
tles at approximately six inches from the nose. They could 
sniff as many times as they considered necessary to answer 
the questions. 


Congruency of the scents was assessed by asking par- 
ticipants about the appropriateness of the scent for the 
orange juice presented in the advertisement (see Morrin and 
Ratneshwar 2003). Answers were given on a nine-point 
scale ranging from “not at all appropriate" (1) to “highly 
appropriate" (9). The citrus scent was perceived as signifi- 
cantly more appropriate for the advertised product than the 
forest scent (Matus = 4.77 versus Mg. = 1.57; t(29) = 
7.59, p « .001). 

The hedonic properties of the scents were assessed with 
items from Fisher's (1974) environmental quality scale. 
Both scents were perceived as equally pleasurable (positive/ 
negative, pleasurable/unpleasurable, comfortable/uncom- 
fortable, good/bad, and attractive/unattractive; a = .93) on a 
nine-point scale (Mat, = 4.33 versus Me, = 3.79; t(29) = 
1.60, not significant [n.s.]). In addition, no differences were 
found in arousal between scent conditions (tense/relaxed, 
boring/stimulating, unlively/lively, and dull/bright; a = .76; 
Matus = 4.13 versus Mforest = 4.43; t(29) = 1.44, n.s.). 


Results and Discussion 


A two-way analysis of variance (ANOVA) was performed 
with salience and congruence of the ambient scent as inde- 
pendent variables and evaluation of the new brand of orange 
juice as the dependent variable. In addition, the effects of 
the congruent and the noncongruent scent were compared 
with the nonscented control condition for each salient con- 
dition separately. The results appear in Figure 1. 

There was a main effect of congruence (F(1, 60) = 4.80, 
p < .05) and a marginally significant main effect of salience 
(F(1, 60) = 3.34, p = .07). More important, however, there 
was a significant interaction effect (F(1, 60) = 5.87, p = 
.01). 

In support of H,, when the ambient scent was not made 
salient, evaluations of the new brand of orange juice did not 
differ significantly as a function of congruence of the ambi- 


FIGURE 1 
Experiment 1: Mean Product Evaluations as a 
Functlon of Salience and Congruence of Ambient 
Scent 
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ent scent. Participants evaluated the brand equally positive 
when either a congruent or a noncongruent scent was pre- 
sent in the room ent = 5.58 versus Moot congruent = 
5.63; F(1, 60) < 1). Additional analyses revealed that when 
the ambient scent was not made salient, evaluations in both 
the congruent and the noncongruent ambient scent condi- 
tions were significantly more positive than evaluations in 
the nonscented control condition (Mss, = 4.94; FC, 51) = 
5.44, p < .05; and F(1, 51) = 6.37, p < .05, respectively). 

Consistent with H5, when the ambient scent was made 
salient, evaluations were more positive when the scent was 
congruent than when it was not congruent with the product 
category (Meongruent = 9.71 versus Mnot congruent = 4.67; 
F(1, 60) = 8.94, p < .01). Additional analysis revealed that 
evaluations in the congruent condition were significantly 
more positive than those in the nonscented control condi- 
tion (F(1, 39) = 4.10, p = .05). In contrast, no difference 
was found between evaluations in the noncongruent versus 
the control condition (F(1, 39) < 1). 

In support of the predictions, consumers corrected for 
the influence of ambient incongruent scents only when the 
extraneous influence of the scents was made salient. In con- 
trast, congruent ambient scents continued to influence prod- 
uct evaluations even when their extraneous influence 
became salient. 

Whereas the influence of other affective cues (e.g., 
weather, music) is discounted as soon as their extraneous 
influence becomes salient (e.g., Pham 1998; Schwarz and 
Clore 1983), the influence of ambient scents seems to be 
stronger. They continue to affect product evaluations as 
long as they are congruent with the product category and 
perceived as appropriate for evaluations, regardless of 
whether the source of the extraneous influence becomes 
salient. 


Experiment 2: Processing 
Motivation 


The purpose of Experiment 2 is threefold. A first goal is to 
examine whether consumers control for the influence of 
pleasant ambient scents when they have sufficient process- 
ing motivation to correct for their extraneous influence. Fil- 
tering out extraneous influences requires that people have 
sufficient resources available to elaborate on differences 
and similarities between the contextual cue and the target 
(Petty et al. 1993). In addition, correcting for erroneous 
influences seems to be resource demanding because people 
carefully need to distinguish their reactions toward the tar- 
get from their reactions toward the product (Martin 1986; 
Martin, Seta, and Crelia 1990). As a result, it can be 
expected that consumers correct for ambient scents only 
when they are sufficiently motivated to correct for possible 
extraneous influences. Consistent with the results obtained 
in Experiment 1 and with previous theories on the effects of 
congruent versus incongruent extraneous cues on evalua- 
tions (Meyers-Levy and Sternthal 1993), it is expected that 
this correction process is more likely for incongruent ambi- 
ent scents. 

A second purpose of Experiment 2 is to examine the 
mediating role of consumers’ affective reactions in the rela- 
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tionship between ambient scents and product evaluations. 
As argued previously, it is assumed that ambient scents act 
like affective cues and that their effect on product evalua- 
tions is due to a misattribution of people’s affective reac- 
tions (elicited by the scent) to their reaction to the product. 
Consequently, consumers’ affective reactions should medi- 
ate the hypothesized effect of (in)congruent scents on prod- 
uct evaluations. 

In Experiment 1, the ambient scent was emitted by 
attaching scented papers underneath participants’ desks. 
This was done to make the ambient scent extraneous to the 
judgmental process and to avoid participants considering 
the ambient scent diagnostic for their evaluation by, for 
example, considering the scent a sensory sample of the 
advertised product (“the advertiser wants to give me some 
information about the smell and taste of the juice"). 
Although the extraneous manipulation is reasonable from a 
theoretical point of view, it weakens the external validity of 
the findings. Indeed, advertisers may be less inclined (and 
able) to scent the environments in which consumers read 
advertisements, but they may be more inclined to scent the 
advertisements themselves. 

Therefore, a third objective of Experiment 2 is to gener- 
alize the results obtained in Experiment 1 to a more practi- 
cal setting in which the advertisement, not the environment, 
is scented. Strictly speaking, when a scent surrounds an 
advertisement, it can be considered an extraneous cue 
because it is emitted by a source other than the product 
under evaluation. 


Design and Procedure 


One hundred eighteen undergraduate students volunteered 
to participate in this study. They were randomly assigned to 
a condition of a 2 (processing motivation: high versus 
low) x 3 (congruence of ambient scent: congruent, not con- 
gruent, or nonscented control) factorial between-subjects 
design. 

As in Experiment 1, respondents participated in groups 
of no more than two and were asked to evaluate a new (fic- 
titious) brand of grapefruit juice (Pompeltje) that was pre- 
sented to them as a colored print advertisement. The adver- 
tisement displayed the new brand, together with the image 
of a grapefruit, and stated, “Pompeltje: All Fruit!" In addi- 
tion, some product claims were provided that promoted the 
benefits of the juice. 

In this context, NFC (Cacioppo and Petty 1982), a 
measured individual difference variable, was considered an 
index for consumers’ processing motivation. People who 
score high on NFC usually think about things thoroughly 
and, in general, are motivated to elaborate on information. 
People who score low on the NFC measures are 
characterized by shallow information processing and, most 
of the time, are not motivated to process. Need for cognition 
(on a scale ranging from —3 to +3) was measured at the end 
of Experiment 2, after some unrelated filler tasks. 
Participants with mean scores below the NFC median 
(Mdnrc = .97) were assigned to the low-processing- 
motivation condition (Mio, processing motivation = -2/), and 


-participants who scored greater than or equal to the median 


were assigned to the high-processing-motivation condition 


(Mhaigh processing motivation = 1.52). A t-test revealed that both 
conditions differed significantly with respect to their mean 
NFC scores (t(116) = 15.03, p < .001). 

The advertisement was scented by attaching a scented 
paper (containing one to two drops of essential oil) to the 
flip side of the advertisement. In the congruent condition, 
the advertisement was scented with a citrus scent; and in the 
noncongruent condition, a lavender scent was used. 


Pretest 


Again, the congruence of the scents with the product adver- 
tised and their hedonic properties were assessed in a pretest. 
The procedure was similar to the one described in the 
pretest of Experiment 1. The citrus scent was perceived as 
significantly more appropriate for the advertised product 
than was the lavender scent (Motus = 4.06 versus 
Miavender = 2-63; t(29) = 5.58, p < .001). In addition, both 
scents had similar hedonic qualities (pleasure measures: 
Matus = 4.33 versus Migvender = 3-88; (29) = 1.30, n.s.; 
arousal measures: Matas = 4.43 versus Miavender = 4.17; 
t(29) = 1.18, n.s.). 


Dependent Measures 


Evaluation of the new brand of grapefruit juice was 
assessed with four general evaluation items (negative/posi- 
tive, undesirable/desirable, I dislike it/I like it, and bad/ 
good; a = .90). In addition, participants’ affective and cog- 
nitive reactions were measured by asking them about their 
feelings (depressing/cheerful, ugly/beautiful, and irritating/ 
nice; & = .74) and cognitions (not useful/useful, not effi- 
cient/efficient, and informative/not informative; « = .76) 
about the advertisement that was presented to them. A fac- 
tor analysis revealed that the affect-related items loaded on 
the affective factor (explaining 33% of the total variance), 
whereas the cognition-related items loaded on the cognitive 
factor (explaining 36% of the total variance). All factor 
loadings were greater than .66. If ambient scents are indeed 
affective cues, congruence of ambient scents should mainly 
affect consumers’ affective reactions (i.e., how pleasurable 
the scented advertisement is) and not, or to a lesser extent, 
consumers’ cognitive reactions (i.e., how useful and inter- 
esting the scented advertisement is). 

The ambient scent was not made salient in any of the 
conditions. Awareness of the scent was controlled for by 
asking participants to describe their experiences when par- 
ticipating in the experiment (because the experiment was 
designed presumably to examine people’s experiences when 
participating in experiments in general) as precisely and as 
detailed as possible. They were also asked to “focus on the 
things you have noticed or that may have struck you.” None 
of the participants spontaneously referred to the presence of 
the ambient scent. 


Results and Discussion 


Product evaluations. A two-way ANOVA revealed a 
main effect of processing motivation (F(1, 112) = 4.37, p « 
.05) and a main effect of congruence of ambient scent 
(F(2, 112) = 9.76, p < .001). More important, however, a 
significant interaction effect was observed (F(2, 112) = 
5.00, p < .01). These results appear in Figure 2. 


FIGURE 2 
Experiment 2: Mean Product Evaluations as a 
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Planned comparisons showed that in the low- 
processing-motivation condition, and consistent with Ha, 
evaluations in both the congruent (Moongruent = 5.56) and the 
noncongruent (Mio congruent = 5-47) condition were signifi- 
cantly more positive than those in the nonscented control 
condition (Moontro} = 4.81; F(1, 112) = 7.54, p < .01, and 
F(1, 112) = 5.40, p « .05, respectively). No difference was 
found between evaluations in the congruent and the non- 
congruent ambient scent conditions (F(1, 112) < 1). 

In contrast, and consistent with H4, when processing 
motivation was high, evaluations in the congruent condition 
(Mecongrnent = 5-63) were significantly more positive than 
those in the noncongruent (Moot congruent = 4.42) and non- 
scented control (Meontrol = 4.78; F(1, 112) = 18.78, p < .001, 
and F(1, 112) — 9.88, p « .01, respectively) conditions. No 
difference was found between evaluations in the noncon- 
gruent and nonscented control conditions (F(1, 112) = 1.81, 
p» 18). 


Affective versus cognitive reactions. A three-way 
ANOVA with processing motivation and congruence of 
ambient scent as between-subjects variables and reaction 
(affective versus cognitive) as a within-subjects variable 
was performed to gain insight into the effect of ambient 
scents on the specific nature of consumers’ reactions. This 
analysis revealed a significant main effect of congruence 
(FQ, 112) = 7.73, p < .001) and a significant main effect of 
reaction (F(1, 112) = 42.90, p « .001). Overall, reactions 
were more positive when the ambient scent was congruent 
(Meongruent = 5-17) than when it was not (Mao congruent = 
4.87) or when there was no scent present (Moontro) = 4.54). 
In addition, affective reactions were more positive than cog- 
nitive reactions (Mag, = 5.08 versus Meognitve = 4-63). 
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More important, however, a significant three-way interac- 
tion was observed (F(2, 112) = 3.64, p < .05). The results 
appear in Figure 3. 

Planned comparisons revealed a marginally significant 
two-way interaction between processing motivation and 
congruence on participants’ affective reactions (F(2, 112) = 
2.45, p = .09). When processing motivation was low, affec- 
tive reactions were more positive when the ambient scent 
was either congruent (Moongment = 5-58) or not congruent 
(Mot congruent = 9-30) compared with the nonscented control 
condition (Meontroi = 4.63; F(1, 112) = 15.37, p < .001, and 
F(1, 112) = 6.94, p < .01, respectively). When processing 
motivation was high, affective reactions were more positive 
than the nonscented control condition (Mott; = 4.89) only 
when the scent was congruent (Mcongment, = 3-39; 
F(1, 112) = 2.89, p = .09, marginally significant). No differ- 
ence was found between affective reactions in the noncon- 
gruent condition (Mpot congruent = 4-82) and in the non- 
scented control condition (F(1, 112) < 1). These results 
resemble those of participants’ overall evaluation scores. 

With regard to participants’ cognitive reactions, only a 
significant main effect of congruence was found 
(F(1, 112) = 4.74, p = .01), indicating that compared with 
the nonscented control condition (Meontro] = 4.31), cognitive 
reactions were more positive when a congruent scent 
(Moongruent = 4-90) or a noncongruent scent (Maot congruent = 
4.68) was present (F(1, 112) = 9.31, p < .01, and 
F(1, 112) = 3.56, p = .06, respectively). No difference in 
cognitive reactions was found between the congruent and 
the noncongruent conditions (F(1, 112) = 1.21, n.s.). These 
results suggest that the presence of a pleasant ambient 


FIGURE 3 
Experiment 2: Mean Affective and Cognitive 
Reactions as a Function of Processing Motivation 
and Congruence of Amblent Scent 
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scent, regardless of its level of congruence with the product 
category, is likely to increase consumers' cognitive 
reactions. 


Affective mediation. It is expected that consumers’ 
affective reactions mediate the effect of the congruence of 
the ambient scent on product evaluations. Structural equa- 
tion modeling (AMOS, implemented in SPSS) was used to 
investigate the mediating role of consumers’ affective reac- 
tions in the influence of ambient scents on product 
evaluations. 

As indicated by the results of the low-processing- 
motivation condition, no significant effect of congruent ver- 
sus noncongruent scents was found on product evaluations 
(B = .08, n.s.). In the high-processing-motivation condition, 
the direct effect of the congruence of the ambient scent on 
product evaluations (B = .52, p < .001) was significantly 
mediated by consumers’ affective reactions and not, or to a 
lesser extent, by their cognitive reactions. The results of the 
structural model (%2(40) = 67.04, p < .01; comparative fit 
index = .87) indicate that when consumers’ affective and 
cognitive reactions are included, the relationship between 
the congruence of the ambient scent and product evalua- 
tions drops to .33 (p < .05). Both consumers’ affective (B = 
56, p « .01) and cognitive (B = .33, p « .05) reactions are 
predictive for consumers' final evaluations, but congruence 
is a significant predictor only for consumers' affective reac- 
tions (B = .31, p = .09). Congruence does not predict con- 
sumers' cognitive reactions (B = .26, n.s.). Thus, as 
expected, consumers' affective reactions seem to mediate 
the relationship between the congruence of ambient scents 
and product evaluations. 

As in Experiment 1, it was found that congruent extra- 
neous scents continue to influence final product evaluation. 
In addition and consistent with expectations, consumers 
seem to control for the influence of incongruent scents only 
when they are sufficiently motivated to correct for erro- 
neous influences. Moreover, consumers' affective reactions 
in response to the scent mediated the effect of congruence 
on product evaluations. 


Experiment 3: Processing 
Motivation and Degree of 
Congruence 


Experiments 1 and 2 show that the effects of pleasant ambi- 
ent scents may be stronger than the effects of other affective 
cues in that as long as they are congruent with the product 
category, they continue to affect product evaluations even 
when the scent is made salient or when consumers have suf- 
ficient processing motivation to correct for extraneous 
influences. In addition, when a scent is incongruent, con- 
sumers seem to correct for it only when the scent becomes 
salient or when they have sufficient processing resources to 
correct for extraneous influences. These results can be 
explained by the notion that the stronger the semantic over- 
lap between the ambient scent and the product, the more 
difficult it is to distinguish reactions toward extraneous cues 
from reactions toward the target. Indeed, the weaker the 
semantic link between the extraneous cue and the target, the 


fewer cognitive resources seem to be required to detect and 
filter out inappropriate extraneous influences (Meyers-Levy 
and Tybout 1997). As a result, it can be expected that the 
more incongruent the scent is with the product, the more 
likely it will be that consumers will correct for its extrane- 
ous influence. The main purpose of Experiment 3 is to 
investigate the effects of level of perceived congruence of 
the ambient scent on consumers’ product evaluations. More 
specifically, 


Hs: When the ambient scent is highly congruent with the 
product category, ıt continues to affect product evalua- 
tions, regardless of consumers’ processing motivation. 


Hs: When the ambient scent is of medium congruence with 
the product category, it affects product evaluations when 
processing motivation 1s low but not when processing 
motivation 1s high. 

H7: When the ambient scent is incongruent with the product 
category, it does not affect product evaluations 


Thus, consistent with the results of Experiments 1 and 
2, scents that are highly congruent with the product cate- 
gory are expected to continue to affect judgments, regard- 
less of whether consumers are sufficiently motivated to cor- 
rect for extraneous influences. In addition, however, it is 
argued that whether consumers correct for the effects of 
incongruent ambient scents depends on the degree of incon- 
gruence between the scent and the product. Given that 
fewer resources seem to be required to detect and filter out 
highly incongruent external influences, it is proposed that 
whereas medium incongruent scents are corrected for only 
when consumers are sufficiently motivated to discount 
extraneous influences, the effects of highly incongruent 
scents are corrected for regardless of consumers’ processing 
motivations. 

Another purpose of Experiment 3 is to generalize the 
results obtained in Experiments 1 and 2 to a retail environ- 
ment. Whereas participants in Experiments 1 and 2 were 
asked to evaluate new brands in an advertisement context, 
Experiment 3 investigates the effect of ambient scents in a 
setting in which people are asked to evaluate autbentic 
products. In addition, instead of emitting scents with 
scented papers (either in the environment or attached on the 
flip side of an advertisement), the room was scented with an 
electric diffuser, as in in-store environments. 


Design and Procedure 


The sample consisted of 75 undergraduate students, who 
were compensated $5 for their participation. The design 
was a 2 (processing motivation: high versus low) x 2 (pres- 
ence of ambient scent: pleasant banana scent versus non- 
scented control) x 3 (product: banana, apple, or tomato) 
mixed-subjects design. 

As in Experiments 1 and 2, groups comprised no more 
than two participants, and they were asked to evaluate sev- 
eral products that were highly congruent, of medium con- 
gruence, or incongruent with the ambient scent that was 
present in the environment. In the scented conditions, the 
smell of bananas was distributed by means of an electric 
diffuser that was placed out of participants’ sight. The 
experiment was the first in a battery of three (unrelated) 


consumer behavior experiments that were all conducted in 
different rooms. Each morning, the consumer lab was 
scented ten minutes before participants were scheduled to 
arrive. The diffuser emitted the scent continuously into the 
lab's atmosphere at a medium intensity level. The lab was 
fully ventilated overnight. 

The study was presented as being about the evaluation 
of "the fruits and vegetables of a new (fictitious) fruit auc- 
tion company, ‘Westham Corporation.” Processing motiva- 
tion, a between-subjects factor, was manipulated by increas- 
ing participants’ personal involvement with the task (see 
also MacInnis and Park 1991). In the low-processing- 
motivation condition, participants were instructed to “evalu- 
ate these fruits and vegetables that are auctioned by the new 
fruit auction company Westham. This company is conduct- 
ing a lot of interviews these days.” In the high-processing- 
motivation condition, participants were instructed to “evalu- 
ate these fruits and vegetables that are auctioned by the new 
fruit auction company Westham. They appreciate your opin- 
ion a lot, and after your evaluation task, a manager of the 
corporation will interview you about your opinion.” After- 
ward, participants were asked to evaluate three products: a 
banana (highly congruent with the ambient scent), an apple 
(medium congruent with the ambient scent), and a tomato 
(incongruent with the ambient scent). The experimenter 
handed one fruit product at a time to participants, and they 
were asked to evaluate them subsequently. The order in 
which the fruits were presented to participants was counter- 
balanced across conditions. A pretest revealed that partici- 
pants perceived the banana scent as more appropriate for a 
banana (Mbanana = 4.43) than for an apple (Mappie = 3.57; 
t(29) = 10.47, p < .01) or for a tomato (Migmato = 1.73; 
t(29) = 30.16, p « .001). In addition, the banana scent was 
more appropriate for an apple than for a tomato (t(29) = 
7.73, p « .01). 


Dependent Measures 


Evaluations of the fruits and vegetables were assessed by 
means of four nine-point evaluation items (1.e., fresh, good, 
I like it, and tasty; & = .52). In addition, the effectiveness of 
the processing motivation manipulation was assessed by 
asking participants about their involvement when evaluating 
the products (this research is unimportant/important to me, 
does not concern/concerns me, means nothing to me/means 
a lot to me; a = .81). Again, no reference was made to the 
presence of the ambient scent. In the debriefing, none of the 
participants indicated that they noticed the presence of a 
scent. 


Results and Discussion 


Manipulation check processing motivation. A two-way 
ANOVA with processing motivation and the presence of an 
ambient scent as between-subjects factors and involvement 
with the task as a dependent variable revealed only a main 
effect of processing motivation (F(1, 71) = 5.23, p < .05). 
Participants in the low-processing-motivation condition 
reported being less involved (Miow processing motivation = 3-98) 
than participants in the high-processing-motivation condi- 
tion (Mish processing motivation ~ 4.24). 
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Product evaluations. The ANOVA with processing 
motivation and the presence of an ambient scent as 
between-subjects variables and product as the within- 
subjects variable revealed a significant three-way interac- 
tion effect (F(2, 142) = 3.31, p < .05). Other than a margin- 
ally significant main effect of the presence of an ambient 
scent (F(1,71) = 3.63, p = .06), indicating that, overall, 
evaluations were more positive when a scent was present 
than when a scent was not present (M,,.e, = 2.06 versus 
Meontro! = 1.16), no other effects were significant. The 
results appear in Figure 4. 

When the product was highly congruent with the scent, 
neither the main effect of processing motivation nor the 
interaction effect was significant (both Fs < 1). Consistent 
with Hs, a significant main effect of the presence of an 
ambient scent was found (F(1, 71) = 4.97, p < .05), indicat- 
ing that evaluations of the banana were more positive when 
a scent was present than when a scent was not present 
(Mbanana scent = 5.48 Versus Mpanana control = 4-98), regardless 
of processing motivation. 

When the product was of medium congruence with the 
scent, the interaction between the presence of an ambient 
scent and processing motivation was significant (F(1, 71) = 
5.64, p < .05). Consistent with Hs and with the results of 
Experiments 1 and 2, when processing motivation was low, 
evaluations of the apple were more positive when a scent 
was present than when a scent was not present 
(Mapple scent = 9.69 versus Mapple contro] = 4.89; F(1, 71) = 
5.67, p « .05). When processing motivation was high, no 
difference was found between the scented condition and the 
nonscented control condition (F(1, 71) = 1.05, ms.; 
Mapple scent = 4.94 versus Mapple control = 5.29). Other than 
this interaction effect, no other effects were significant (all 
Fs < 1). 


Finally, when the product was highly incongruent with 
the scent, none of the main effects (both Fs < 1) nor the 
interaction effect (F(1, 71) = 1.29, n.s.) was significant. 
Consistent with H7, when the scent was highly incongruent, 
consumers found it easier to control for its possible erro- 
neous influence, regardless of their processing motivation. 

These results support H;-H; and demonstrate that 
whether consumers correct for the influence of ambient 
scents depends on the level of congruence of the scents with 
the product category. The stronger the semantic linkage 
between the scent and the product, the more difficult it is to 
correct for extraneous influences, and the more resources 
are needed to filter out the extraneous influences. Consis- 
tent with this, when the product was highly incongruent 
with the ambient scents, respondents corrected for extrane- 
ous influences, even when they were not motivated to 
process. In contrast, when overlap between the scent and 
the product was high, scents continued to influence partici- 
pants' evaluations, regardless of processing motivation. 
Only when the product was of medium congruence with the 
scent did consumers correct for its influence when they 
were highly motivated to process. 


General Discussion 


The three experiments demonstrate that ambient scents, 
which are extraneous to consumers' judgmental process, 
can have a strong influence on consumers' product evalua- 
tions. As long as ambient scents are congruent with the 
product category, they continue to affect product judgments, 
even when they become salient or when consumers are suf- 
ficiently motivated to discount their potential influences. In 
addition, scents that are not congruent with the product 
category can still influence consumers’ product evaluations 


FIGURE 4 
Experiment 3: Mean Product Evaluations as a Function of Processing Motivation, Presence of the 
Ambient Scent, and Product 
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when they are not salient or when consumers lack process- 
ing motivation. Only when the ambient scent is completely 
incongruent with the product do consumers seem to be able 
to cope well with their possible influence. 

This research reveals three important conditions on 
which scent effects are contingent: (1) the congruence of 
the scent with the product category, (2) the salience of the 
extraneous influence, and (3) consumers’ motivation to cor- 
rect for possible extraneous influences. Whereas previous 
studies have convincingly shown that congruent scents have 
beneficial effects on variety seeking, information search, 
and recall (e.g., Mitchell, Kahn, and Knasko 1995; Morrin 
and Ratneshwar 2003), research has been less conclusive 
about the impact of congruency on product evaluations 
(Knasko 1995; Morrin and Ratneshwar 2003). By taking 
into account not only the congruence of a scent with the 
product category but also its salience and consumers' pro- 
cessing motivation, the current findings clarify the mixed 
results that were previously obtained with regard to the 
impact of ambient scents on product evaluations and the 
moderating role of consumers' awareness of ambient scents. 

The results further show that ambient scents can act as 
affective cues, providing affective information for product 
evaluation. The results of Experiment 2 suggest that con- 
sumers' affective reactions (as a result of the pleasantness of 
the scent) are predictive for product evaluation and mediate 
the effect of congruence of the scent on judgments. How- 
ever, note that the results do not exclude the possibility that 
ambient scents influence cognitive reactions, as, for exam- 
ple, Chebat and Michon (2003) suggest. Indeed, the results 
obtained in Experiment 2 suggest that cognitive reactions 
were more positive as a result of the presence of a pleasant 
ambient scent. However, cognitive reactions were affected 
regardless of the congruence of the scent and consumers' 
processing motivation, and they did not mediate the effect 
of congruence on product evaluations. Further research is 
necessary to investigate these unexpected results in closer 
detail and to clarify how the presence of a pleasant ambient 
scent influences consumers' cognitive reactions. 

Research related to the impact of affective cues on per- 
suasion has mostly concentrated on the pleasantness dimen- 
sion of the affective cues (i.e., how positive or negative the 
cue is). In addition to their level of pleasantness, ambient 
scents provide more semantic information that increases or 
decreases their congruence with the target product. Accord- 
ing to the results obtained in Experiments 1—3, this congru- 
ence dimension is a necessary condition for explaining the 
effects of ambient scents on product evaluations. Whether 
congruence is also important for explaining the effects of 
other affective cues remains unclear, but given that music, 
for example, can also provide information that goes beyond 
its level of pleasantness (North, Hargreaves, and McKen- 
drick 1999), it would be interesting for further research to 
investigate how important the congruence dimension 1s for 
affective cues other than ambient scents. 

When participants discounted the influence of ambient 
scents, they seemed to be able to filter out the extraneous 
influences. That is, contrast or overcorrection effects 


(resulting in more negative evaluations), which are fre- 
quently observed in the contextual correction literature 
(e.g., Martin, Seta, and Crelia 1990), were never observed. 
These results suggest that the use of pleasant ambient scents 
never hurts. However, further research is necessary to inves- 
tigate the possible occurrence of overcorrection in the pres- 
ence of incongruent ambient scents. In addition, whereas it 
is assumed in general that correction for extraneous influ- 
ences is a conscious and deliberative process, the results of 
the thought protocols suggest that correction for ambient 
scents operates below consumers' level of awareness. 
Indeed, participants did not refer spontaneously to the 
(influence of the) ambient scent in any of the thought proto- 
cols. Further research 1s needed to investigate the degree to 
which consumers are truly conscious about this correction 
process. 


Managerial Implications 


Ambient scents can be used in a variety of practical set- 
tings. Retailers can use in-store scent diffusers, advertisers 
can use scented advertisements, package designers can use 
scented wrappings, and so forth. The results of this study 
suggest that as long as the ambient scent is congruent with 
the product category, it will have a positive effect on con- 
sumers' product evaluations, regardless of how salient the 
scent is or how motivated (or involved) consumers are. This 
implies that using ambient scents to embellish store atmos- 
pherics or to increase consumers' overall evaluation may be 
more effective than using other affective cues, such as 
music or colors. Whereas the effects of other ambient 
stimuli are believed to dissipate when these cues become 
salient or when consumers’ processing motivation 
increases, the effects of congruent ambient scents continue 
to affect evaluations. 

Retailers may be worried about the effects of incongru- 
ent scents (e.g., when the scent of the bakery department 
reaches the detergents department, or vice versa; when the 
Channel No. 5 fragrance emitted at the entrance reaches the 
fresh vegetables and fruit department). Whether these 
scents influence consumers' product evaluations largely 
depends on bow salient the scent is or how motivated con- 
sumers are. As long as the ambient scent is not too salient or 
consumers' processing motivation (as a result of either indi- 
vidual differences or increased purchase/product involve- 
ment) is not too high, these pleasant scents will have a posi- 
tive impact on product evaluations. More important, 
however, the results suggest that these incongruent scents 
will not hurt evaluations; evaluations were not significantly 
worse in any of the incongruent scent conditions than in the 
nonscented control conditions. 

In summary, using pleasant smells can create promising 
opportunities for marketing practice. On the one hand, 
ambient scents are more effective as their congruence with 
the product category increases. On the other hand, mar- 
keters should not be worried about effects of these scents on 
incongruent product categories, because incongruent scents 
do not seem to hurt overall product evaluations. 
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From Fear to Loathing? How 
Emotion Influences the Evaluation 
and Early Use of Innovations 


Innovation adoption is rarely a short process for consumers; accordingly, recent research has explored adoption as 
a dynamic process that is characterized by changing patterns, or diffusion, of consumer use of the innovation. This 
research suggests that adoption is rarely a neutral process and that consumers can experience strong emotions in 
the initial use of innovations. However, given such emotions, two opposing arguments can be made as to whether 
the inclusion of emotional responses increases the predictive power of traditional models of diffusion. On the one 
hand, experienced emotion may simply be a function of gained benefits and, as such, may already be captured in 
extant models through cognitivo assessments of net benefits. On the other hand, and as data from two empirical 
and longitudinal studies demonstrate, the learning process is potentially emotion generating (independent of net 
benefits), and this emotion colors product evaluations. The emotional influence is sizable and, importantly, not a 
straightforward case of “easier is better” In this work, the authors present the E3 (expectation — emotion > 
evaluation) model, which describes how managers can better predict and influence the successful diffusion of 


complex technological products. 


ket at an unprecedented pace, recent reports have 
highlighted the influence of consumers' expecta- 
tions of and actual experiences in using a product on a prod- 
uct's ultimate success. For example, perceptions of usage 
difficulty have caused a significant number of consumers to 
delay purchases (e.g., 48% of potential digital camera buy- 
ers), and actual usage difficulty has caused them to return 
previously purchased products (e.g., 30% of all home net- 
working products; Endt 2004, p. E8). Epitomizing a com- 
pany caught by these trends, Royal Philips Electronics, 
which has suffered as a result of consumers' rejection of 
complex new products, recorded huge losses in 2000 and 
2001 and reduced its workforce by 55,000 employees (The 
Economist 2004). To understand the scope of the problem, 
Chief Executive Officer Gerard Kleisterlee sent 100 Philips 
managers home one weekend with an assignment to get 
various Philips gadgets to operate. Many failed and 
"returned frustrated and some even angry, whereas 


A: technological innovations reach the consumer mar- 
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“another group that succeeded returned quite proud” (Endt 
2004, p. E8). These managers were struck by the strength of 
their own response to product complexity in use, but we 
note the emotional nature of the managers’ responses. 

As this example and our own experiences show, learn- 
ing to use a new product can evoke an emotional response, 
independent of the emotions produced by the attributes or 
benefits of the product itself. The goal of this research is to 
understand the underlying causes of this type of emotional 
response and study its ramifications for complex product 
innovations.! Although the majority of prior innovation 
research has focused on a point of adoption, the field has 
observed recent calls for more process-oriented research 
(e.g., Golder and Tellis 1998; Rogers 1996) to look beyond 
purchase and address how early use affects ultimate adop- 
tion (Shih and Venkatesh 2004). This usage focus is increas- 
ingly important as managers strive to understand why some 
consumers fail to keep, or to keep using, new products. To 
better understand this consumer-centric process of innova- 
tion, we develop the E? (expectation — emotion — evalua- 
tion) model of emotional influence, test the model in two 
studies, and describe how managers can use the findings to 
help predict and influence new product success. 


lAlthough many of the 4000 diffusion articles and books pub- 
lished to date have demonstrated that perceived product complex- 
ity causes consumers to delay or avoid the purchase of a new prod- 
uct, fewer than 196 have examined the influence of perceived 
complexity on postadoption behavior (Rogers 1996, p. 91). This 
sparse attention is surprising, given that consumers' initial usage 
experiences are complex, especially for technological products 
(Mick and Fournier 1998), and are likely to influence product 
return rate, future brand loyalty, word-of-mouth communications, 
and desire to purchase other technologies. 
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Expectation, Emotion, and 
Evaluation 


It has been well established that new product success can be 
reliably predicted by an evaluation of the product’s benefits, 
costs, and relative advantage over competing alternatives 
(Moreau, Lehmann, and Markman 2001; Rogers 1996). In 
Rogers’s (1996) classic work on diffusion, he also identifies 
the innovation's “complexity” (i.e., the perceived magni- 
tude of the learning costs required to achieve its benefits) as 
an indicator of an innovation's success. Complexity is typi- 
cally indicative of slower diffusion rates (Rogers 1996) and 
can even create disutility through "feature fatigue" (Thomp- 
son, Hamilton, and Rust 2005). In the formative period of 
trial and early use of new, complex products, consumers 
learn how to achieve promised benefits. Rogers calls this 
"how-to" knowledge and acknowledges that it has been 
overlooked to date as a key factor in diffusion success. 
However, interest in use is growing; to wit, Shih and 
Venkatesh (2004) recently identified “usage diffusion" as an 
important conceptualization to understand better when con- 
sumers will fully integrate a newly adopted innovation into 
their lives and thus create a strong foundation for the inno- 
vation's current profitability and future growth. 

simply bringing consumers to the point of trying a new 
innovation, let alone establishing regular use, is a difficult 
process (Meuter et al. 2005). Thus, if firms expend great 
effort to bring consumers to the point of trial, it seems 
important that firms then act to encourage a trial environ- 
ment that is conducive to positive evaluation. Our research 
demonstrates how consumers' emotions play an influential 
role in early usage environments. We show that a significant 
proportion of these emotions results from the disconfirma- 
tion of consumer expectations about ease of use, implying 
that a firm may either predict or influence the consumer's 
emotional experience. Perhaps the most significant contri- 
bution of this research is the demonstration that emotions 
experienced in early use of the new product are not simply 
reflective of achieved benefits (or risks) but result directly 
from learning efforts. These emotions, both immediately 


and over time, significantly influence product evaluations, 
even after we account for the traditional diffusion 
indicators. 


The E3 Model of Emotional Influence 


We present a five-step conceptual model that describes how 
emotion influences the evaluation of complex innovations. 
This model fits well within the traditional models of diffu- 
sion but is unique in its focus on the important how-to 
period when consumers first learn to use a new product. The 
five-step E? model appears in Figure 1. The core steps that 
describe consumers' experience of early use and the theo- 
retical process by which emotions may influence diffusion 
(Steps 2, 3, and 4) are foundational to our theoretical contri- 
bution. The model also addresses how firms can either 
predict or influence the process through relevant and 
actionable antecedents (Step 1) and whether relatively 
ephemeral emotions can affect more lasting evaluations 
(Step 5). 


Steps 1 and 2: Complexity Expectations and Thelr 
(Dis)confirmation 


In general, expectations matter. In marketing, the power of 
expectations is demonstrated in satisfaction research (e.g., 
Oliver and Swan 1989); thus, we consider the types of 
expectations that may be particularly influential in the early 
stages of new product learning. We posit that complexity 
expectations (e.g., “How difficult will it be to begin using 
this?") are critical for predicting both emotions and product 
evaluations. 


Defining complexity expectations. We define “complex- 
ity expectations" as a priori predictions about ease of use, 
learning time, and learning difficulty. Theoretically, and 
akin to gap models of satisfaction, this conceptualization 
implies that a new product experience may be challenging 
but not necessarily disappointing if difficulty is expected. 

This raises two issues that require careful conceptual 
distinction. First, although the satisfaction literature empha- 
sizes the importance of expectations, these are expectations 


i FIGURE 1 
The E? Model of the Influence of Complexity Expectations on Innovation Evaluation and Diffusion 
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for the products’ promised net benefits (e.g., Cadotte, 
Woodruff, and Jenkins 1987) and thus are captured in exist- 
ing diffusion models through Rogers’s (1996) long-standing 
inclusion of net benefits. This work is not a mere replication 
of previous satisfaction research. Here, we address con- 
sumers' expectations not for the product's ultimate benefits 
but for its complexity in learning. In other words, we focus 
on what consumers expect during their early usage experi- 
ence (e.g., the rate at which they can achieve those benefits, 
the ease of learning, and their own affective response), and 
we posit that this is not a trivial construct. Thus, we investi- 
gate the potential for complexity expectations to influence 
product evaluation beyond the strong influence of the 
expected (and perceived) benefits and costs of the product. 
As we noted previously, extant diffusion models already 
successfully predict responses to new products (e.g., 
Moreau, Lehmann, and Markman 2001; Olshavsky and 
Spreng 1996), and the demonstration of any additive influ- 
ence due to consumers' emotional response to initial usage 
complexity would have broad ramifications for new product 
research. 

Second, complexity expectations should be more 
impactful when products are innovative than when they are 
new additions to well-understood product categories.? For 
some new products (e.g., a new orange juice), there is little 
uncertainty about the difficulty of use (Phillips and Baum- 
gartner 2002). However, in cases in which the product 
requires adaptation or learning (e.g., Palm Pilots), usage 
uncertainty abounds (Oliver and Winer 1987). Thus, com- 
plexity expectations are likely to be influential only if prod- 
ucts are both new and relatively complex. This leads to 
what is perhaps a marketplace paradox: The emotional 
experience of initial use is more influential for technologi- 
cal or functional innovations (e.g., a computer program, a 
Global Positioning System in an automobile) than for sim- 
ple experiential or aesthetic products (e.g., a book, a music 
CD, a candle), which are typically associated with (and 
designed to elicit) an emotional response. However, this 
paradox is not so counterintuitive in light of the notion that 
the emotional response we investigate herein is that which 
arises from learning, not that which is tied directly to prod- 
uct attributes or benefits. 


How are complexity expectations formed? For firms to 
be able to predict or influence consumers' complexity 
expectations, they must understand how such expectations 
are formed. As witb attitudes and preferences, we believe 
that complexity expectations are often constructed rather 
than retrieved, using both internal and external inputs— 
namely, (1) consumers' relevant prior knowledge, (2) the 
observable experiences of others, and (3) communications 
from marketers (Fournier and Mick 1999; Oliver and Winer 
1987; Shih and Venkatesh 2004). Although complexity 


7[t can also be argued that benefit-oriented expectations become 
more complex with new product categories, and though this topic 
1s still an 1mportant area for further research, it is addressed in the 
work of Meyers-Levy and Tybout (1989), Moreau, Lehmann, and 
Markman (2001), and Hoeffler and Ariely (1999). 


46 / Journal of Marketing, July 2006 


expectations may arise from several different factors, we 
focus on two relevant factors: product usage information 
provided by the retailer (a marketer action) and consumers’ 
prior expertise (a latent consumer characteristic). According 
to Oliver and Winer (1987), both of these types of informa- 
tion are hypothesized to influence the formation of expecta- 
tions. Notably, these two factors may also interact so that an 
expert consumer may be differentially affected by marketer/ 
retailer-provided information compared with a novice 
consumer. 

Complexity expectations may arise from marketer com- 
munications. We have discussed various conceptual ways 
that expectations can affect product evaluations. In many 
practical ways, managers can influence complexity expecta- 
tions, such as when they show a product being used in com- 
mercials, describe usage instructions in online formats, or 
use other high-involvement sales channels (e.g., sales repre- 
sentatives, infomercials). A tactic that marketers increas- 
ingly employ is the in-store product demonstration (Raphel 
and Raphel 1994). Product demonstrations reduce the con- 
sumer's perceived uncertainty before purchase and have 
become an important marketing-mix variable in the auto- 
mobile and computer industries (Heiman and Muller 1996) 
and in business-to-business domains (Gopalakrishna et al. 
1995). Product demonstrations can be both benefit oriented 
(e.g., let consumers gain information about experience 
attributes) and process oriented (e.g., teach consumers how 
to set up or use a product feature). Heiman and Muller 
(1996) suggest that the benefit-oriented value of demonstra- 
tions should be either positive or negative depending on 
situational characteristics (e.g., the type of product [or prod- 
uct attribute] and its inherent risk). Although Heiman and 
Muller do not address postpurchase evaluation and satisfac- 
tion, they show strong empirical evidence that the efficacy 
of demonstrations on purchase probabilities (they focus on 
benefit-oriented demonstrations) varies with consumers' 
product knowledge. Thus, it is easy to argue that the 
process-oriented value of demonstrations similarly depends 
on situational characteristics; in this case, the value of the 
demonstration depends on whether consumers set high or 
low expectations for their own difficulty in learning about 
and using the product. In line with Heiman and Muller's 
findings, a consumer's expertise may be an important factor 
in the expectations he or she sets both before and after the 
demonstration. 

Thus, we posit that consumers' product category exper- 
tise affects both initial complexity expectations and the 
extent to which these expectations are influenced by prod- 
uct demonstrations. Research in expertise suggests the 
likely pattern of this influence. Specifically, novices may 
not be able to imagine what they do not know (Kruger and 
Dunning 1999) and thus may be more positive when there is 
no demonstration. When a demonstration is provided for an 
unfamiliar product, the product demonstration may increase 
both the specificity and the negativity of the novices' expec- 
tations (McGill and Iacobucci 1992). Effectively, the 


3We thank an anonymous reviewer for articulating this point. 


demonstration may make the technological intricacies 
required to master the new product highly salient, increas- 
ing the novices’ expectations (perhaps unrealistically) for 
difficulty of use. Conversely, expert consumers often 
approach new product use confidently (Alba and Hutchin- 
son 2000). Because they rely heavily on their own prior 
experiences, experts often pay little attention to any usage 
information the firm provides (Wood and Lynch 2002). For- 
mally, we predict the following: 


Hy: Product demonstration (a) increases novice consumers’ 
expectations that the product will be difficult to use but (b) 
does not affect expert consumers’ expectations. 


We predict that demonstrations will not influence 
experts’ expectations, because experts base their expecta- 
tions largely on information stored in long-term memory, 
the ambiguity of which has been discounted (Oliver and 
Winer 1987). Such discounting diminishes the biasness of 
the expectation, and thus disconfirmation is less likely to 
occur. Conversely, novices are more likely to hold biased 
expectations, which can be disconfirmed in one of two 
ways: (1) Those exposed to a demonstration could experi- 
ence positive disconfirmation (“better than expected") at 
first use, and (2) those not exposed to a demonstration could 
experience negative disconfirmation ("worse than 
expected"). Formally, 


H5: (a) Novices who experience a product demonstration 
report more positive disconfirmation of complexity 
expectations (better-than-expected ease of use) after first 
use of the innovation than novices who do not experience 
a demonstration, whereas (b) product demonstration does 
not affect expert consumers' disconfirmation of complex- 


ity expectations. 


Step 3: The Influence of Disconfirmation on 
Experienced Emotion 


However, it would be hasty (and possibly ill-advised) to 
assume that support for H, and H, suggests that in-store 
product demonstrations are a bad idea, having no influence 
on experts and a negative influence on novices. Because of 
the relationship between disconfirmation of expectations 
and emotion, the potential negative influence of product 
demonstrations on novices may ultimately lead to more 
positive results. By depressing expectations of an easy 
experience, novices who receive a demonstration may sub- 
sequently be positively surprised by the actual usage experi- 
ence, seeing it as “not so bad after all,” a situation that can 
create positive emotions. The E? model suggests that con- 
sumers who experience different disconfirmation of their 
usage expectations have different emotional responses. 


Why do consumers experience emotions in using com- 
plex products? When people are confronted with a learning 
task, they develop goals and then monitor progress toward 
those goals. Emotions are mechanisms that communicate 
important information relative to expected progress toward 
goal achievement (Bagozzi, Gopinath, and Nyer 1999; 
Luce, Bettman, and Payne 2001); as such, they are likely to 
occur spontaneously when consumers first use a new, com- 
plex product. Specifically, negative emotions occur when 


expected progress toward the activated goal is impeded, and 
positive emotions occur when expected progress toward the 
activated goal is accelerated or when the goal is attained 
(Carver and Scheier 1990; Luce, Bettman, and Payne 2001). 

Although consumption emotions can be driven by 
actual product performance (Oliver 1993; Phillips and 
Baumgartner 2002), research also suggests that consump- 
tion emotions are a function of disconfirmation of expecta- 
tions (Westbrook and Oliver 1991). In the introduction of 
relatively noninnovative new products (e.g., batteries, hotels 
[Rust et al. 1999; Voss, Parasuraman, and Grewal 1998]) or 
in domains in which consumers have experience (e.g., cur- 
rent subscribers of an interactive television entertainment 
service [Bolton and Lemon 1999]), consumers may be able 
to generate accurate usage expectations, and disconfirma- 
tion is not likely to occur. Although uncertainty remains 
about the quality of the consumption experiences in these 
cases, the consumer has little uncertainty about tbe diffi- 
culty in consuming the product. These situations can be 
contrasted with those in which consumers are actively 
learning the new category (e.g., working with new self- 
service technologies [Meuter et al 2005], participating in 
coproduction [Bendapudi and Leone 2003]) in which real 
gaps are likely to exist between consumers' expectations of 
the ease/difficulty of goal attainment and their actual 
experience. Appraisal of progress, especially given unex- 
pected difficulty or ease, typically generates an emotional 
response (Smith and Ellsworth 1985). Thus, we propose the 
following: 


H4: Positive disconfirmation of complexity expectations is 
correlated with positive emotion during use, and negative 
disconfirmation is associated with negative emotion dur- 
ing use. 


Step 4: Influence of Emotion on the Evaluation of 
Innovative Products 


Can emotions affect product evaluations? It could be 
reasonably argued that emotions that arise from "tempo- 
rary" learning or trial periods are unlikely to influence more 
long-lasting product evaluations. However, recent research 
in emotion shows that affective influence is often stronger 
and more far-reaching than previously considered (e.g., 
Bagozzi, Gopinath, and Nyer 1999). Several recent articles 
have shown the significant link between consumption emo- 
tions and satisfaction (Oliver 1993; Phillips and Baumgart- 
ner 2002); most empirical findings indicate that both posi- 
tive and negative emotions influence satisfaction in their 
respective directions (Schwarz and Clore 1983). Yet demon- 
strations of emotional influence on product evaluation have 
been mixed. In a longitudinal study, Dubé and Morgan 
(1996) demonstrate that retrospective measures of emotion 
do not influence product—or, in their case, service— 
evaluation. 

Although we posit that the addition of emotional 
responses benefits traditional models of diffusion, the oppo- 
site could also be suggested. A reasonable argument may be 
that emotions result from the adoption process but that their 
effect on evaluations is negligible compared with traditional 
models that capture both net benefits and product complex- 
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ity. As we noted, net-benefit models have a strong. track 
record of effectively predicting innovation evaluations, 
sometimes accounting for more than 50% of explained eval- 
uative variance (e.g., Moreau, Lehmann, and Markman 
2001). 

We hypothesize that because emotions result from dis- 
confirmation of usage expectations, they act as important 
signals for the consumer regarding his or her own experi- 
ence. This hypothesis is akin to other examples of “affect as 
information” in which evaluations of products are influ- 
enced by situational mood states (Pham 1998). This emo- 
tional influence may operate in addition to other sources of 
emotion, such as the utility derived from experienced bene- 
fits. Because usage expectations are different from con- 
sumers’ expectations about the product’s ultimate benefits, 
focusing instead on early usage experience (e.g., the rate at 
which they can achieve those benefits, the ease of learning, 
and their own affective response), the emotions derived 
from usage expectations may be distinct from emotions 
experienced from gaining product benefits. 


The changing influence of emotions over time. It is 
important to consider the potentially dynamic influence of 
emotion at distinct periods in the adoption process. 
Whereas the dominant satisfaction paradigm assumes that 
consumers use static standards to make satisfaction judg- 
ments, Fournier and Mick (1999) find strong evidence that 
standards are often malleable. In the early stages of product 
learning, we also expect to observe dynamic changes in 
usage difficulty expectations as consumers gain direct 
experience with the new product and update their priors 
accordingly. In particular, inexperienced consumers, who 
may have been surprised by the difficulty or ease of their 
initial usage experiences, are likely to use their newly 
acquired direct experience to adjust their expectations for 
their next use. Although the adjustment may not be accurate 
(e.g., these consumers may either over- or underadjust), 
basic tenets of learning hold that over time and experience, 
usage expectations should stabilize. Over time, we expect 
emotional influence to be the greatest during the early 
learning stages of product usage, when the disconfirmation 
gap between expectations and actual experience is at its 
largest; then, as learning occurs, we expect this gap to 
shrink, thus decreasing the intensity of the emotions and 
their correlation with evaluations. Formally, 


H4: Both positive and negative emotions (a) are significant 
predictors of evaluation at the initial introduction of the 
new product (before first use), (b) are significant predic- 
tors of evaluation at the first use of the new product, and 
(c) are not significant predictors of evaluation after 
repeated use of the product. 


Shih and Venkatesh (2004) call specifically for experi- 
mental and longitudinal work to address usage diffusion 
elements. In Study 1, we use a controlled laboratory setting 
to test our hypotheses. In Study 2, we use a quasi experi- 
ment over a three-month period to explore these hypotheses 
further in a real-world setting. 
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Study 1 


Methods 


Choice of innovation stimuli. Three requirements 
guided our selection of a new product: (1) It had not been 
used by the participant population, (2) there was natural 
variation in prior experience with a previous version, and 
(3) learning was necessary to achieve basic performance 
goals. We selected a new personal digital assistant (PDA), 
the Palm Zire—specifically, the use of the handwriting- 
recognition software. The Palm Zire offers both a calendar 
function and an address function, and to use each of these 
functions, users must learn the proprietary Graffiti 
handwriting-recognition software.4 Learning to use the 
Graffiti software was chosen as a task that was both com- 
plex and fundamental to the operation of the new product 
and that could be viably learned and tested within the 
parameters of the laboratory setting. 


Design and procedure. The study was a 2 (product 
demonstration: no versus yes) X 2 (prior PDA experience: 
no versus yes) between-subjects design. Participants were 
175 undergraduate students at a large midwestern university 
who participated in the hour-long study for course credit. 
After a brief overview of PDAs and the two required tasks, 
each participant was given the experimental materials that 
detailed how to use the Graffiti characters. Next, partici- 
pants in the randomly assigned “demonstration” condition 
received the demonstration provided by the same experi- 
menter using the same script.5 

All participants then read additional details about the 
Palm Zire and answered the preusage dependent measures. 
Next, the experimenter delivered a Zire to each workstation. 
Participants were given step-by-step instructions and asked 
to complete the first task of entering an appointment. Par- 
ticipants showed completed work to the experimenter and 
answered the next set of dependent measures. After this, the 
participants proceeded to the second task, namely, entering 
an address. Participants then answered a final set of depen- 
dent measures; before leaving, they were thanked and 
debriefed. 


Independent and Dependent Measures 


Prior category experience. Participants indicated their 
experience level with PDAs on a seven-point scale. Approx- 
imately half indicated no experience or a very low level 
(i.e., a 1); thus, we used a median split to classify partici- 
pants as inexperienced versus experienced. 





4There is an alternate way to enter information using a simu- 
lated keyboard, but the experimental instructions required partici- 
pants to use the Graffiti software. 

SIn the demonstration, the experimenter held up the Zire, 
showed the participants where the stylus was located, and indi- 
cated how to tap the main menu buttons to find the appropriate 
tasks. The experimenter then put a transparency of the sheet detail- 
ing the Graffiti characters on an overhead projector and briefly 
demonstrated how and where to draw the letters on the screen of 
the Zire. 


Product-related expectations. With the exception of 
questions on prior PDA usage and demographics, we 
employed the same set of dependent measures at each of the 
three points of interest during the experiment. To allow par- 
ticipants to communicate any type of product-related expec- 
tation, we employed an open-ended written protocol both 
before and after product trial (“As you anticipate using the 
Palm Zire PDA to schedule an appointment, please describe 
the expectations that you have about your first experience 
using the Palm Zire PDA"). An examination of the 
responses indicated that almost all contained information on 
the anticipation of and reactions to the Zire's ease/difficulty 
of use. Two coders, who were blind to the condition of the 
participants, coded the written responses on two seven- 
point scales that captured expectations of difficulty in learn- 
ing and using. After each subsequent use, the coder rated 
the written protocols on two additional scales that captured 
how the experience compared with expectations (much 
more difficult to learn, much more difficult to use). The 
coders’ reliability was high (correlation on all scales 7.74); 
thus, we averaged their responses. Furthermore, the correla- 
tion between each pair of items was also high (each pair 
7.88); thus, we averaged each pair of items to form a mea- 
sure of usage expectations for Time 1 and a measure of dis- 
confirmation of usage expectations for Times 2 and 3. We 
combined the coded measures for Time 1 with the tradi- 
tional complexity measures that Rogers (1996) used (we 
describe this further in the next section) to form an overall 
expectation score. 


Complexity expectations. Following Rogers's (1996) 
definition of complexity, participants indicated on three 
seven-point scales how difficult they expected/perceived the 
Zire to be to use, how long it would take to learn to use, and 
how much of a challenge it would be to use (at each period, 
œ > .90); higher numbers indicated greater expected diffi- 
culty. At Time 1 (preusage), we combined these three self- 
report measures with the two items derived from the written 
protocols to create an overall index of expected usage diffi- 
culty (a = .92). 


Disconfirmation of complexity expectations. A standard 
method for assessing disconfirmation is to measure partici- 
pants' retrospective assessments of whether the product or 
service was better or worse than expected (e.g., Oliver . 
1993; Phillips and Baumgartner 2002). Effectively, our cod- 
ing of the written protocols at Times 2 and 3 captures a 
similar appraisal of this comparison in a more open-ended 
manner. In addition, we subtracted usage expectations at 
time t from usage expectations at time t + 1. We combined 
this difference with the two coded items. 


Emotions. 'To assess participants' emotions at each stage 
in the adoption process, we employed a modified version of 
Izard's (1977) differential emotions scale (DES-II); we bal- 
anced the scale's positive and negative items as Richins 
(1997) suggests. Similar to previous consumption emotion 
research (e.g., Phillips and Baumgartner 2002), we found a 
two-factor solution at each point in time using the modified 
scale, with the two factors representing the positive and 
negative emotions. We combined the seven positive and 
seven negative items to create a positive and a negative 
emotion variable for each point in time (see Figure 2). 

Product evaluations. We used a five-item scale with 
seven-point items to assess participants' evaluations of the 
Zire at each of the three points in time ("As an overall 
evaluation of this particular PDA, I think I will ‘dislike it 
very much/like it very much’”; “I think this particular PDA 
and its features are good": "strongly disagree/strongly 
agree"; “I am very pleased that I will have to try this par- 
ticular PDA in this study": "strongly disagree/strongly 
agree"; "I would like to use this PDA on a regular basis": 
"strongly disagree/strongly agree"; “Would you recommend 
this PDA to your friends?”: "definitely would not/definitely 
would"). At each point in time, all five items loaded on a 
single factor, and thus we summed the five items to form a 
single evaluation measure for each point in time (all three 
a > .89). 


Covariates. We included compatibility and net benefits 
as covariates that have been shown to influence new prod- 
uct success. In line with the work of Rogers (1996), we 


FIGURE 2 
Emotion Response Scale (Modified from Izard’s DES Il Checklist) 
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measured compatibility with three summed scale items to 
indicate how significantly participants would need to 
change the way they currently schedule appointments, keep 
. track of addresses, and manage their information (at each 
period, a > .74). Two scale items measured net benefits of 
the Zire (the perceived functional benefits less associated 
risks), and two other scale items measured participants’ 
evaluations of the Zire’s appointment and address functions. 
The four items loaded on the same factor (a > .78), and we 
summed them at each of the three points in time. 


Results 


The factors influencing initial complexity expectations. 
We used a two-way analysis of variance (ANOVA), with 
experience and demonstration as the independent factors, to 
test H,. The results indicate both a main effect of experi- 
ence (F(1, 172) = 19.62, p < .0001) and the predicted inter- 
action between the two factors (F(1, 172) = 5.21, p < .02). 
In support of Hia inexperienced participants who received 
the demonstration had significantly higher expectations of 
difficulty than those who did not (F(1, 172) = 3.96, p « .05; 
contrast: Mgemo = 20.9 versus Mio demo = 18.5). However, 
the demonstration did not significantly influence those with 
prior experience (F(1, 172) = 1.83, p > .10; Mgemo = 14.8 
versus Mi, demo = 16.6), in support of H),. 


The disconfirmation of complexity expectations at first 
use. The negative expectations that the product demonstra- 
tion created for inexperienced consumers may not be ulti- 
mately detrimental to product evaluations. With the mea- 
sure of disconfirmation as the dependent variable, we used a 
two-way ANOVA to test H}, which revealed the predicted 
interaction (F(1, 172) = 14.23, p < .001; see Figure 3, Panel 
A). Inexperienced participants who received the demonstra- 
tion experienced the least negative disconfirmation of 
expectations (i.e., the most positive disconfirmation, finding 
the Zire easier to use than expected) compared with their 
counterparts who did not receive the demonstration 
(F(1, 172) = 22.53, p < .0001; contrast: Mao demo = 4.5 ver- 
sus Maemo = —2.6, p < .0001), in support of H,,. Again, the 
demonstration did not significantly influence the degree of 
disconfirmation for those with prior experience (F(1, 172) = 
1.09, p > .10; Maemo = 4.2 versus Mao demo = 2.3), in sup- 
port of H»,. 


Impact of disconfirmation on experienced emotion and 
evaluation at first use. If disconfirmation influences emo- 
tion (and subsequently product evaluation), we would 
expect to observe an experience x demonstration interaction 
for emotion that mirrors the pattern tested in H,. The pre- 
dicted interaction is significant (F(1, 170) = 4.90, p = .02; 
Figure 3, Panel B). Experienced participants’ emotions 
were not influenced by the product demonstration, yet inex- 
perienced participants were strongly influenced by the 
demonstration; those who had. a demonstration reported 
more positive emotion (Ma. = 32) than those who did not 
have a demonstration (Myo demo= 27). 

Another way to test H4 is through regression analysis by 
examining the comparative influence of complexity expec- 
tations on positive and negative emotion at all three periods 
(before use, at first use, and after repeated use). At each 
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point in time, we ran two regressions with positive and 
negative emotions as the dependent variables. Predictors 
included the two independent factors (experience and 
demonstration), their interaction, the measure of usage 
expectations, and the covariates (compatibility and net 
benefits). At Times 2 and 3, we also included as predictors 
the disconfirmation of usage expectations. Perceptions of 
the Zire's usage difficulty and its net benefits significantly 
predicted emotions at each point in time. As Table 1 shows, 
complexity is a significant driver of emotion at Time 1 
(Bpositive z—31, p< .001; B negative 2t); p< .05). Before 
use, net benefits influence only positive emotion (B = .18, 
p < .01), not negative emotion. Importantly, this pattern also 
holds at Time 2 and Time 3, augmenting the support for H3. 

similar to Ha, we test H4, . with regressions at each 
period. For each period, we ran a regression using evalua- 
tions of the Zire as the dependent measure. Predictors 
included the two independent factors, their interaction, the 
measures of usage expectations, the covariates (compati- 
bility and net benefits), and positive and negative emotions. 
At Times 2 and 3, we also included as predictors the dis- 
confirmation of complexity expectations. Before first use, 
positive emotion was a significant driver of evaluation (B = 
41, p « .001), as were net benefits (B = .27, p < .001; see 
Table 2). However, negative emotions were not significant 
predictors of evaluation (B = —.10, p > .05). This result is 
notable, and we discuss it further in Study 2. After first use, 
both positive (B = .33, p < .001) and negative (B = —22, p < 
.01) emotions were significant drivers of product evalua- 
tions, as were net benefits (B = .25, p « .001). Finally, after 
repeated usage, positive (B — .47, p « .001) and negative 
(B = —24, p < .001) emotions were still both significant dri- 
vers of product evaluation, as were net benefits (B = .25, 
p < .001). Thus, the influence of emotion can be observed at 
all points in time, in support of H4, and H4, but not Hy. 


Discusslon 


The results from Study 1 demonstrate that a consumer's 
expectations about the difficulty he or she will experience in 
using a new product can be influenced by both marketer- 
directed communications and by his or her own prior 
experience. These usage expectations and their (dis)confir- 
mation were important factors that predicted emotions and 
drove their changes over the course of the usage experience. 
In turn, emotions predicted product evaluations at all points 
in time, even after we controlled for the other diffusion 
variables. 

Note that only positive emotions predicted evaluations 
toward the Zire before its use, whereas both positive and 
negative emotions influenced evaluations after use. Before 
use, any threats to expected goal progress were only hypo- 
thetical; perhaps the negative emotions generated before use 
of the Zire were not substantial enough to influence evalua- 
tions. Only when the expected progress was actually 
impeded did negative emotions influence evaluation. This 
phenomenon might be conceptualized as a “suspension of 
judgment," in which the consumer holds doubts or negative 
feelings at bay in the initial assessment of the product. In 
other words, the consumer may experience negative emo- 


FIGURE 3 
Expectation Disconfirmation, Experlenced Emotions, and Product Evaluation After First Use 
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tions when considering the learning/usage difficulties but 
may not incorporate these initial emotions in evaluating the 
product. This is a notable and somewhat counterintuitive 
finding, given that consumers pay disproportionately more 
attention to negative attribute performance in. postuse prod- 
uct evaluations (Mittal, Ross, and Baldasare 1998). 


Innovation Continuity and 
Complexity Expectations 
In Study 1, the complexity expectations of the inexperi- 
enced consumers were the most influenced by the product 
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aL arger numbers indicate greater negative disconfirmation (worse-than-expected usage complexity). 


demonstration. As such, these consumers were subject to 
the greatest amount of disconfirmation and, consequently, 
produced greater emotional reactions to the Zire. In this 
population, inexperienced participants had no experience 
with the PDA category. Discontinuous innovations have 
been defined as those.that do not fit neatly into an existing 
category (Lehmann 1994), and for consumers with no prior 
PDA experience, tbere is likely to be no substantial PDA 
category representation existing in memory. For consumers 
with PDA experience, however, such a representation 
exists. As such, the Zire could be considered a discontinu- 
ous innovation for those without PDA experience but a con- 
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TABLE 1 
Study 1: Predictors of Emotions (Standardized Estimates) 





Emotions 
Before Use After First Use After Second Use 

Predictors Positive Negative Positive Negative Positive Negative 
Prior PDA experience .03 —.12 .08 -.10 .07 —.02. 
Product demonstration .10 —.04 .01 .01 .18* —.07 
Experience x demo —.07 .01 —.04 —.04 —.12 .06 
Product-Related Expectatlons 

Net benefits .18** .10 CY idola —.08 .16* —.04 

Compatibility —.08 —.11 —.11 —.04 —.01 —.04 

Usage (difficulty) -.31"™ .20* —.52*** .88"** .42"*** .68*** 

Disconfirmation of usage expectations .04 .19* —17* .07 
R? .10 .12 33 .59 .31 .54 
“p< 05. 
**p < ot 
“9 < .001. 

TABLE 2 
Studies 1 and 2: Predictors of Evaluations of the Innovation (Standardized Estimates) 
Evaluations 
Before Use After First Use After Second Use 

Predictors Study 1 Study 2 Study 1 Study 2 Study 1 Study 2 
Prior experience —.06 13 .01 13 —.07 —.01 
Product demonstration .04 .01 —.01 .21 —.01 .01 
Experience x demo —.04 —.04 03 —.28 —.01 —.04 
Product-Related Expectatlons 

Net benefits en .36""" 20: 13 M cda 98g*** 

Compatibility .08 .02 .04 07 —.01 .18 

Usage (difficulty) —.16* —.18* —.23* —.51"* —.20"** —.21* 

Disconfirmation of usage expectations —.16* .36* .05 —.03 
Consumption Emotlons 

Positive A17 o gage go A7*** 2" 

Negative —.10 —.08 —.22"* — 24* —.24*** —.11 
Re .33 52 .65 51 .71 .46 
*p « .05. 
**p < .01 
***p < 001. 


tinuous innovation for those with it. Because the literature 
has documented the effect of continuity on responses to 
new products (e.g., Moreau, Lehmann, and Markman 2001; 
Rogers 1996), we designed Study 2 to control for the conti- 
nuity of the innovation, while assessing differences in prod- 


uct experience. 


likely to be smaller. In these situations, all consumers are 
likely to have some direct experience with existing products 
in the category, but the level may vary across consumers. 
Although their existing knowledge may not be perfectly 
calibrated, it is likely to provide a more accurate prediction 


With a discontinuous innovation, the magnitude of the 


of the actual usage experience than if it were unavailable. In 
Study 2, we test this boundary condition to determine 


gap between expected and actual goal progress is likely to 
be significant because the consumer has no direct category 
experience on which to base expectations (Oliver and Winer 
1987). As a result, the emotions that ensue are likely to be 
more powerful and predictive of overall evaluations. How- 
ever, with a continuous innovation, the size of the gap is 
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whether the E? model (H,-H4) holds for a continuous 


innovation. 


Study 1 also involved the typical level of artificiality 
that is characteristic of lab studies, and its controlled nature 
dictated an accelerated usage schedule. As such, partici- 
pants had no discretion over the timing of the use of the 


product. In both academic (e.g., Rogers 1996) and 
practitioner-oriented (e.g., The Economist 2004) literature, 
there is evidence that the expected usage difficulty of a 
product causes consumers to delay a purchase. Thus, we 
might expect differential complexity expectations to affect 
usage timing even in situations in which the consumer does 
not make a purchase decision, such as with'a forced adop- 
tion. Formally, 


H5: Expectations of complexity are Po correlated with 
the delay in initial usage. 


In Study 2, we use a quasi-experimental methodology 
over an extended period to test H,-Hs in the context of a 
continuous innovation. Whereas we predict that complexity 
expectations are less influential for a continuous innovation, 
we expect (and demonstrate) similarities between Study 1 
and Study 2 in the factors that influence overall evaluations. 
As we observe, the pattern of emotions predicting evalua- 
tions of the discontinuous innovation at alli three times is 
almost identical to the pattern predicting evaluations of the 
continuous innovation. 


Study 2 


l 

Method 

As in Study 1, several criteria guided our selection of 
stimuli: (1) The product had to be a continuous innovation 
for all participants, with natural variation in usage experi- 
ence among the set, and (2) the product had to require some 
learning for all participants to achieve its basic performance 
goals. We selected PageOut, a new course management 
software program from McGraw-Hill. A pretest confirmed 
that there was variance in participants’ experience in using 
online course management tools but that all potential (stu- 
dent) participants were highly familiar with different types 
of software, with Web page interfaces, and with computers. 
As a cover story, students were told that they had been vol- 
unteered to beta-test the software and would be asked to 
assess PageOut three times throughout the semester. 


Design and Protocol 


Undergraduates who were enrolled in three sections of an 
introductory marketing course participated in this 2 (high 
versus low experience) x 2 (demonstration versus no 
demonstration) between-subjects study throughout the 
semester as part of their curriculum. Although 123 students 
were enrolled in the three sections, 17 failed to complete all 
aspects of the study and were subsequently! deleted from 
further analyses. The students were not randomly assigned 
to a section, but the course prerequisites, the professor, the 
class format, and the course content were ‘held constant 
across all three sections. 

On the first day of class, the professor introduced Page- 
Out to the students in all three sections using a consistent 
script and provided all students with the same handout con- 
taining step-by-step instructions. In one randomly selected 
section, the professor used the computer podium to log on 
to the Web site and to demonstrate how to create a student 
profile. The other two sections received just the verbal 

I 


introduction and the instructional handout for this task. This 
operationalization of the demonstration differed from Study 
| in that it used a different medium (computer), used a dif- 
ferent instructional material for the Web site’s use, and was 
conducted in a group rather than individually. 


Before use (Time 1). On the first day of class, immedi- 
ately following the introduction, students reported the emo- 
tions they experienced while the professor was introducing 
PageOut. Students then completed demographic and experi- 
ence measures, followed by the dependent measures. 
Finally, students were given an assignment to log on to 
PageOut and register by the next class. A small extra credit 
incentive was provided. The administrators’ software 
enabled the professor to record students’ log-on and regis- 
tration times. 

After the first use (Time 2). On the second day of class, 
students reported the emotions they felt while logging on 
and registering with PageOut and responded to the depen- 
dent measures. 


After extended usage (Time 3). Over the course of the 
semester, students used PageOut to take online quizzes, 
check exam grades, download class notes, and communi- 
cate with the professor. At a mandatory class meeting 
shortly before the end of the semester, students reported 
their current emotions and completed the final set of depen- 
dent measures. 


Dependent Varlables 


Expectations, emotions, and evaluations. Measures of 
product-related expectations, emotions, and evaluations 
were similar to those used in Study 1, with one exception. 
In an effort to minimize the time required to collect the 
data, no open-ended protocols for usage expectations were 
employed. 


Log-on time. As we noted previously, the administra- 
tors’ software enabled the professor to track the time that 
elapsed from the time each student left class to the time that 
he or she registered on the Web site. 


Prior experience. Experience with online course man- 
agement tools was measured using two items. A median 
split identified students as experienced or inexperienced. 


Results 


Initial usage expectations and timing of initial use. Hs 
predicted that higher expectations of usage difficulty would 
be associated with delays in product usage. The simple cor- 
relation between participants’ perceptions of usage diffi- 
culty and the hours they allowed to elapsé before their ini- 
tial registration suggests that such a mechanism was at work 
(r 2 .25, p < .01). 


The influence of experience and demonstration on ini- 
tial complexity expectations. We tested H; using a two-way 
ANOVA. As in Study 1, there was a significant main effect 
of experience (F(1, 102) = 7.30, p < .01). The pattern of 
data was consistent with the first study; participants with 
more experience with course management software 
expected their initial use of the PageOut software to be less 
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difficult than did those with less experience (Mexpenenced = 
4.9 versus Minexpenenced = 9.8). 


The accuracy of complexity expectations for a continu- 
ous innovation. After participants logged on and registered, 
how did their initial usage expectations compare with their 
actual experience? Given the continuous nature of the inno- 
vation, we expected that H5 would be attenuated in this 
study. Accordingly, in contrast to the findings in Study 1, 
there were no significant effects of prior experience or 
demonstration on the degree of disconfirmation of complex- 
ity expectations. The means for all four groups were close 
to zero, indicating that participants were relatively well cali- 
brated in their expectations. 


Consumption emotions and product evaluations. As we 
expected, given participants’ greater accuracy in complexity 
expectations, disconfirmation has a much smaller influence 
on experienced emotion at each of the three periods. We 
replicated the experience x demonstration interaction for 
felt emotion at first use observed in Study 1 (using stan- 
dardized measures, B = —21, p « .05, r = .19). Less- 
experienced participants experienced more positive emotion 
at first use when they received a demonstration (M = .31) 
than when they did not receive one (M = —24), whereas 
more-experienced participants were not affected similarly 
by the demonstration (Mac, = —12 versus Mao demo = .13). 
However, as we expected with the small degree of discon- 
firmation, the experience x demonstration interaction for 
Time 2 evaluations of the innovation was not significant 
(F = .67, p > 2). 

There are dramatic similarities between the Study 1 and 
the Study 2 findings in the factors that influence overall 
evaluations. The pattern of results that predict evaluations 
of the Zire across time is almost identical to that of PageOut 
(Table 2). In both studies, only positive emotions predict 
evaluations before use, which might be interpreted as an ini- 
tial suspension of judgment during the product introduction 
such that negative emotions, though tbey are experienced, 
are held at bay. However, both positive and negative emo- 
tions significantly influenced evaluations after use in both 
studies, in support of H4. Finally, H4. predicted that the 
influence of emotions would wane over time. Unlike Study 
1, in which the impact of emotions remained strong during 
the duration of the one-hour session, the impact of emotions 
declined over the semester for PageOut, in support of Hy4,. 
Taken together, these findings suggest that the diminishing 
influence of emotions can be observed only over longer 
time horizons. 

Study 2 demonstrates that the pattern of results from 
Study 1 holds in a real-world quasi experiment but that 
emotional influence is not as extreme when the product is 
less innovative. However, even when emotional influence 
was smaller as a result of more accurate complexity expec- 
tations, the inclusion of emotion still increased the predic- 
tive power of the traditional diffusion model. Finally, this 


6To assess this contribution, we compared the explanatory 
power of the model shown in Table 3, Panel B, with that of a 
reduced model that does not include the two emotion variables 
(R2 = .38). The resulting F statistic (F(2, 95) = 9.13, p « .01) 
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study shows that complexity slows consumers' time to trial 


(Hs). 


General Discussion 


Theoretical Contribution 


The E? model. By demonstrating how affective 
responses during new product learning influence innovation 
evaluations over time, we extend previous work in both sat- 
isfaction and diffusion. The E? model provides a parsimo- 
nious description of how emotions are created in the early 
use of complex products and how they may influence prod- 
uct evaluations beyond diffusion's traditional focus on net 
benefits. These results emphasize the dynamic nature of 
both emotion and evaluation in innovation diffusion. 


Adding emotion to innovation frameworks. Refining 
models of innovation is easily justified by the importance of 
new product success to company profitability. Prior 
research on new product adoption focuses largely on cogni- 
tive processes, despite the increasing knowledge of the role 
of emotion in product consumption. We demonstrate that 
emotions resulting from early experience with the product 
(which therefore might be dismissed as ephemeral) can 
have a lasting influence on diffusion.? 


Exploring and expanding recent innovation research. 
The use-diffusion model (Shih and Venkatesh 2004) 
describes four patterns of consumer use of new products 
based on the rate and variety of consumer use. Shih and 
Venkatesh (2004) theorize that usage diffusion depends on 
(1) characteristics of the individual or household (e.g., inno- 
vativeness, resources, prior experience), (2) characteristics 
of the innovation (e.g., complexity, technological sophisti- 
cation, existence of complementary products), and (3) char- 
acteristics of the environment (e.g., communication inten- 
sity, technological access, media exposure). The current 
research expands the knowledge of how these factors may 
interact by examining how the characteristics of the envi- 
ronment (how the innovation is promoted to the consumer), 
the product (complexity), and the consumer (prior exper- 
tise) combine to create a successful or unsuccessful initial 
product experience. 

In addition, the importance of the phenomena we report 
herein is reflected in recent work on feature fatigue. 
Thompson, Hamilton, and Rust (2005) show that con- 
sumers assign more weight to product capabilities (addi- 


demonstrates that the addition of emotions to the standard model 
of preference makes a significant contribution. 

TResearchers have long advocated the addition of affective ele- 
ments to primarily utilitarian-based models of product evaluation 
(Derbaix and Pham 1991). However, many researchers view emo- 
tion as another viable end-state benefit similar to utilitarian bene- 
fits. For example, Havlena and Holbrook (1986) note that con- 
sumers might buy a luxury car for the pleasure of feeling superior. 
We do not investigate emotions as independent benefits in this 
Work but rather as a moderator of product learning and acceptance. 
Although emotions can color utilitarian attribute evaluation (Smith 
and Ellsworth 1985), they can also have a more global influence 
on product experiences (Cohen and Areni 1991). 


| 

tional features) in their evaluations before use than after use 
and less weight to product usability before use than after 
use. Thus, they show that consumers are often miscalibrated 
in their expectations about product complexity and usability 
before use. Our research shows that inaccurate and overcon- 
fident expectations about usability (e.g., when consumers 
seek excessive features in prospect) may create situations in 
which negative emotions have a powerful influence on 
product evaluations and, thus, in retrospect, ‘satisfaction. 


Managerial Implications | 


Our results show that the predictive pow | of traditional 
models of innovation adoption can be improved by the 
addition of emotional tesponses. Importantly, a substantial 
proportion of consumers' emotional responses appear to be 
a function of disconfirmed complexity expectations, not 
simply a result of product benefits. This tells marketers two 
things. First, emotional influence can be assessed by mea- 
suring disconfirmation of usage expectations, and second, 
emotional responses can be influenced by! changing con- 
sumers' complexity expectations before use. As we noted 
previously, marketers can influence complexity expecta- 
tions when they show product use in commercials, describe 
usage instructions in online formats, or use other high- 
involvement sales channels (e.g., sales re S). We tested 
product demonstration as one way to ien expecta- 
tions. Product demonstrations are believed to be beneficial 
to both marketers and consumers but expensive to execute, 
and marketers do not have a clear idea of how they ‘truly 
affect consumers (Ailloni-Charas 2000). Our research sug- 
gests that training frontline sales personnel i$ important; the 
salesperson may be tempted to insist that a technological 
product is easy to learn to close the sale, yet the misman- 
agement of usage expectations can lead to high rates of 
product return in the long run. Promising “easy” use may 
promote trial but sabotage product evaluation and diffusion. 
Thus, marketers must seek a balance in which con- 
sumers are encouraged to try new products but not through 
promises of ease that create unrealistic ej pectations for 
early use. For example, we can compare the nt strategic 
actions of Royal Philips and Best Buy to observe two oppo- 
site outcomes of influencing consumers’ complexity expec- 
tations. Best Buy’s Geek Squad (currently i in 642 stores) is 
a team of professionals, attired in stereotypically nerdy uni- 
forms (black polyester pants, white shirts, and clip-on ties), 
who make house visits to help consumers learn complicated 
product tasks, such as how to program an iPod or connect a 
TiVo system (Deam 2004). Rather than dównplaying the 
complexity of products sold in the store, Best Buy leverages 
these perceptions by offering (for a fee) a way to shift the 
learning costs to a team of prototypically smart people. Not 
only does this generate revenue, but our research would pre- 
dict that the Geek Squad also sends a sign i albeit humor- 


ous, to consumers that product learning is not always easy; 
with such a signal, consumers may subsequently experience 
little negative disconfirmation in early usage and be posi- 
tively surprised if they successfully achieve product use on 
their own. Conversely, Philips’s strategy of promoting itself 
as a provider of user-friendly gadgets (The Economist 2004) 
may be less optimal for the company. By launching a $100 
million ad campaign to promote its technology as “easy to 
use” (Endt 2004), Philips may lower consumers’ preuse 
complexity expectations and subsequently strengthen the 
negative emotions that occur in the early learning phase of 
use. Most learning requires some setbacks, and reducing 
consumers’ expectations of setbacks may have a negative 
impact on long-term product and brand evaluations. 


Limitations and Further Research 


A limitation of the current research is our use of forced 
adoption in the two studies. Participants in both studies did 
not choose to adopt the new products, though they had an 
outlet to report their opinions and evaluations at multiple 
points in the usage period. Although this situation was cho- 
sen for practical reasons, on reflection, we question whether 
forced adoption has a positive side in offering a better and 
more conservative test of the influence of complexity 
expectations and emotion. When consumers choose to try/ 
buy a product innovation, there are certain emotional and 
evaluative processes that arise solely from the evaluation of 
the choice, such as cognitive dissonance. Dissonance may 
amplify emotions that arise from disconfirmation of usage 
expectations. Thus, emotional responses may be heightened 
in voluntary adoption, creating even stronger influence of 
emotions on innovation evaluations. 

There are many other diverse avenues for further 
research suggested by our results. First, the E? model advo- 
cates the inclusion of both complexity expectations and 
emotions (positive and negative) in existing frameworks of 
evaluation, satisfaction, and diffusion. Second, these results, 
combined with work on regret (e.g., Tsiros and Mittal 200), 
begin to build a more complete picture of negative emotion 
after a choice is made. Luce, Bettman, and Payne (2001) 
show the importance of negative emotion as consumers 
contemplate choice. The act of making a choice is not the 
end of emotional influence. Consumer expectations for the 
usage or learning process may have a surprisingly strong 
impact, influencing regret over product choice dissatisfac- 
tion not only with product benefits but also with the costs to 
achieve those benefits in a timely manner. Can complex 
products make consumers feel smart because of unexpected 
hurdles overcome or stupid because of unexpected slowness 
in achievement? More important, can such emotional reac- 
tions influence the evaluation of the innovation beyond 
more concrete evaluative criteria? The current research sug- 
gests that they can. 
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Are All Smiles Created Equal? How 
Emotional Contagion and Emotional 
Labor Affect Service Relationships 


In this study, the authors examine the effects of two facets of employee emotions on customers’ assessments of 
service encounters. Drawing on emotional contagion and emotional labor theories, they investigate the influence of 
the extent of service employees’ display of positive emotions and the authenticity of thelr emotional labor display 
on customers’ emotional states and, subsequently, on customers’ assessments of the service interaction and their 
relationship with the service provider. To test the study hypotheses, 223 consumers participated in a simulated 
service encounter in which actors played the roles of service employees. In a 2 x 2 factorial design, the employees 
varied both the extent of their smiling behavior and their emotional labor display by engaging in surface or deep 
acting. The results show that the authenticity of employees’ emotional labor display directly affects customers’ 
emotional states. However, contrary to expectations, the extent of employee smiling does not influence customer 
emotions, providing no support for the existence of primitive emotional contagion in service interactions. 


Furthermore, employee emotions exert an influence on customer outcomes that are of interest to marketers. 


customers is considered an essential part of both cus- 

tomers’ assessments of service quality and their relation- 
ship with the service provider (Bitner 1990; Gwinner, 
Gremler, and Bitner 1998; Parasuraman, Zeithaml, and 
Berry 1985). Despite the considerable amount of empirical 
research on service relationships and customer assessments 
of service quality, several aspects of the service interaction 
have remained unexplored. An area of particular interest is 
the role of emotions in service encounters. Although the 
notion of having a friendly service staff and providing “ser- 
vice with a smile” has become a generally unquestioned 
mantra for service firms, empirical research about how 
employees’ emotional states affect customers and their 


l n general, the interaction between service employees and 
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assessments of service encounters has emerged only in 
recent years. 

Two research streams that address the role of emotions 
in service encounters involve emotional contagion (Hat- 
field, Cacioppo, and Rapson 1994) and emotional labor 
(Hochschild 1983). “Emotional contagion” is defined as the 
flow of emotions from one person to another, with the 
receiver “catching” the emotions that the sender displays 
(Schoenewolf 1990). In the context of service interactions, 
emotional contagion creates a ripple effect of emotions 
from service employees to customers (Pugh 2001; Tsai and 
Huang 2002; Verbeke 1997). In other words, employees 
who smile at customers may be contagious, in that they 
change the affective state of customers and thus influence 
customers’ perceptions and evaluations of the service 
encounter. “Emotional labor” refers to service employees’ 
display of expected emotions as a self-regulatory process 
(Hochschild 1983). When displaying expected emotions to 
customers, employees can choose between two acting 
strategies, surface or deep acting, which differ mainly in 
their extent of authenticity (Grandey 2003). 

The purposes of this study are twofold. First, we 
attempt to extend marketing theory by developing and test- 
ing a model of how employee emotions affect customers. 
Building on emotional contagion and emotional labor 
theories, we develop a model that enables us to test the dif- 
ferential effects of two facets of employee emotions— 
employee smiling behavior and authenticity of the emo- 
tional labor display—on changes in customer affect in a 
service setting. We also examine key customer conse- 
quences of interest to service firms, including the impact of 
employee emotions on customer satisfaction, customer— 
employee rapport, and customers' future loyalty intentions, 
all of which constitute previously untested relationships. 
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A second contribution of this study is our use of an 
experimental research design that enables us to assess the 
cause-effect nature of emotional contagion processes more 
precisely. The extant services literature on emotional conta- 
gion has not answered the question whether customers 
really catch employee emotions. Responding to Pugh’s 
(2001, p. 1026) statement that "in the absente of a true 
experimental design, it is not possible to conclude that con- 
tagion alone is responsible [for the variation in customer 
consequences]," our research establishes such; causal rela- 
tionships and fills an important gap in thé customer- 
employee emotions literature. 

We first review the concepts that are cdntral to our 
study, namely, emotional contagion, emotional labor, and 
customer consequences of service interactions, and then we 
present a conceptual model and discuss the relationships 
among the model elements. Next, we describe the design of 
our experiment, which enables us to examine, the relation- 
ships of the conceptual model using a wo Nay repeated 
measures analysis of variance (ANOVA) o partial least 
squares (PLS) structural equation modeling. Finally, we 
present the findings and their implications |for services 
research and management. 


The Role of Emotions in Service 
Interactions 


Emotlonal Contagion | 

Research on emotional contagion attempts to explain how 
emotions are transmitted among people in social interac- 
tions and how “catching” another person’s emotions affects 
the dynamics of the social interaction. Emotional contagion 
can occur at both subconscious and conscious levels 
(Barsade 2002). That is, the process of emotional contagion 
can be attributed to people’s “tendency to automatically 
mimic and synchronize facial expressions, vocalizations, 
and movements with those of another person and, conse- 
quently, to converge emotionally” (Hatfield, Cacioppo, and 
Rapson 1994, p. 5) and to more conscious social compari- 
son processes between people (Barsade 2002) 

With “primitive emotional contagion” (Hatfield, 
Cacioppo, and Rapson 1994), the transfer of emotions from 
one person to another is the result of the receiver’s uncon- 
scious, emotive processes. This type of emotional contagion 
is driven by a two-step mimicry process, in which a person 
(1) spontaneously imitates another person’s Facial expres- 
sions and other nonverbal cues, which (2) leads the person 
to experience the corresponding emotions ugh physio- 
logical links. Although the person feels the emotions that 
result from mimicry, the processes that lead to this emotion 
are often “subconscious and automatic” (Barsade 2002, p. 
648). As a consequence, emotional contagion theories sug- 
gest that primitive emotional contagion is spurred by the 
extent to which the sender displays emotions; a greater 
emotional display by the sender results in higher levels of 
emotional contagion in the receiver. 

In contrast, “conscious emotional contagion” is based 
on social comparison processes in which people actively 
search for emotions as a type of social information (Salan- 


cik and Pfeffer 1978). This search activity is considered a 
fundamental human behavior, which grows particularly 
strong in situations perceived as ambiguous (Gump and 
Kulik 1997). Specifically, conscious emotional contagion 
theory argues that people compare their mood with another 
person’s mood and adopt the sender’s emotive level when it 
appears appropriate (Barsade 2002).! For example, in the 
absence of other social information, people visiting an 
attorney for the first time can be expected to observe the 
attorney's emotional display and then to adopt his or her 
emotions as a result of their desire to search for social infor- 
mation and reduce perceived ambiguity. Unlike primitive 
emotional contagion, conscious emotional contagion 1s 
determined less by the extent to which the sender displays 
emotions during an interaction (e.g., frequency of smiling) 
and more by the authenticity with which the emotions are 
displayed (e.g., genuineness of a smile). When the receiver 
perceives the sender's emotional display as fake or disin- 
genuous, he or she will not interpret the emotional display 
as adequate for reducing perceived ambiguity, so conscious 
emotional contagion is less likely to occur. 


Emotlonal Labor 


The concept of emotional labor, which we consider a poten- 
tial driver of customers' emotional states and subsequent 
assessments of service interactions, refers to the “effort, 
planning, and control needed to express organizationally 
desired emotions during interpersonal transactions" (Morris 
and Feldman 1996, p. 987). Recent management literature 
has considered emotional labor in an effort to better under- 
stand how service organizations can manage employees' 
positive displays to customers. Furthermore, it 1s linked to 
the existence of either explicit or implicit organizational 
display rules (Rafaeli and Sutton 1987) that define which 
emotions employees are expected to display and which they 
should suppress in the course of interacting with customers. 

In general, service employees are expected to align their 
displayed emotions with organizationally desired emotions 
through their choice of emotional labor strategies 
(Hochschild 1983). With regard to specific emotional labor 
strategies, scholars have drawn on Hochschild's (1983) dis- 
tinction between surface acting and deep acting as the pri- 
mary framework for service employees. In "surface acting,” 
an employee tries to change only his or her outward behav- 
ior to exhibit the required emotions. Thus, surface acting 
refers to the act of displaying an emotion that is not felt and 
could involve both suppression of felt emotions and faking 
of unfelt emotions. For example, when dealing with an 
angry or annoying customer, an employee may simply put 
on a smile and pretend to be cheerful and friendly without 
actually feeling the emotions. In other words, surface acting 
constitutes the expression of feigned emotions and lacks 
authenticity (Grandey 2003). With "deep acting," however, 
employees express expected (or required) emotions by 
attempting to create these emotions within themselves. This 


l'This type of emotional contagion is sometimes referred to as 
"emotional comparison" in the literature (e.g., Bartel and Saavedra 
2000) 
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strategy is similar to the method-acting technique developed 
by Russian director Constantin Stanislavski (1965), in 
which actors are taught to create self-induced true emotions 
by using their emotional memory and recalling prior experi- 
ences and emotions (Hochschild 1983). As an example of 
deep acting in the context of service delivery, Hochschild 
reports on flight attendants who are trained to deal with 
angry passengers by thinking of them as frightened, first- 
time fliers. This process enables the flight attendants to 
change their inner feelings toward the customer from 
annoyance to pity and sympathy. Thus, the distinction 
between surface acting and deep acting is conceptually 
aligned with the “service-as-theater” metaphor, which pos- 
tulates the interaction between service employees and cus- 
tomers as a dramaturgical interaction in which actors (i.e., 
employees) perform (i.e., provide service) on stage (i.e., the 
service environment) in front of an audience (i.e., cus- 
tomers) (Grove and Fisk 1992). 


Consequences of Service Interactions 


The display of employee emotions and the resultant cus- 
tomer emotions likely affect various outcomes of interest to 
marketing managers. We focus on three major customer 
consequences of service interactions that are considered 
particularly relevant to service companies: customer satis- 
faction, customer-employee rapport, and customers’ future 
loyalty intentions. 

“Customer satisfaction” is widely regarded as the cogni- 
tive assessment of a customer’s emotional experience (Hunt 
1993). As Oliver (1981) discusses, satisfaction is consump- 
tion specific; that is, it is related to a single consumption 
experience. This transaction-related characteristic is often 
considered the main difference between satisfaction and 
similar evaluative concepts, such as consumer attitude and 
perceived service quality, which are regularly modeled as 
overall constructs or general evaluations of a service and are 
unrelated to a specific consumption episode (Hennig- 
Thurau and Klee 1997). 

"Customer-employee rapport" is “a customer's percep- 
tion of having an enjoyable interaction with a service 
provider employee, characterized by a personal connection 
between the two interactants" (Gremler and Gwinner 2000, 
p. 92). As a relational concept applicable to service settings, 
rapport depends on one or more interactions between 
employees and customers. Similar to satisfaction, 
customer-employee rapport represents a customer's cogni- 
tive assessment of an affective state. It also is important to 
stress that rapport can be cultivated through a single service 
interaction and does not depend on a shared long-term his- 
tory. Customer-employee rapport has been identified as a 
salient issue for service organizations because rapport 
exerts a strong influence on customer perceptions of service 
delivery and service organizations (DeWitt and Brady 2003; 
Gremler and Gwinner 2000). 

Finally, customers’ “future loyalty intentions" constitute 
a central component of service loyalty (Oliver 1997), which 
itself is defined in various ways. Most definitions focus on a 
customer's willingness to visit a particular firm again 
because of his or her positive emotions and cognitions 
(Oliver 1999). In empirical research, future intentions repre- 
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sent a frequently studied component of loyalty, when loy- 
alty is defined and/or measured as future intentions (e.g., 
Fornell 1992; Rust and Zahorik 1993; Zeithaml, Berry, and 
Parasuraman 1996). We emphasize future loyalty intentions 
because we believe that, in general, the future behavior of 
customers is of more interest to service marketers than cur- 
rent consumer attitudes (i.e., attitudinal loyalty) and/or prior 
behavior. 


A Conceptual Model of the Impact 
of Employee Emotions on 
Customers in Service Interactions 


The impact of different facets of employees' emotional dis- 
plays on customer emotions and their consequences for cus- 
tomers constitute the focus of this research. In Figure 1, we 
show the conceptual model and the specific hypotheses 
tested herein. In the following section, we discuss the pro- 
posed relationships in detail. 


Impact of Employee Emotions on Customer 
Emotions 


Drawing on emotional contagion and emotional labor 
theories, we contend that in interpersonal interactions 
between employees and customers, the likelihood that an 
employee's emotions affect customer emotions is facilitated 
by two key variables: the extent of an employee's smiling 
and the authenticity of his or her emotional labor display. 
Although prior research has found that employees who 
deliver "service with a smile" increase a customer's service 
experience (e.g., Rafaeli and Sutton 1989; Tsai and Huang 
2002), little is known about how or why an employee's dis- 
play of emotions is related to customer consequences. Emo- 
tional contagion theory (Hatfield, Cacioppo, and Rapson 
1994) suggests that people's expression of positive emo- 
tions facilitates a corresponding emotional state in others. 
McHugo and colleagues (1985) demonstrate that exposure 
to images of smiling faces produces corresponding 
observed and self-reported emotions in study participants. 
Furthermore, as we discussed previously, primitive (i.e., 
subconscious) emotional contagion suggests that the extent 
of a service employee's positive emotional display (e.g., 
amount of smiling) is the key driver of emotional contagion 
(Hatfield, Cacioppo, and Rapson 1994). Thus, on the basis 
of the assumption that customers perceive employee smil- 
ing, we argue that if an employee increases his or her 
amount of smiling, customers are more likely to mimic 
these facial expressions subconsciously during the 
encounter, thus altering their own emotional state. We use 
the term "customer positive affect" for this positive emo- 
tional state (DeWitt and Liu 2002; Watson and Tellegen 
1985). 


H;: High amounts of employee smiling lead to a greater 
increase in customer positive affect than do low amounts 
of employee smiling. 


We also expect that the authenticity of the service 
employee's emotional labor display influences the cus- 
tomer's emotional state. As we indicated previously, deep 
acting and surface acting are alternative strategies that 


| FIGURE 1 
A Conceptual Model of Employee and Customer Emotions in Service Interactions 
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employees use to ensure that their emotional display toward 
customers conforms to organizational display rules. A key 
difference between these two emotional labor strategies is 
the degree of authenticity of the emotional display (Broth- 
eridge and Grandey 2002; Hochschild 1983; Kruml and 
Geddes 2000). We argue that a high level of authenticity of 
the employee’s emotional labor display, a main characteris- 
tic of deep acting, triggers positive emotions within cus- 
tomers due to their preference for being treated in an honest 
and authentic way. In contrast, we do not expect such posi- 
tive customer emotions when the employee’s emotional dis- 
play is inauthentic, which is a main characteristic of surface 
acting. Evidence from the social psychological literature on 
authentic (i.e., Duchenne) smiles and emotion recognition 
supports this view. For example, Ekman and colleagues 
(Ekman 1992; Ekman, Davidson, and Friesen 1990; Ekman 
and Friesen 1982) show that authentic smiles stimulate 
more positive emotional reactions by respondents than do 
“faked” smiles. It is argued that this distinction in reactions 
is due to different neurological bases, in that observers 
often respond more positively to some subtle facial cues 
associated with authentic emotional displays, including the 
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symmetry of the smile or the activation of certain muscle 
groups around the eyes (Ekman and Friesen 1982; Ekman, 
Friesen, and O’Sullivan 1988). Thus, customer reactions to 
authentic emotional displays are likely to be more positive 
than those to inauthentic displays. 

This argument receives additional support from con- 
scious emotional contagion theory (Barsade 2002; Bartel 
and Saavedra 2000), according to which the lack of authen- 
ticity associated with employee surface acting makes it less 
likely that customers can reduce their service-related ambi- 
guity through the adoption of the employee’s emotions 
(Grandey et al. 2005; Kruml and Geddes 2000). In the case 
of deep acting, however, employees summon true and genu- 
ine emotions from within and thus display positive emo- 
tions to customers that are authentic, enabling those cus- 
tomers to adopt the employee's emotions consciously—that 
is, to use the employee's displayed emotions as a type of 
social information to reduce the ambiguity associated with 
the service experience. In other words, the change in cus- 
tomer positive affect should be greater when employees' 
emotional labor displays are authentic (i.e., engage in deep 
acting) than when they are inauthentic (i.e., surface acting): 
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H»: A high level of authenticity of the employee's emotional 
labor display (i.e., deep acting) leads to a greater increase 
in customer positive affect than does a low level of 
authenticity of the emotional labor display (i.e., surface 
acting). 


Customer Consequences of Employee Emotions 


In addition to the proposed effects on customer positive 
affect, we also expect that the independent variables (i.e., 
extent of employee smiling and authenticity of the emo- 
tional labor display) directly affect both customer satisfac- 
tion with the transaction and customer-employee rapport. 
Regarding customer satisfaction, customers often interpret 
an employee's emotional display as part of the service itself 
(i.e., the notion of service as theater; Grove and Fisk 1992), 
which suggests that they hold expectations about the dis- 
play of positive emotions (i.e., smiling) that influence their 
level of satisfaction (Tsai 2001). Regarding customer- 
employee rapport, an employee's smiling may be an 
antecedent of rapport, in that it increases the receiver's 
enjoyment of the personal interaction (Gillis, Bernieri, and 
Wooten 1995; Tickle-Degnen and Rosenthal 1990). 


H4: High amounts of employee smiling lead to a greater 
increase in customer satisfaction with the transaction than 
do low amounts of employee smiling. 

H4: High amounts of employee smiling lead to a greater 
increase in customer-employee rapport than do low 
amounts of employee smiling. 


Similarly, we expect that the provision of authentic 
emotions as part of employee deep acting leads to greater 
customer satisfaction and rapport than does an inauthentic 
emotional labor display (i.e., surface acting). Grandey 
(2003) finds that employees' use of deep acting leads to 
higher ratings of service delivery than does the use of sur- 
face acting, and Grandey and colleagues (2005) report that 
customer satisfaction is higher when customers perceive 
employee behavior as authentic. Grandey (2003) also pro- 
vides support for the direct impact of the type of emotional 
labor strategy on customer-perceived rapport. In her study, 
employees' deep acting is related to perceptions of friendli- 
ness and warmth, both of which are considered characteris- 
tics of rapport (Gremler and Gwinner 2000). 


Hs: A high level of authenticity of the employee's emotional 
labor display (1.e., deep acting) leads to a greater increase 
in customer satisfaction with the transaction than does a 
low level of authenticity of the employee's emotional 
labor display (i.e., surface acting). 

Hg: A high level of authenticity of the employee's emotional 
labor display (i.e., deep acting) leads to a greater increase 
in customer-employee rapport than does a low level of 
authenticity of the employee's emotional labor display 
(1.e., surface acting). 


Relationships Between Customer Emotions and 
Customer Consequences 


Theoretical support for the impact of a customer's emo- 
tional state on his or her satisfaction comes from service 
research, which suggests that customer satisfaction with a 
service encounter is strongly influenced by customer emo- 
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tions (Oliver 1997). In particular, when customers assess a 
specific consumption experience (i.e., form their level of 
satisfaction with the service), they draw strongly on their 
current emotional state and ask themselves questions such 
as "How do I feel about it?" Thus, a change in a customer's 
emotions due to an employee's emotional display should 
influence the customer's satisfaction. This impact is consis- 
tent with the affect-as-information model from social psy- 
chology (Schwartz and Clore 1988), which suggests that 
people rely on their moods as information cues when they 
make evaluative judgments. Consequently, a change in cus- 
tomer emotions facilitated by an employee's emotional dis- 
play should lead to a change in customer satisfaction, such 
that an increase in customer positive affect results in higher 
levels of satisfaction. 

H;: An increase in customer positive affect has a positive 

effect on customer satisfaction with the transaction. 


We further hypothesize that an increase in customer 
positive affect as a result of employee emotions will influ- 
ence customers' perceptions of their rapport with the 
employee. Specifically, when a customer is "infected" by an 
employee's positive emotions as a result of a service inter- 
action, the customer likely will enjoy the interaction with 
the service employee to a greater extent. Because an 
enjoyable interaction is a key characteristic of customer— 
employee rapport (Gremler and Gwinner 2000), an increase 
in customers' positive emotions should ultimately lead to 
higher levels of rapport. 


Hg: An increase in customer positive affect has a positive 
effect on customer-employee rapport. 


The services literature also indicates relationships 
among customer-employee rapport, customer satisfaction 
with the transaction, and future loyalty intentions. Specifi- 
cally, we argue that rapport is positively related to customer 
satisfaction because an enjoyable interaction with a high 
degree of customer-employee rapport is usually one in 
which customers reveal personal information, which 
enables employees to customize the service offering to the 
customer's needs (Gremler and Gwinner 2000); both of 
these elements are considered integral parts of customer sat- 
isfaction. Furthermore, we expect that customer satisfaction 
is positively related to customers' future loyalty intentions 
through the creation of positive attitudes toward the service 
provider (Yi 1990), a claim we base on both attitude theory 
(Fishbein and Ajzen 1975) and exit-voice theory 
(Hirschman 1970). Finally, because customers who person- 
ally like a service employee and have rapport with him or 
her can be expected to form positive expectations about a 
future service experience with this employee (Gremler and 
Gwinner 2000), we propose that customer-employee rap- 
port has a positive relationship to customers' future loyalty 
intentions. 

Hg: Customer-employee rapport is positively related to cus- 

tomer satisfaction with the transaction. 

Hijo: Customer-employee rapport is positively related to cus- 

tomers' future loyalty 1ntentions. 


Hj): Customer satisfaction with the transaction is positively 
related to customers' future loyalty intentions. 


Methodology 


Participants 


Participants in the experiment were undergraduate and 
graduate students from various non-business schools of a 
medium-sized university. The final sample contained data 
from 223 participants, 46.8% of whom were women. The 
mean age of the respondents was 23.5 years (SD = 3.2); 
ages range from 18 to 48 years. Although the use of student 
samples is sometimes considered a limitation in marketing 
research, drawing on student samples in experimental 
designs, especially those involving role playing, is well 
accepted for examining causal relationships (e.g., Barsade 
2002; Surprenant and Solomon 1987). 

We contacted participants through the university's Web 
site and course announcements; they received either a free 
movie ticket for a local movie theater or a free DVD movie 
in return for their participation. Before the actual data col- 
lection, we conducted a pretest with 32 consumers to assess 
the adequacy of the study design and the experimental 
manipulations and to ensure that the psychometric proper- 
ties of our study measures were adequate. 


Procedure 


General research design. We informed all participants 
that a new business model for a pérsonalized movie rental 
service was being tested at the university. The service was 
positioned as a new "movie consulting service" for which 
customers would answer a series of questions about their 
movie-viewing habits and preferences in a one-to-one con- 
sultation with a service employee (unbeknownst to the sub- 
ject, this employee was a trained actor) Based on their 
answers, customers would then receive personalized advice 
for potential movie rentals from the employee. Throughout 
the duration of the study, professionally designed posters 
that promoted this new service appeared in various loca- 
tions at the university. The university also set aside specific 
facilities to serve as the movie consulting service location 
for the duration of this study. The layout and interior deco- 
ration of these facilities resembled those of actual video 
rental services (e.g., a service counter, shelves containing 
movies, movie posters, and life-sized figures announcing 
current and upcoming movies). Donations of actual DVD 
and VHS movies and various poster displays from local 
video rental stores and movie theaters decorated the facili- 
ties and thus increased the realism of the servicescape (see 
Figure 2). 

Experimental manipulations. Using a 2,x 2 between- 
subjects factorial design, we manipulated the extent of 
employee smiling (high versus low) and the authenticity of 
the emotional labor display (high authenticity/deep acting 
versus low authenticity/surface acting). We randomly 
assigned participants to one of the four experimental condi- 
tions, high smiling/deep acting (n = 58), high smuling/ 
surface acting (n = 52), low smiling/deep acting (n = 55), 
and low smiling/surface acting (n = 58). 


Dramaturgy. We used appointments to schedule one 
participant at a time, which limited waiting time and elimi- 


nated potential interactions among the participants. On 
arrival, each participant filled out a preencounter question- 
naire and then immediately entered the video consulting 
store to begin the service encounter. During the service 
encounter, participants interacted with the service 
employee, who engaged them in a conversation by asking a 
series of specific questions about movie preferences and 
viewing habits. The service encounter concluded with the 
employee recommending a specific movie on DVD or VHS, 
which he or she then handed to the customer. In general, 
these service encounters lasted between 5 and 10 minutes 
(M = 7.5 minutes). 


Actors. The service employees were three trained stu- 
dent actors (two female, one male) whom we specifically 
recruited for this study and paid for their time. We hired 
these actors on the basis of their experience in service work, 
their interest in acting, and their knowledge of motion pic- 
tures. Initially, we narrowed the pool of potential candidates 
down to six people, all of whom then performed in a series 
of auditions and rehearsals and participated in the pretest of 
the experiment. On the basis of their performances in these 
tests, we selected the final three actors and trained them 
over a period of six weeks. 

Furthermore, we developed a role description for each 
of the four conditions. All roles required the actors to 
behave in a customer-oriented manner and avoid the display 
of negative emotions. For the extent of smiling manipula- 
tion, we trained the employees to smile frequently when 
displaying a high level of smiling and to minimize their 
smiling when displaying low-level smiling behavior. In line 
with previous research, we defined smiling as a noticeable 
upward twist of the employee’s lips (Pugh 2001; Sutton and 
Rafaeli 1988). For the authenticity of the emotional labor 
display manipulation, we trained the actors in accordance 
with the characteristics of surface and deep acting, as 
described in the literature. They read some pertinent intro- 
ductory literature (e.g., Grandey 2003; Hochschild 1983), 
and two of the authors then discussed the concepts with the 
actors for a total of ten hours. Mainly, we drew from the 
concept of method acting by showing movie excerpts and 
using teaching techniques associated with method acting. 
For the surface acting role (i.e., low authenticity of the emo- 
tional labor display), the employee was instructed to adapt 
only his or her outward behavior to the customer’s needs 
but not his or her inner feelings. For deep acting (i.e., high 
authenticity of the emotional labor display), the employee 
was instructed to create the appropriate emotions within 
him- or herself. To teach these actors the skills to do so 
effectively, we employed specific exercises used in 
Stanislavski’s (1965) acting technique, which are com- 
monly taught to acting students as part of their basic train- 
ing. These exercises target the development of specific 
skills for using emotion memories (i.e., remembering and 
reexperiencing specific emotions) to evoke the required 
emotions. We continued the training until both the actors 
and the researchers were satisfied that the actors had mas- 
tered a sufficient skill set to enable them to act out the spe- 
cific roles required for the experiment. 
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FIGURE 2 
Design of the Video Consulting Store 
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Finally, we developed a standardized service script that 
consisted of a list of questions about customers’ movie- 
related preferences to ensure that all service encounters 
would be identical in nature in all respects other than the 
experimental manipulations (for further details about the 
service script, see Appendix A). Data collection took place 
over a period of seven days. Each actor played only one role 
per day over a time of approximately two-and-a-half hours. 
We randomly determined the roles in advance, and each 
actor performed for roughly equal amounts of time in each 
of the four roles. 


Measures 


Preencounter questionnaire. Study participants received 
a questionnaire just before they entered the video rental 
store, which included four items from Brief and colleagues' 
(1988) job affect scale to measure preencounter customer 
positive affect (see also Burke et al. 1989). We dropped two 
of the orginal six items (strong and active) because we 
deemed them to be inappropriate for use by customer 
respondents. In addition, to conceal the true nature of the 
study, the preencounter questionnaire contained various 
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filler items about preferences and consumption behaviors 
for motion pictures. 


Postencounter questionnaire. Participants completed a 
postencounter questionnaire immediately after the end of 
the service encounter; it included measures of post- 
encounter customer positive affect, customer-employee 
rapport, customer satisfaction with the transaction, and 
future loyalty intentions, as well as demographic variables. 
We assessed postencounter customer positive affect with 
the same items used 1n the preencounter survey and calcu- 
lated any changes as postencounter customer positive affect 
less preencounter customer positive affect for each of the 
four 1tems.? To measure rapport, we used four items from 


2There 1s an ongoing discussion about the problems associated 
with the use of difference scores (e.g., Peter, Churchill, and Brown 
1993) These issues predominantly address the integration of two 
measures (e.g., service expectations and service performance 
evaluation) but are less relevant when the same construct 1s mea- 
sured at two points 1n time. Two common concerns about differ- 
ence scores are that they lack variability and do not differ from the 
initial level (Tisak and Smith 1994). However, these issues do not 


the scale developed by Gremler and Gwinner (2000); we 
excluded items that implicitly suggest a long-term relation- 
ship between the customer and the employee. For customer 
satisfaction with the transaction, we used five items from 
the work of Churchill and Surprenant (1982) and Bitner and 
Hubbert (1994). Finally, we measured future loyalty inten- 
tions with four items, three of which we adapted from the 
work of Zeithaml, Berry, and Parasuraman (1996) and one 
from the work of Taylor and Baker (1994). We measured all 
items using seven-point scales, in which higher numbers 
indicated greater agreement with the construct (for a list of 
all study items, see Appendix B). 


Results 


Manipulation Checks 


We employed several methods to ensure the validity of our 
study manipulations. First, unbeknownst to customers, all 
service encounters were recorded with a hidden digital 
video camera and transmitted in real time to an adjacent 
room. Two trained judges (one of the authors and one grad- 
uate student) watched each interaction to judge whether it 
was consistent with the required experimental manipula- 
tion. Service encounters that the judges deemed to be incon- 
sistent with the required role were dropped from further 
analyses. Second, because the authenticity of the emotional 
labor display manipulation ultimately depends on an 
employee’s self-evaluation, after each service encounter, we 
asked the service employees whether they believed they had 
effectively portrayed the desired role. In addition, during 
the debriefing, one participant guessed the true nature of the 
study. As a result of these measures, we excluded three ser- 
vice encounters from further analysis, which resulted in 223 
usable encounters. 

Third, after completion of the study, four:graduate stu- 
dents (two male, two female) who were not familiar with 
the study watched all 223 videotaped service encounters 
and rated each interaction on several dimensions.? Specifi- 
cally, two of the judges separately watched all interactions 
and recorded (1) the frequency of employee smiles (defined 
as the number of times the employee noticeably smiled) 
during each interaction and (2) the duration of each smile 
(measured in seconds). We transformed these data into a 
smiles-per-minute measure and a measure of the percentage 
of time the employee smiled during each encounter, which 
enabled us to control for service encounter length. The 
interrater agreement, which we measured with the intraclass 
correlation coefficient (ICC; Shrout and Fleiss 1979), was 
high between the two judges for both items, (frequency of 
smiles: ICC = .909, p « .01; duration of smiles: ICC = .948, 
p < .01). The ANOVA results also showed that the manipu- 


appear to be critical in our case. Specifically, the standard devia- 
tions of the difference scores range from 1.46 to 1.65, and the cor- 
relation between the difference score and preencounter affect 
(—.34), though significant, is smaller than 1. 

3We thank the anonymous reviewers for their suggestions of 
how to use the videotaped encounters to provide additional 
manipulation checks for this study. 


lation of the extent of employee smiling had a significant 
effect on the frequency of smiles per minute (Min extent = 
3.5, Miow extent = 4; F(1, 220) = 386.62, p < .01) and the 
percentage of time spent smiling (Mhigh extent = 35.2%, 
Miow extent = 1-926; F(1, 220) = 417.09, p < .01), as 
intended. We found no significant effect for the authenticity 
of the emotional labor display on the frequency of smiles 
(Mhigh authenticity = 2.0, Miow authenticity = 1.8; FC, 220) = 
1.08, not significant [n.s.]), though it influenced the per- 
centage of time spent smiling (Mbich authentcty = 23.996, 
Miow authenticity = 12.6%; F(1, 220) = 18.64, p < .01), which 


suggests a potential confounding of the manipulations. As 


Perdue and Summers (1986, p. 323) suggest, providing evi- 
dence that the effect sizes for unintended manipulations are 
substantially smaller than those for the intended manipula- 
tions indicates that the experimental manipulations have 
worked effectively and that statistical significance “should 
not be of great concern.” In our case, the extent of the 
employee smiling manipulation had a much greater effect 
on the percentage of time spent smiling (partial n2 = .656) 
than did the authenticity of the emotional labor display 
manipulation (partial n2 = .078), thus providing support for 
the validity of our experimental manipulation. 

The other two judges also watched all 223 videotaped 
interactions and separately provided a global rating of the 
authenticity of the employee’s emotional display for each 
interaction (on a four-point scale ranging from “not at all 
authentic” [1] to “very authentic” [4]). Again, interrater 
agreement was high between the two judges (ICC = .927, 
p < .01). The authenticity of the emotional labor display 
manipulation had a significant effect on authenticity ratings 
(Migh authenticity = 3.8, Miow authenticity = 1.2; FC, 220) = 
2552.78, p « .01), as intended. The extent of employee 
smiling also had a significant main effect on authenticity 
ratings (Mhuigh extent = 2.7, Miow extent = 2.3; F(1, 220) = 4.21, 
p < .05), but the effect size of the authenticity of the emo- 
tional labor display manipulation (partial n? = .932) was 
greater than that of the employee smiling manipulation 
(partial n? = .147), again providing support for the validity 
of our experimental manipulation. 


Heliability and Validity Assessment 


We report the means, standard deviations, and correlation 
coefficients of all variables in Table 1. To assess the relia- 
bility and validity of our measures, we calculated Cron- 
bach's alpha coefficients for each construct. All alpha 
scores are satisfactory, with no values below .80. In addi- 
tion, we conducted a confirmatory factor analysis with all 
multi-item measures in the model. The overall fit statistics 
for the four-factor model indicate that the model provides 
an acceptable fit to the data: ¥2(129, N = 223) = 333.7, p< 
.01; comparative fit index = .950; incremental fit index = 
.950; Tucker-Lewis index = .941; standardized root mean 
square residual — .048; and root mean square error of 
approximation = .086. 

The average variance extracted is greater than .60 for all 


constructs, and the composite reliability measures are all 


greater than .80. We also find support for convergent 
validity because the t-values for all constructs are signifi- 
cant at p « .01 (Anderson and Gerbing 1988). Similarly, we 
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TABLE 1 
Descriptive Statistics, Rellabllity Information, and Correlations 


Number 
of Items 

1. Customer positive affect (preencounter) 4 

2. Customer positive affect (postencounter) 4 

3. Change in customer positive affect 4 

4. Customer-employee rapport 4 

5. Customer satisfaction with transaction 5 

6. Future loyalty intentions 4 


3.70 
4.41 

71 
4.91 
4.69 
5.12 


SD 1 2 3 4 5 6 
1.20 .812 

1.41 .996 .904 

1.24 | —336  .596  .806 

1.66 .233 643 . .503 .967 

1.47 323 771 561 827 932 

1.56 .269 .662 486 .696 .798 .940 


Notes: N = 223 Values along the diagonal represent Cronbach's alpha internal consistency estimates All correlations are significant at p < .01 


(two tailed). 


find support for discriminant validity; the squared correla- 
tions between each pair of constructs are smaller than the 
average variance explained of the respective constructs 
(Fornell and Larcker 1981). Thus, these results indicate the 
acceptable reliability, convergent validity, and discriminant 
validity of our measures. 


Two-Way Hepeated Measures ANOVA 


To test H, and Hy, we conducted a two-way repeated mea- 
sures ANOVA to examine the effects of the extent of 
employee smiling and the authenticity of the emotional 
Jabor display on the change in customer positive affect. The 
within-subjects factor, customer positive affect, contains 
two levels: preencounter affect (Time 1) and postencounter 
affect (Time 2). As Tables 2 and 3 show, the extent of 
employee smiling has no significant main effect on cus- 
tomer positive affect (F(1, 217) = 1.5, n.s.), but the authen- 
ticity of the emotional labor display by employees has a sig- 
nificant effect (F(1, 217) = 35.2, p < .01, partial n? = .139).4 
The nature of this effect is in the expected direction: 
Employees who express authentic emotions by engaging in 
deep acting facilitate customer positive affect to a much 
greater extent (change in customer positive affect — 1.18, 
SD = 1.09) than do those who display inauthentic emotional 
labor by engaging in surface acting (change in customer 
positive affect = .25, SD = 1.22). This finding holds true not 
only for the overall composite score of customer affect but 
also for each of the four emotions (i.e., elated, peppy, enthu- 
siastic, and excited). Thus, we find strong evidence in sup- 
port of H, but not of H,. Therefore, we conclude that 
employees' emotional displays affect customers' emotional 
states, but this process appears to be driven primarily by the 
authenticity of the emotional labor display rather than by 
primitive emotional contagion through employee smiling. 


PLS Equatlon Modeling 


Next, we tested all elements of our conceptual model simul- 
taneously with PLS structural equation modeling. As a 
component-based method, PLS permits the use of nominal 
data, which we need to assess the effects of smiling and 


4Although we did not formally hypothesize an interaction 
effect, our examination indicates that both effects are largely inde- 
pendent with no interaction effect (F(1, 217) = .607, n.s.). We also 
investigate the extent to which the results are influenced by the 
individual employees by controlling for actors in the analysis. This 
did not change the results. 
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authenticity (Fornell and Bookstein 1982). Furthermore, as 
a distribution-free method, PLS has fewer constraints and 
statistical specifications than covariance-based techniques, 
such as LISREL. 

We operationalized both the extent of employee smiling 
and the authenticity of the emotional labor display as 
dichotomous variables (1 = low extent, 2 = high extent; 1 = 
low authenticity/surface acting, 2 = high authenticity/deep 
acting, respectively). To estimate the model paths, we used 
PLS Graph 3.0 and estimated the inner weightings with the 
path method (Chin 2001). To estimate the predictive power 
of the model, we applied a blindfolding approach (Fornell 
and Bookstein 1982). This approach results in Q2 values of 
.38 (change in customer positive affect), .67 (customer sat- 
isfaction), .81 (rapport), and .72 (loyalty intentions), all of 
which are significantly different from 0 and therefore indi- 
cate that the model has predictive power (Geisser 19774; 
Stone 1974). The model explains 14.8% of the variance in 
change in customer positive affect, 47.696 of the variance in 
rapport, 71.7% of the variance in customer satisfaction with 
the transaction, and 65.996 of the variance in future loyalty 
intentions, in further support of its relevance (Chin 1998). 
All indicator reliabilities are greater than .75, composite 
reliability is greater than .85 for all constructs, and the aver- 
age variance explained is greater than .60 in all cases (for 
more detailed information, see Appendix B). To determine 
whether the use of a difference score measure for change in 
positive affect influenced the results, we also analyzed an 
alternative model with postencounter customer positive 
affect rather than change in customer positive affect, but we 
found no differences between the two models.° 

The PLS results support most of our hypotheses about 
the inner model relationships and provide convergent 
validity for the findings of the two-way repeated measures 
ANOVA (see Table 4).6 Specifically, our results show that 
the authenticity of the employee's emotional labor display 


5Significance levels and the direction of the effects for all 
hypothesized paths are identical to those of the difference score 
model. The variance explanations for the postencounter model are 
11.696 for postencounter customer positive affect, 60.496 for rap- 
port, 78.496 for customer satisfaction, and 66.096 for loyalty 
intentions. 

6We generated the t-values through a bootstrapping procedure 
with 223 resamples with 100 cases each (Fornell and Bookstein 
1982). 
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TABLE 3 
Means and Standard Deviations for Customer Positive Affect 











Customer Positive Affect 
Change In 
Time 1: Time 2: Customer 
Preencounter Postencounter Positive Affect 
Extent of employee smiling High extent 3.72 (1.21) 4.54 (1.38) 82 (1.26) 
Low extent 3.69 (1.20) 4.30 (1.44) .61 (1.21) 
Authenticity of the emotional High authenticity 3.71 (1.20) 4.872 (1.28) 1.185 (1.09) 
labor display Low authentlcity 3.70 (1.21) 3.958 (1.40) .25b (1.22) 





Notes: Standard deviations are shown in parentheses. Values with the same superscript are significantly different from each other at p « 001; 


all other values do not significantly differ. 


has a strong and significant impact on the change in cus- 
tomer positive affect—high authenticity through deep act- 
ing results in a significantly greater increase in the change 
in customer positive affect than does low authenticity 
through surface acting—but the extent of employee smiling 
does not significantly influence the change in customer 
positive affect. 

The extent of employees' smiling and the authenticity of 
their emotional labor display both have a significant, direct 
impact on customer-employee rapport, in support of H4 and 
Hg, but they do not affect customer satisfaction, which fails 
to support H4 and Hs. The PLS results also show that 
change in customer positive affect has a significant effect 
on both customer satisfaction with the transaction and 
customer-employee rapport, in support of H} and Hg, 
respectively. Rapport positively influences customer satis- 
faction (in support of Ho), and customer satisfaction posi- 
tively influences future loyalty intentions (in support of 
Hj). Contrary to our expectations, rapport has no direct 
relationship to future loyalty intentions, leading us to reject 
Hio. However, the total effect of rapport on loyalty inten- 
tions is strong and significant as a result of the strong 
impact of rapport on satisfaction (for a listing of all total 
effects, see Table 4). 


Discussion 


What We Have Learned 


The purpose of this study was to develop and experimen- 
tally test a conceptual model of how employee emotions 
influence customers in service encounters. We apply a two- 
way repeated measures ANOVA and PLS structural equa- 
tion modeling to data collected from a sample of 223 con- 
sumers. The results of this research provide several 
important contributions to the literature. 

First, we fill an important gap in the service marketing 
literature by providing empirical evidence that an 
employee's emotional display can trigger changes in a cus- 
tomer's affective state. Our study is the first to provide a 
direct test of emotional contagion and its related processes 
in service interactions, and the controlled experimental 
design of our study enables us to test causal effects between 
different facets of an employee's display of emotions and 
the customer's state. 
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Second, our findings suggest that the authenticity of the 
employee's emotional labor display, rather than the extent 
of smiling, influences the customer's emotions and percep- 
tions. Respondents in our study who encountered authentic 
employees (1.e., those who engaged in deep acting) were far 
more likely to adopt the emotions of that employee than 
were those who interacted with inauthentic employees (i.e., 
those who engaged in surface acting). This finding con- 
tributes to the emotional labor literature in that our study is 
the first to provide evidence that different types of emo- 
tional labor (i.e., deep versus surface acting) differentially 
influence the customer experience during service encoun- 
ters. Furthermore, our findings contribute to the emotional 
contagion literature by contradicting the dominant belief 
that emotional contagion in service settings is based mainly 
on mimicry effects (i.e., primitive contagion processes; see, 
e.g., Pugh 2001). Although mimicry effects occur within 
extremely short time frames (often less than one second) 
and thus might affect the receiver’s emotions for a short 
period, our results suggest that their impact fades during the 
course of a service encounter. That is, primitive emotional 
contagion might occur in the early phases of service 
encounters, but it does not remain throughout the comple- 
tion of the encounter, because postencounter emotions do 
not appear to be influenced by mimicry effects. 

Third, our findings provide support for the affect-as- 
information theory, which links customer emotions to satis- 
faction (Schwartz and Clore 1988); the results show that 
both the customer’s postencounter emotional state and 
change in positive affect lead to increased customer satis- 
faction. An increase in customer positive affect also influ- 
ences customer satisfaction indirectly through customer- 
employee rapport. Because satisfaction is positively related 
to customers' future loyalty intentions, employees' emo- 
tional displays appear to play important roles in a service 
firm's long-term success and have significant impacts on 
key customer outcomes. 

Fourth, we could not replicate Grandey and colleagues' 
(2005) results of a direct relationship between emotional 
labor and customer satisfaction. In our study, the two paths 
from the extent of employee smiling and the authenticity of 
the emotional labor display to customer satisfaction are not 
significant. Therefore, our results suggest that such effects 
are of an indirect nature, with customer-employee rapport 
being the main driver of customer satisfaction. Thus, the 
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link between authenticity of the emotional labor display and 
customer satisfaction is not as direct as previously thought 
but instead is mediated by the extent to which customers 
perceive that rapport with the service employee exists. A 
possible explanation of this finding is that whereas the con- 
struct of rapport focuses mainly on the interaction with the 
employee, satisfaction also captures the customer’s assess- 
ment of other aspects of the service encounter, such as the 
design of the servicescape (1.e., “I like how the video store 
looks") and outcome qualities (i.e., “I like the movie I was 
given by the employee"). 

Fifth, although we find that customer-employee rapport 
is a driver of satisfaction, our results also suggest that rap- 
port does not exert a direct influence on customer loyalty. 
Although previous marketing research has postulated such a 
direct relationship (Gremler and Gwinner 2000), our study 
suggests that the interplay among rapport, satisfaction, and 
future loyalty intentions may be more complex than previ- 
ously believed. 


What We Still Need to Learn 


Although our findings expand the extant knowledge on 
emotions in service encounters, we recognize several limi- 
tations that must be taken into account when generalizing 
our results. The design of our experiment provides insight 
into the transfer of employee emotions in one specific ser- 
vice domain—an innovative video rental service—but it 
remains unclear whether findings would be similar for other 
services. Another limitation is that we focus on positive 
emotions because they are most relevant (and desired) in 
the service delivery context. However, similar ripple effects 
may occur for negative emotions. For example, anger and 
frustration displayed by employees may transfer to cus- 
tomers and negatively affect their service experience. Fur- 
ther research should address this issue by testing whether 
effects similar to those we find apply equally to the flow of 
negative emotions in the service delivery context. 

Because participants in our study engaged in a service 
that was unfamiliar to them, their emotional processes were 
completely independent of any prior service interactions. 
However, in many real-life services, relationships between 
employees and customers have been established over a 
series of encounters, which might influence the emotional 
flow in future service transactions. Another avenue for fur- 
ther research would be to test the cross-cultural stability of 
our findings. Given that cultural norms about service deliv- 
ery and smiling behavior in general vary across cultures, 
additional research should examine the extent to which cul- 
tural variables influence the interplay of employee emo- 
tional displays and customer variables in service deliveries. 


What Marketing Managers Should Do 


This study illustrates that the emotions displayed by front- 
line employees are an important driver of the relationships 
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between service employees and customers. The transfer of 
positive emotions from employees to customers can enable 
firms to establish high levels of customer-employee rap- 
port, customer satisfaction, and future loyalty intentions, all 
key objectives of relationship marketing. 

Our finding that customer affect is driven mainly by the 
authenticity of the emotional labor display, rather than the 
extent of employee smiling, has important implications for 
service marketers. First, it contradicts the common mantra 
in service organizations that service with a smile and 
friendly service staff inevitably pay off and serve as a 
competitive advantage. The commonly advocated "Smile!" 
policy that is currently in place in many firms may be a less 
useful strategy than is generally believed. Rather, our 
results indicate that the authenticity of the emotional dis- 
play by frontline staff and the sincerity with which staff 
interacts with customers may be much stronger drivers of 
service outcomes than policies that require people to smile 
at any cost but do not provide them with the tools to create 
and display seemingly real emotions when they serve cus- 
tomers. Simply hiring low-paid service workers with lim- 
ited qualifications and motivation and requiring them to 
smile at customers as part of their job description may not 
deliver the desired results, whereas the display of authentic 
feelings by service employees—facilitated by the use of 
deep acting—is likely to be more effective for positively 
influencing customer satisfaction and related service 
outcomes. 

Second, in terms of managing frontline employees, the 
provision of authentic feelings is a far more challenging 
task than simply smiling at customers. Such strategy may 
require increased managerial emphasis on identifying and 
hiring talented and qualified frontline employees. Managers 
also may need to provide additional training for frontline 
staff that teaches them how to engage in deep acting. One 
such approach uses a perspective-taking technique, which 
puts employees in the customers’ shoes and thereby 
increases their ability to adopt a customer’s viewpoint 
(Parker and Axtell 2001). For example, flight attendant 
trainees can be taught strategies to deal with irate cus- 
tomers, such as imagining that something traumatic has 
happened in these customers’ lives or by pretending that 
they are children (Hochschild 1983). 

Third, our results confirm that the emotions customers 
experience during service encounters play crucial roles and 
directly affect the success of service relationships. Because 
customer emotions appear to be key drivers of rapport with 
employees and, ultimately, customer satisfaction and loy- 
alty intentions, service organizations may benefit from 
focusing their attention on increasing positive customer 
emotions. This recommendation is consistent with emerg- 
ing literature on customer delight (Rust and Oliver 2000), 
which stresses the emotional component of customers’ ser- 
vice evaluations. 


APPENDIX A i 
Summary of Service Script 


Step 1. As soon as the customer enters the video consult- 
“ing store and steps toward the service counter, the 
employee makes eye contact with the customer 

and welcomes him or her to the store. 

Step 2. The employee briefly introduces the nature of the 
movie consulting business and Its service offerings 
following a standardized script. The employee 
polnts out that thls service is currently being tested 
and thus is free of charge to the customer. 


Step 3. The employee asks a series of standardized ques- 
tions about the customers movie-viewing habits 
and preferences. These questions follow a stan- 
dardized order and cover the following topics: (1) 
preferences for specific movie genres, (2) prefer- 
ences for happy endings, (3) importance of film 
directors In influencing choice of movies, (4) prefer- 
ences for specific film directors, (4) preferences for 
speciflc lead actors and actresses, and (5) prefer- 
ences for a movie's country of origin. During this 
process, the employee takes brief notes while the 
customer answers all questions. 


Step 4. Following the set of standardized questions, the 
employee briefly reviews hls or her notes and sum- 
marizes the customer's responses. 


Step 5. The employee proceeds to recommend a specific 
movie to the customer on the basis of the cus- 
tomers responses. The employee briefly introduces 
the movie using a standardized script. If the cus- 
tomer has already seen this movie or'is not inter- 
ested for other reasons, the employee recom- 
mends up to two additional movies. 


Step 6. When the movie cholce is finalized, the employee 
asks the customer for his or her choice of DVD or 
VHS format, and the employee hands the movie to 
the customer. The customer is thanked for his or 
her business and leaves the video consulting store. 


APPENDIX B 
List of Hems and Goodness-of-Fit Measures of 
PLS Analysis 


Change In Customer Positive Affect (Composite 
Reliability [CR] = .871, Average Varlance Extracted 


[AVE] = .627) 
At this moment, | feel ... 
elated. 794 
peppy. .760 
enthusiastic. .800 
excited. .813 
Customer Satisfaction with Transaction (CH - .948, 
AVE = .784) 
| am delighted by thls service experlence. .922 
Thls service experience really helped me to find a 
good movie. ./64 
This service experience was a great one. .863 
| am satisfied with this specific service experience. .939 
| really liked this service experience. .928 


Customer—Employee Rapport (CR = .977, AVE = .913) 
In thinking about my relationship with this person, 


| enjoyed interacting with this employee. .949 
This employee created a feeling of “warmth” in our 

relationship. .958 
This employee related well to me. 947 
| was comfortable interacting with this employee. .969 


Future Loyalty Intentions (CR = .956, AVE = .846) 
The next time ! need the services of a video store, 


| will choose this company. .936 
| will Say positive things about this service provider 

to other people. .922 
| plan to visit this service provider in the next years. .922 
| will recommend this service provider to someone 

who seeks my advice. .898 


Notes: Numbers In the right-hand column are coefflclents of 
determination 
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Yong Liu 


Word of Mouth for Movies: Its 
Dynamics and Impact on Box Office 
Revenue 


This article uses actual word-of-mouth (WOM) information to examine the dynamic patterns of WOM and how it 
helps explain box office revenue. The WOM data were collected from the Yahoo Movies Web site. The results show 
that WOM activities are the most active during a movie’s prerelease and opening week and that movie audiences 
tend to hold relatively high expectations before release but become more critical in the opening week. More 
important, WOM information offers significant explanatory power for both aggregate and weekly box office revenue, 
especially in the early weeks after a movie opens. However, most of this explanatory power comes from the volume 
of WOM and not from its valence, as measured by the percentages of positive and negative messages. 


mal communication among consumers about attributed to the WOM that these movies generated. 

products and services. Two important features dis- This study makes use of a data set containing actual 
tinguish WOM from other information sources, such as WOM information to study the dynamic patterns of movie 
advertising: WOM is usually perceived as more credible WOM, the explanatory power of WOM for box office reve- 
and trustworthy, and it is more readily accessible through nue, and the potential antecedents of WOM. It adds to the 
social networks (Banerjee 1992; Brown and Reingen 1987; growing literature on movie marketing and provides 
Murray 1991). A McKinsey & Company study found that insights into new product development, entertainment mar- 
67% of the sales of consumer goods are based on WOM keting (Eliashberg and Shugan 1997), services marketing, 


WE of mouth (WOM), or “buzz,” involves infor- Star Wars: Episode 1-The Phantom Menace, has been 


(Taylor 2003). and WOM research in general. 

In the arena of new product development, WOM may The WOM data were collected from the Yahoo Movies 
play particularly important roles because awareness must be message board (see http://movies.yahoo.com/). It contains 
built, and consumers need information when deciding more than 12,000 posted messages, which were coded into 
whether to purchase a product they do not know well weekly measures of volume and valence before being 
(Mahajan, Muller, and Kerin 1984). In the movie industry, empirically examined. This study attempts to provide 
which has become a popular setting to examine various new answers to the following questions: First, in markets in 
product issues (e.g., Elberse and Eliashberg 2003; Krider which the product has a low price and prerelease informa- 
and Weinberg 1998; Lehmann and Weinberg 2000; Sawh- tion is abundant (e.g., massive prerelease advertising, exten- 
ney and Eliashberg 1996), hundreds of new films are sive professional reviews), how active is WOM before 
released annually by Hollywood and independent filmmak- release, and is there any consistent shift in consumer 


ers. In general, it is believed that WOM strongly influences evaluation of a new product before introduction (i.e., expec- 
people's movie selection (Austin 1989; Bayus 1985; Faber tations) versus after introduction (i.e., experience based)? 
and O'Guinn 1984; Neelamegham and Chintagunta 1999). Second, to what extent do important measures of WOM 
Indeed, the box office success of several movies, such as help explain the sales of a new product, both periodically 
The Blair Witch Project, My Big Fat Greek Wedding, and and in aggregation? Third, what factors consistently func- 
tion as antecedents of WOM across time periods? If WOM 
— Ó€—P—— P———————————— is important for the success of a new product, firms should 
Yong Liu is Assistant Professor of Marketing, Martin J. Whitman School of try to understand and manage these antecedents. 
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studies have used these data, with the notable exception of 


To read or contribute to reader and author dialogue on this article, visit Godes and Mayzlin (2004b), who study WOM of television 


http/www.marketingpower.comvjmblog. shows from the Usenet newsgroups. Through regressions 
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across weeks and shows, they empirically test the relation- 
ship between WOM (measured in volume and dispersion 
across different newsgroups) and weekly ratings. They find 
that the dispersion measure, but not volume, has significant 
explanatory power for ratings. 

The current study differs from that of Godes and May- 
zlin (2004b) in several ways. First, in addition to volume, 
the role of valence, which is coded on the basis of the con- 
tent of each WOM message, is investigated in detail. Sec- 
ond, many of the results provide insights into the prerelease 
period of a new product, which is critical in markets, such 
as movies, in which early sales are of great concern. Third, 
in addition to the relationship between WOM and sales, the 
dynamic patterns of WOM  (i.e., how it evolves and its 
potential antecedents) are investigated. Finally, the research 
context is different. Unlike network television shows, which 
normally run on a seasonal basis, movies have very short 
life cycle in theaters. They also have unique industrial fea- 
tures that offer interesting research questions and require 
special considerations in empirical analysis; for example, a 
lot of weight is placed on the opening week, and a mini- 
mum number of weeks are usually specified in exhibition 
contracts (De Vany and Walls 1999). 


Theoretical Background 


Although successful movies can generate great box office 
revenue and profits, the movie industry is also known for 
high risks. Of any ten major films produced, six to seven are 
unprofitable (Vogel 2001, p. 35; see also Shugan 1995). 
Two characteristics of the movie industry contribute to the 
popular belief that WOM influences moviegoers. First, as a 
prime category of popular culture goods, movies tend to 
receive great public interest and attention. Therefore, active 
interpersonal communication about movies can be expected 
to exist and, as indicated by the theory of information 
accessibility and influences (Chaffee 1982), may influence 
the audience. Second, the intangible and experiential nature 
of movie consumption makes it difficult to judge movie 
quality before it is actually viewed. When the alternatives 
are difficult to evaluate before purchase, consumers often 
engage in WOM to gather more information (Bristor 1990; 
Harrison-Walker 2001; Rogers 1983). 

Volume and valence are among the most important 
WOM attributes that have been examined (e.g., Mahajan, 
Muller, and Kerin 1984; Mizerski 1982; Neelamegham and 
Chintagunta 1999). Volume measures the total amount of 
WOM interactions. Valence captures the nature of WOM 
messages (i.e., whether they are positive or negative). 
Mainly because of data availability, most extant studies 
examine either volume or valence, but not both. For exam- 
ple, Anderson (1998) and Bowman and Narayandas (2001) 
focus exclusively on the volume of WOM.  . 

Two previous studies use controlled laboratory settings 
to investigate how WOM is related to certain behavioral 
features of moviegoers. In particular, Faber and O'Guinn 
(1984) examine how much credibility and importance col- 
lege students attach to WOM versus other information 
sources. Burzynski and Bayer (1977) study how WOM 
moderates people's liking of a film. Several studies that 


focus on other issues of the movie market have included 
various proxy measures of WOM as covariates (e.g., 
Elberse and Eliashberg 2003; Neelamegham and Chinta- 
gunta 1999; Zufryden 1996). Two other independently con- 
ducted studies also use data from Yahoo Movies. Dellaro- 
cas, Awad, and Zhang (2004) use a Bass diffusion model to 
examine how user ratings posted in the opening week help 
explain the two Bass parameters (p = the external influence 
factor, and q = the internal influence factor), which are esti- 
mated from the box office history of a movie sample. They 
find that the volume of the first week's user ratings and their 
density (defined as a ratio between the volume of ratings 
and the first week's box office revenue), but not the numen- 
cal value of these ratings, are useful in explaining p. How- 
ever, the value of user ratings becomes a significant 
explanatory variable for q. Duan, Gu, and Whinston (2005) 
use similar user-ratings data but focus on the correlation 
between the daily measures of these ratings and the daily 
box office revenue in the first two weeks. They find that 
user ratings have no explanatory power for box office reve- 
nue, but the volume of ratings does. 

There are several important differences between the cur- 
rent study and those of Dellarocas, Awad, and Zhang (2004) 
and Duan, Gu, and Whinston (2005). First, this study disen- 
tangles the weekly effects of WOM in a way similar to 
Eliashberg and Shugan (1997) and Basuroy, Chatterjee, and 
Ravid (2003). This is useful given that different weeks are 
considered of different importance in the movie industry, 
and many managerial decisions are made on a weekly basis. 
Second, the data contain WOM information in the pre- 
release period, which can be used to explain and predict the 
movie's opening-week box office revenue. An important 
managerial value of prerelease WOM is that it can help 
improve the early forecast of opening-week box office and 
aggregate box office revenue. Third, this study investigates 
the potential antecedents of WOM, which provide useful 
implications for both WOM research and managerial prac- 
tice. Finally, because the data are coded into weekly mea- 
sures, it is possible to examine how WOM evolves over 
time.! 

Extant studies have found that the volume of WOM cor- 
relates significantly with consumer behavior and market 
outcome (Anderson and Salisbury 2003; Bowman and 
Narayandas 2001; Van den Bulte and Lilien 2001). The rea- 
son that is often advanced to explain why the pure volume 
of WOM matters is consumer awareness. For example, 
Godes and Mayzlin (2004b) suggest that the more conver- 
sation there is about a product, the more likely someone is 
to be informed about it, thus leading to greater sales. On the 
basis of a similar rationale, research that uses diffusion 
models often examines WOM by either the number of 


lOn a more technical level, the data are collected from the 
Yahoo Movies message board, which is characterized as a “com- 
munity” site that allows bilateral discussions in a threaded manner 
(see http://movies.yahoo.com/). Dellarocas, Awad, and Zhang’s 
(2004) and Duan, Gu, and Whinston’s (2005) data come from the 
“User Review” section of Yahoo Movies, where users can post a 
numerical rating, a written review, or both. 
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adopters (Neelamegham and Chintagunta 1999) or the 
interaction between the number of adopters and that of non- 
adopters (Zufryden 1996). In this article, the effect of vol- 
ume is called the “informative” effect on awareness. 

Positive WOM typically gives either a direct or an indi- 
rect recommendation for product purchase. Negative WOM 
may involve product denigration, rumor, and private com- 
plaining. The reason valence matters is relatively straight- 
forward; positive WOM enhances expected quality (and, 
thus, consumers’ attitudes toward a product), whereas nega- 
tive WOM reduces it. In this article, the effect of valence is 
called the “persuasive” effect on attitude. 

Although volume and valence are among the most 
important measures of WOM in the literature, other mea- 
sures, such as duration, intensity, and dispersion, exist 
(Eliashberg et al. 2000; Godes and Mayzlin 2004b). 
Because these measures are difficult to construct from the 
current data, this study focuses on volume and valence. 

The effects and potential antecedents of movie WOM 
are investigated by examining three important aspects of a 
movie’s run: the opening week, the weeks after opening, 
and aggregate box office revenue. Previous studies that have 
adopted similar approaches include those of Elberse and 
Eliashberg (2003), who separate the opening week from 
remaining weeks, and Eliashberg and Shugan (1997), who 
examine the effects of critical reviews in each of the first 
eight weeks after a movie’s release and on aggregate box 
office revenue. 


Comparison of WOM Before and After Release 


Opposite theories exist about the degree of WOM activities 
before a movie’s release. Several factors suggest that pre- 
release WOM is minimal. First, the price of watching a 
movie is relatively low for an average consumer; the aver- 
age movie admission was $6 in 2003 (Motion Picture Asso- 
ciation 2003). The low cost, coupled with moviegoing often 
being an impulse decision, implies that potential movie- 
goers may not go through an intensive decision-making 
process in which a large amount of information, such as 
WOM, is needed. Second, most movies are released after 
intensive prerelease advertising. Advertising, which is 
mainly done in newspapers and on television, typically 
amounts to 50% of the production budget (Vogel 2001, p. 
96). It provides information about the opening date, main 
cast, some of the storyline, critical opinions (in a selected 
fashion), and, in many cases, whether the movie is to be 
released nationwide “in a theater near you” or only “in 
selected cities.” Critical reviews also provide useful infor- 
mation to moviegoers and can be easily obtained from 
many media channels (Faber and O’Guinn 1984; Litwak 
1986). Because movie critics are mostly independent from 
the studios, their reviews could offer supplementary value 
to advertising. If massive advertising and active critical 
reviews can meet potential moviegoers’ need for informa- 
tion and thus be an effective substitute for WOM, the 
amount of WOM in the prerelease period should be mini- 
mal (Chaffee 1982). Finally, it is conceivable that the actual 
experience with a product may function as an important 
stimulus for WOM communication (Anderson 1998). Thus, 
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the volume of WOM could be expected to be significantly 
large only after many moviegoers have seen the movie in 
the opening week.? 

Conversely, as discussed previously, movies are popular 
cultural goods with an intangible/experiential nature, which 
increases the likelihood that prerelease WOM is active. 
Because WOM comes from other moviegoers, it may also 
be perceived as more trustworthy than advertising (Faber 
and O’Guinn 1984), and it may better reflect popular taste 
than critical reviews (Holbrook 1999). As a result, although 
abundant information about a to-be-released movie is pro- 
vided by advertising and critical reviews, WOM may still be 
perceived as useful and thus serve as a complement to 
rather than a substitute for other information sources. This 
study relies on an empirical approach to examine how 
active WOM is during the prerelease phase of a movie and 
how it compares with WOM in the opening week. 

By its very nature, advertising will be positive about a 
new movie. Notably, Eliashberg and Shugan (1997) find 
that critical reviews are also mostly positive. To the extent 
that these sources of information influence moviegoers, the 
valence of prerelease WOM for an average movie should be 
mainly positive. After a movie is released, actual experience 
will come into play to influence WOM valence further. 
Abundant research indicates that consumer satisfaction is 
influenced by the confirmation or disconfirmation of expec- 
tations (Anderson 1973; Oliver 1980; Olson and Dover 
1979). Because the viewing public holds a comparatively 
high expectation for a to-be-released movie, the likelihood 
of disconfirmation is greater. In turn, this may result in a 
decrease in the positivity of WOM. Lieberman and Esgate 
(2002) make a similar argument for movie trailers and inter- 
active games; that is, because trailers and games create high 
expectations before release, viewers are more likely to be 
disappointed after seeing the movie and talk unfavorably 
about it. Thus: 


H;: The valence of prerelease WOM is mainly positive, and 
the valence of WOM in the opening week is less positive 
than that of prerelease WOM. 


Weekly Box Office Revenues 


Figure 1 presents a conceptual framework that integrates 
WOM into an otherwise familiar picture of the movie mar- 
ket (Elberse and Eliashberg 2003; Lehmann and Weinberg 
2000). The extant literature consistently finds that the num- 
ber of screens on which a movie is shown strongly influ- 
ences box office revenue in a given week (e.g., Swami, 
Eliashberg, and Weinberg 1999). Evidence also indicates 
that critical reviews correlate with weekly box office reve- 
nue, though previous studies have reported mixed findings 
about the significant level of this correlation and when it 
exists in a movie's run (Basuroy, Chatterjee, and Ravid 


2A caveat is that some audiences might have the chance to 
watch a sneak preview of a to-be-released movie. However, sneak 
previews are conducted only for selected movies, in few cities, and 
for a small number of people. The vast majority of moviegoers 
would not have experienced the movie before its release. 


FIGURE 1 
A Conceptual Framework of Movie Box Office Sales 
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Notes: MPAA = Motion Picture Association of America. 


2003; Eliashberg and Shugan 1997; Reinstein and Snyder 
2005). Whereas the number of screens captures distribution 
intensity (and, thus, the audiences’ accessibility to movies), 
critical reviews and WOM constitute most of the third-party 
information that moviegoers may use. Unlike critical 
reviews, which are static and occur mostly before the movie 
ig released, WOM is a dynamic phenomenon that begins 
before release and spans the entire run. Thus, it is important 
to measure WOM periodically. 

If WOM is able to influence consumer decisions, incor- 
porating WOM into extant models that have the number of 
Screens and critical reviews as explanatory variables 
(Basuroy, Chatterjee, and Ravid 2003; Eliashberg and 
Shugan 1997) will enhance the models’ explanatory power 
for weekly box office revenue. This is especially true if 
WOM functions as a complement to critical reviews. An 
additional factor contributing to the explanatory power of 
WOM beyond that of the number of screens is the reality of 
overcapacity in movie theaters. The movie exhibition indus- 
try has been known for serious oversupply of theaters and 
seats. A series of articles that appeared in Forbes (Ackman 
2001a, b; Schiffman 2000) estimate that the average capac- 
ity utilization of movie theaters is less than 20%. Although 
the number of screens picks up a large amount of variance 
in box office sales, the inclusion of additional factors that 
influence moviegoers (e.g., WOM) can explain more vari- 
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ance through capturing the increase in capacity utilization 
inside theaters. 

The two measures of WOM, volume and valence, influ- 
ence moviegoers through different cognition—behavior 
routes (Figure 2). As discussed previously, the volume of 
WOM mainly has an informative role to enhance consumer 
awareness.) The greater the volume of WOM, the more 
likely a consumer will be to hear about it. Not surprisingly, 
greater awareness tends to generate greater sales. 


3A few studies have found that the number of arguments in a 
persuasive message may influence persuasion, but the nature of 
this influence depends on situational factors, such as the level of 
involvement and the strength of the arguments. For example, Petty 
and Cacioppo (1984) suggest that the number of arguments serves 
as a simple persuasive cue for low-involvement situations, but a 
larger number of arguments may enhance issue-relevant thinking 
when involvement is high. For WOM in general and for movie 
WOM in particular, the overall degree of consumer involvement 
and the factors that drive it remain to be studied. Moreover, it 1s an 
interesting issue whether the findings regarding multiple argu- 
ments in one persuasive message are applicable to a context simi- 
lar to movie WOM, in which WOM messages come from different 
senders and usually advance different opinions. However, note that 
the informative effect proposed in the literature (e.g., Godes and 
Mayzlin 2004b; Neelamegham and Chintagunta 1999; Zufryden 
1996) always occurs as moviegoers are exposed to WOM. 
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FIGURE 2 
The Impact of WOM Measures on Potential Moviegoers 
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Conversely, the role of WOM valence, because of its 
positive/negative nature, is more of a persuasive one that 
influences consumer attitude. Although it is relatively 
straightforward to postulate that positive WOM enhances 
attitude and negative WOM reduces it, whether this effect 
can be unambiguously transformed into actual box office 
sales is unclear. Indeed, behavioral research has found that 
attitude does not always predict behavior well (Ajzen and 
Fishbein 1980) and that situational factors may influence 
behavior beyond what attitude can explain. 

In the movie market, the link between attitude and 
behavior is further weakened when moviegoing occurs 
impulsively. Moreover, unlike other consumer goods that 
remain in the market for a long time so that consumers have 
sufficient time to act on the desire to purchase them, movies 
are in theaters only for a short time. Along this line, Sawh- 
ney and Eliashberg (1996) distinguish between the time to 
decide whether to watch a movie and the time to act on this 
decision. 

Therefore, a positive relationship between WOM vol- 
ume and box office revenue is postulated. However, because 
of the weak link between attitude and behavior, the relation- 
ship between WOM valence and box office revenue may be 
insignificant. 

H,: WOM has significant explanatory power for box office 

revenue in the subsequent period. Whereas the effect of 


WOM volume should be significantly positrve, that of 
WOM valence may be insignificant. 


Aggregate Box Office Revenue 


The overall performance of movies, as measured by aggre- 
gate box office revenue, is a critical issue in the industry. 
Beyond the weekly effects discussed in the previous sec- 
tion, it is useful to examine whether prerelease WOM (i.e., 
the buzz that accumulates before the opening week) can 
help explain aggregate box office revenue. The importance 
of accumulated buzz before a movie’s release has.attracted 
much interest in both academic research (Elberse and 
Eliashberg 2003) and the popular press (King 2003). If 
WOM is informative about weekly box office revenue and 
if WOM activities can correlate between adjacent weeks, 
prerelease WOM will help explain/predict aggregate box 
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office revenue. In line with previous discussions, the effect 
of the volume of prerelease WOM is expected to be signifi- 
cantly positive, but that of the valence may be insignificant. 

As do studies that examine aggregate sales of movies 
and other entertainment products (Litman 1983; Reddy, 
Swaminathan, and Motley 1998), this study uses factors 
that are constant across the movies’ run as explanatory 
variables. In addition to prerelease WOM, this study links a 
movie’s aggregate box office revenue to factors including 
star power, genre, production budget, critical reviews, and 
the opening strength of the movie.* Eliashberg and Shugan 
(1997) examine the effects of critical reviews on aggregate 
box office revenue and find that though the percentage of 
positive reviews is a significant predictor, the volume of 
reviews is not. The role of star power, genre, and production 
budget has been examined in a few studies, but the results 
are mixed (Basuroy, Chatterjee, and Ravid 2003; Elberse 
and Eliashberg 2003; Litman 1983). 

Opening strength refers to the number of screens in the 
first few weeks of a movie’s run. A wide opening strategy 
may generate higher total sales because greater accessibility 
better appropriates the public interest in a movie that has 
accumulated before release. However, opening wide may 
also merely shift future demand into early weeks if the 
overall demand is not increased. If audiences are not stimu- 
lated enough by a wide opening to go to theaters early and 
if the number of screens drops quickly, total sales will be 
negatively influenced. The business press suggests that 
movies are increasingly opening wide (King 2001). 

H3: Prerelease WOM offers significant explanatory power for 


aggregate box office revenue. However, most of the 
explanatory power comes from its volume, not its valence. 


4Other variables, such as Motion Picture Association of Amer- 
ica (MPAA) ratings, directors, distributors, and whether the movie 
is a sequel, are potential explanatory variables for aggregate box 
office revenue. However, most studies failed to find these variables 
to be significant (e.g., Elberse and Eliashberg 2003; Litman 1983). 
These were also included in the estimation and were found to be 
insignificant as well. Thus, they are not included in the analyses 
reported here. 


Potential Antecedents of WOM 


The double-lined arrows in Figure 1 illustrate the factors 
that are potential antecedents of WOM. These include five 
groups of factors: movie genres, Motion Picture Association 
of America (MPAA) ratings, whether major stars are in the 
cast, critical reviews, and the WOM measures in the previ- 
ous week. Movie genres (e.g., action, drama, horror) clas- 
sify movies on the basis of their storyline, and MPAA rat- 
ings (G, PG, PG-13, and R) classify movies on the basis of 
content, primarily according to the suitability of the movie 
to different audience segments. As a communication 
process, WOM activities can be influenced by the strength, 
inherent interestingness, and possibly the controversy of the 
topic. Movies with certain genres (e.g., action, adventure) 
and MPAA ratings (e.g., R, which requires a parental 
guardian for moviegoers under the age of 17) may be asso- 
ciated with distinctive patterns of WOM behavior, espe- 
cially during the prerelease period when the cuziosity about 
movies is high. 

The phenomenon of star power is substantial in the 
entertainment sector. Movie stars are the frequent topic of 
interpersonal communication among moviegoers. Thus, the 
existence of stars is likely to correlate with WOM activities. 
Because the literature is inconclusive about the role of 
movie stars on box office revenue and because Hollywood 
seems to promote a culture of stars despite the high pressure 
faced by studios to control production budgets (Vogel 
2001), either a confirmation or a disconfirmation of the 
impact of star power on WOM would be valuable. 

Critical reviews are usually published in major newspa- 
pers, in magazines, and, increasingly, on movie Web sites. 
They may influence moviegoers' expectations of a film and 
thus correlate with WOM. However, such influence may 
exist only if the majority of movie audiences read these 
reviews. Because there is no published evidence about the 
size or the demographic features of those who read critical 
reviews, it is difficult to speculate on the significance of the 
correlation between reviews and WOM. Although most 
critical reviews begin circulating before a movie’s opening 
week, the data do not contain information about the exact 
time the reviews became available. To reduce any ambiguity 
in the analysis for the prerelease period, critical reviews are 
included as a potential antecedent of WOM only in a 
movie’s opening week and the weeks after it. 

Extant studies advance the theory that WOM can be 
viewed as a consequence of social networks; thus, it is con- 
tagious and persistent (Banerjee 1992; Brown and Reingen 
1987; Murray 1991). This indicates that on the aggregate 
level, WOM in adjacent weeks will be positively correlated. 
From a behavioral point of view, active WOM during a 


SMovie advertisements also display critical reviews from time 
to time. However, such displays are highly selectiye; only positive 
evaluations, such as “two thumbs up,” are displayed. To the extent 
that consumers understand this selective process, the impact of 
critical reviews through movie advertisements should decrease 
(Taylor 2003). 


period helps promote awareness and interest in a movie; 
therefore, more people will see the movie in the next period 
and subsequently talk about it. This can happen for two rea- 
sons: First, greater WOM makes the memory of the movie 
last longer, thus increasing the likelihood that a moviegoer 
will talk about the movie after seeing it. Second, the per- 
ceived social benefit of talking about a popular subject (i.e., 
movie) tends to be greater than talking about a less popular 
one (Brown and Reingen 1987). 

The data can be used to examine the aggregate effect of 
this temporal correlation. That is, a week with greater 
WOM activities is more likely to be followed by another 
week of active WOM than is a week with less WOM. This 
phenomenon is called a “carryover” effect of WOM, which, 
in the context of this study, refers to the possibility that 
WOM activities in a week correlate with WOM in the sub- 
sequent week.® 


Hy: Movie genres, MPAA ratings, star power, and critical 
reviews are among the potential antecedents of movie 
WOM. A dynamic carryover effect exists between WOM 
activities 1n adjacent weeks. 


Empirical Analysis and Discussion 
of Results 


Data and Varlables 


The WOM data are collected from the Yahoo Movies mes- 
sage board (http://movies.yahoo.com/). There are several 
reasons Yahoo Movies serves as a good source of movie 
WOM. First, it is one of the most popular movie Web sites. 
Variety (Graser 2002) reports that Yahoo Movies has the 
largest number of unique visitors among its Web competi- 
tors. Second, Yahoo Movies requires no access fee for either 
browsing or posting a message. This helps reduce any pos- 
sible bias in the demographic composition of the Web site's 
visitors. Third, the structure of the Web site is well designed 
so that finding and collecting information is straightfor- 
ward, thus reducing possible errors during data collection. 
Finally, WOM messages are archived and indexed numeri- 
cally by the dates when they were posted. Thus, it is possi- 
ble to track the period in the movie's run to which a particu- 
lar message belongs.’ The prerelease period is from the date 


6Note that this carryover effect does not require the volume of 
WOM to keep increasing over time. There are other factors that 
may function as antecedents of WOM, such as weekly advertising 
spending and whether the movie is a sequel. The data do not con- 
tain sufficient information or variance to investigate these factors. 
This is an issue for further research. 

7A few other Web sites (e.g., IMDB.com) also provide user 
comments about movies. However, these Web sites do not attract 
as many visitors as Yahoo Movies (Graser 2002). Furthermore, in 
general, they do not provide functions similar to Yahoo Movies, 
which allows threaded message posting and replying. Collecting 
and organizing the postings from these other sites are more diffi- 
cult. For example, the user comments on IMDB.com are nót num- 
bered, thus increasing the difficulty of an already tedious data- 
processing task. To the extent that no significant difference exists 
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when the first WOM message about a movie is posted to the 
date before the official movie release. It captures the accu- 
mulation of WOM before the movie opens. 

Coding WOM messages into valence is an extremely 
tedious task that poses serious constraint on the amount of 
data that can be studied. To cope with this difficulty, this 
study focuses on movies released during a five-month 
period (May-September) in 2002. After excluding titles for 
which there is no reliable information about all control 
variables, the final data set contains 40 movies, with 12,136 
WOM messages to be coded. 

Three judges independently read each of the 12,136 
messages and assigned them to one of five categories: posi- 
tive, negative, mixed, neutral, and irrelevant. The messages 
classified as positive or negative either show clear overall 
assessment of the movie or provide direct recommenda- 
tions. A message is classified as mixed if it expresses posi- 
tive opinions about some aspects of the movie but negative 
opinions about others and if there is no clear overall assess- 
ment. A message is classified as neutral if it talks about the 
movie but does not provide any positive or negative com- 
ments. Finally, a message is classified as irrelevant if it is 
not related to the movie itself. The three independent cod- 
ings are integrated using the majority rule: If at least two 
judges assign the same category, that category is used for 
the message. If all three judges disagree, the message is 
coded as disagree. The level of disagreement among the 
three judges was very low (approximately 4% for all the 
weeks examined). For example, for WOM posted during the 
opening week, the judges did not achieve a majority agree- 
ment for only 4.1% of all messages. The data require the 


in the demographic profiles of visitors to these movie Web sites, 
using data from Yahoo Movies can be fairly representative. Never- 
theless, it is acknowledged that compiling data from different Web 
sites and examining their differences and the collective influence 
on product sales could be potentially useful. 


use of aggregate measures for WOM valence. Consistent 
with previous studies that involve valence coding (Basuroy, 
Chatterjee, and Ravid 2003; Eliashberg and Shugan 1997), 
the valence of WOM is measured in two dimensions: the 
percentage of positive messages and the percentage of nega- 
tive messages.8 

Other data come from various public sources. The 
weekly box office revenue and number of screens are col- 
lected from Variety magazine, as are the critical reviews. 
Eliashberg and Shugan’s (1997) method to calculate the 
total number of reviews, the percentage of positive reviews 
(CRPRO), and the percentage of negative reviews 
(CRCON) is used. Similar to their data, an extremely high 
correlation exists between CRPRO and CRCON (-.878, p < 
.0001). As a result, these data must be included separately 
in the estimation. Production budget comes from another 
popular movie Web site, The Numbers (see www.the- 
numbers.com), and MPAA ratings and movie genres are 
collected from the Internet Movie Database (http:// 
us.imdb.com). Star power is based on the star list published 
annually by Premier; a movie is classified as having star 
power if at least one of its cast members appears in “The 
Power List 2002.” Table 1 provides the basic summary sta- 


8An alternative method is to use the absolute numbers of posi- 
tive and negative messages, either individually or together with the 
volume of messages. However, note that doing so does not clearly 
separate the volume effect from the valence effect and may pro- 
duce spurious results, because both numbers are significantly cor- 
related with the total volume. For example, using the number of 
negative reviews to explain box office revenue always yields a 
positive sign, indicating that the effect of total volume may domi- 
nate the valence effect. Using the numbers of positive and negative 
messages together with the volume produces an unacceptable vari- 
ance inflation factor, a strong signal for multicollinearity. There- 
fore, in this study, the analysis and discussion are restricted to the 
volume measure and the valence measures that are defined as 
percentages. 


Key Summary Statistics of the Movie Sample 





Varlable 


Box office (aggregate) 

Ratio of opening week to aggregate box office 
Exhibition longevity (in weeks) 

Production budget 

Screens in opening week 

Volume of total WOM messages 

Volume of WOM (prerelease) 

Percentage of positive WOM (prerelease) 
Percentage of negative WOM (prerelease) 
Volume of WOM (opening week) 

Percentage of positive WOM (opening week) 
Percentage of negative WOM (opening week) 
Volume of critical reviews 

Percentage of positive reviews 


TABLE 1 
Medlan M SD 
$41,986,198 $67,005,348 $56,681,186 
43.196 41.096 14.396 
15.0 15.5 4.89 
$48,000,000 $51,650,000 $30,955,344 
2633.5 2518.9 890.3 
139.5 303.4 411.8 
45.5 84.6 89.1 
27.3% 28.5% 12.5% 
9.1% 10.5% 8.2% 
49.0 94.8 134.5 
20.3% 23.8% 11.9% 
13.3% 16.1% 13.5% 
16.5 16.3 4.1 
28.1% 36.9% 28.7% 
32.5% 37.6% 30.6% 


Percentage of negative reviews 


Notes: Total number of movies = 40. Total number of WOM messages analyzed = 12,136. 
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tistics of the sample.? Following the work of Eliashberg and 
Shugan (1997) and Basuroy, Chatterjee, and Ravid (2003), 
and in light of the varying exhibition longevities among the 
40 movies, empirical analyses are conducted for the first 
eight weeks in the movies’ run. This covers most of the 
interesting periods for the movies; the first eight weeks 
account for 97% of the box office revenue in the data. 


Dynamic Features of Movie WOM 


For illustrative purposes, Figure 3 plots how volume and 
valence change over time, beginning with the prerelease 
period (Week 0), then the opening week (Week 1), and so 


5Note that for both WOM and critical reviews, the percentage of 
positive comments and the percentage of negative comments do 
not add up to 100%, because other categories (e.g., mixed) exist 
when the valence measures are constructed. It is also interesting 
that WOM is less often positive or negative than critical reviews. 
This happens because critical reviews tend to give clear overall 
assessments, but WOM is more likely to be ambiguous. 


FIGURE 3 
Dynamic Patterns of WOM for the Movie Sample 


A: The Average Volume of WOM 
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on. Figure 4 further provides plots for two specific movies, 
xXx and Scooby-Doo, to illustrate different patterns of the 
valence measures. 

Figure 3, Panel A, reveals that movie WOM is the most 
active during the prerelease period and the opening week, 
and then gradually decreases. For the 40 movies in the sam- 
ple, most movies (i.e., 23) have their greatest WOM volume 
during the prerelease period, and 15 movies have the most 
active WOM during the opening week. Compared with all 
other periods, the opening week has the greatest volume of 
WOM,; on average, 94.8 WOM messages per movie are 
posted. The volume of prerelease WOM is slightly lower 
but is still very high; on average, 84.6 messages are posted 
during this period. The average volume drops significantly 
to 36.6 in the second week and continues to decrease. 

Therefore, although other information sources are abun- 
dant and most moviegoers have not watched the movies, 
WOM is very active in the prerelease period. It suggests 
that WOM is more of a complement to other information 
sources than a substitute. This finding lends direct support 
to the hypothesis that WOM can be a useful measure to 
explain box office sales. It also implies that instead of being 


FIGURE 4 
Valence of WOM for Movies xXx and 
Scooby-Doo 


Percentage 


Percentage 
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based on actual experiences with a movie, prerelease WOM 
can be fairly speculative in nature. 

Figure 3, Panel B, indicates that the valence measures 
have a different pattern from that of volume. Although there 
are variations in positive and negative percentages from 
week to week, such changes in terms of direction and mag- 
nitude appear to be rather random. Furthermore, the average 
percentage of positive WOM is greater than that of negative 
WOM in the early periods, especially during prerelease. To 
quantify this finding further, each movie is examined indi- 
vidually. During the prerelease period, the percentage of 
positive WOM is greater than the percentage of negative 
WOM by a margin of 5% for as many as 31 movies. These 
margins are within 5% of each other for 8 movies, and only 
in one case (for the movie The Banger Sisters) does the per- 
centage of negative WOM exceed that of positive WOM by 
5%. 

Figure 3, Panel B, also indicates that movie audiences’ 
expectations about a typical movie (before release) are usu- 
ally higher than what they think in the opening week. In 
particular, the percentage of positive WOM begins at a 
comparatively high level during prerelease and drops to a 
lower level in the opening week. At the same time, the per- 
centage of negative WOM begins at a comparatively low 
Jevel during prerelease but increases to a higher level in the 
opening week. Among the movies in the sample, the WOM 
for about half (19 movies) drops by more than 5% from pre- 
release to the opening week. The other movies either have 
comparable levels of positive WOM between the two peri- 
ods (11 movies) or have more positive WOM during the 
opening week (10 movies), and 17 movies have their nega- 
tive WOM increase by more than 5% from prerelease to the 
opening week (e.g., xXx), whereas only 6 movies have 
noticeable decreases in negative WOM (e.g., Scooby-Doo). 

Thus, there is support for H,, which postulates that 
WOM tends to be positive in the prerelease period but 


becomes less so in the opening week. In addition to illus- 
trating the confirmation or disconfirmation theory of expec- 
tations in the context of movies, these results imply a poten- 
tial drawback of intensive advertising. That is, advertising 
may generate awareness for new products, but it may also 
attract nontarget consumers who are more likely than others 
to spread negative WOM. In the movie market, these are the 
people who normally would not like a particular movie. 
This problem is of greater concern when retailers (theaters) 
would have kept the product longer if WOM were more 
positive. As a result, seeking ways to make advertising 
more targeted should be of considerable value. Without the 
concern about negative WOM, a firm would want the 
“wrong” buyers to buy because they still generate addi- 
tional revenue. 

The remaining part of this section discusses three sets of 
analyses of how the measures of WOM help explain weekly 
and aggregate box office revenue and what constitutes 
potential antecedents of WOM. To cope with potential non- 
linearity, double log specifications are used for the regres- 
sion equations; natural logarithms are taken for variables 
that could take unbounded positive values. The terms 
LNMSG,, POSIPER,, and NEGPER, are used to denote vol- 
ume (in natural logarithms) and valence (in percentages) of 
WOM in week t (t = 0 indicates the prerelease period). Two 
weekly variables are included to control for the effects of 
competition; one is the number of new releases among the 
top 20 movies, and the other is the average age, measured in 
weeks, of the top 20 movies (for a discussion about why 
these factors may capture the effect of competition, see 
Elberse and Eliashberg 2003). In all estimations, multi- 
collinearity is tested for by calculating the variance inflation 
factor. The variables used produced acceptable variance 
inflation factor values. Table 2 lists all the variables, and 
Table 3 provides the correlation matrix among WOM mea- 
sures for a movie's prerelease and opening week. Similar 


TABLE 2 
List of Variables Used in the Analyses 


Variable Name 


t (subscript) 
CONST 
CRCON 
CRPRO 
GACADV 
GCOMEDY 
GSCIFI 
LNAGE, 
LNCRITIC 
LNMSG, 
LNNEW, 
LNPROD 
LNREV, 
LNSCHRN, 4 
LNSCRN, 
MPAA-PG 
MPAA-R 
NEGPER, 
POSIPER; 
STAR 


Meaning 


Week number In movie's life span (t = 0 is prerelease) 
The constant intercept in the regression models 


Percentage of negative critical reviews 
Percentage of positive critical reviews 
Movie genre is action/adventure 
Movie genre is comedy 
Movie genre is science fiction 


Average age of the top 20 movies in week t (in natural log) 


Number of critical reviews (In natural log) 


Number of WOM messages !n week t (In natural log) 
Number of new releases among the top 20 movies In week t (In natural log) 
Value of production budget of the movie (In natural log) 


Box office revenue in week t (in natural log) 


Number of screens In the first four weeks after opening (in natural log) 


Number of screens in week t (in natural log) 
Movie Is rated PG in MPAA ratings 
Movie is rated R 1n MPAA ratings 
Percentage of negative WOM In week t 
Percentage of positive WOM in week t 
There are major stars In the movie 
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! TABLE 3 
Correlation Matrix for Prerelease and Opening Week WOM 


LNMSGy; POSIPER; 
LNMSOGg 1.000 = 
POSIPER ) —.312 1.000 
(.050) | 
NEGPERg .026 — 265 
(.873) (.098) 
LNMSG, .764 —.034 
(.000) (.835) 
POSIPER, .053 .211 
(.746) (.191) 
NEGPER, —.216 
(.180) (.825) 


Notes: p values are in parentheses. 


patterns exist for the remaining weeks. The most significant 
correlation exists between LNMSG, and LNMSGop, imply- 
ing the possibility of a carryover effect. 


WOM and Weekly Box Offlce Revenue 
To examine the weekly effects of WOM, the following 
model is estimated: 

( | LNREV, = CONST +a,LNSCRN,, + a,LNMSG 


ıt- I 


+a3POSIPER, , _ | +a,NEGPER 


1,t — 1 
+ asLNCRITIC, + agCRPRO, + a;LNNEW, 
+agLNAGE, +€,, 


for each week separately (t = 1, 2, ..., 8), where 1 indexes 
the movies (i = 1, 2, ..., 40) and is omitted hereafter to 
avoid notation clutter. The covariates include the number of 
screens (LNSCRN,), critical reviews (LNCRITIC, 
CRPRO), and the degree of competition (LNNEW,, 
LNAGE,). This specification is similar to that of Eliashberg 
and Shugan (1997) and Basuroy, Chatterjee, and Ravid 
(2003), except for the additional (lagged) WOM measures 
and the two competition variables.10 

Table 4 reports the weekly regression results. All the 
regressions achieved high levels of model fit, and all the 
significant parameters have expected signs. For the opening 
week, the joint F test shows that the measures of WOM 
have significant explanatory power for box office revenue. 
However, whereas the volume of WOM (LNMSGp) is 
highly significant (p « .001), the valence measures 
(POSIPER, and NEGPER)) are not. In other words, almost 
all the explanatory power that WOM brings to the estima- 


I0OIn an effort to test and control for any effects that might be 
caused by unobservable variables, the estimation of a set of movie- 
specific information, such as movie genres, MPAA ratings, pro- 
duction budget, whether a movie 1s a sequel, and whether there are 
major stars or directors, is included. However, these variables are 
mostly insignificant. Thus, they are omitted from the final model 
of estimation. 


NEGPER, 


LNMSG, POSIPER, NEGPER, 

1.000 = = - 
—.007 1.000 2 pu 
(.965) 
—.197 —.025 1.000 - 
(.223) (.880) 

267 —071 —426 1.000 
(.096) (.664) (.006) 


tion comes from its volume but not its valence. The estima- 
tion for the next four weeks (Week 2—Week 5) yields the 
same results; that is, the volume of WOM (LNMSG, _ ;) is 
highly significant, but the valence (POSIPER,_, and 
NEGPER, _ ;) is not. At Week 6, the effect of volume 
becomes marginal and then disappears. This happens 
because WOM activities are gradually reduced as people 
lose interest in a movie during subsequent weeks. Thus, the 
increasingly sparse and irregular WOM fails to provide 
informative measures for weekly box office revenue. For 
example, there are only 5.1 WOM messages posted per 
movie in Week 6, compared with 94.8 in the opening week. 
As a result, H, is supported, but only for the early weeks. 
This is particularly enlightening because most of the box 
office sales occur in these weeks; for the movies in the data, 
the first five weeks contribute to 91.396 of aggregate box 
office revenue.11 

To examine the robustness of these results, especially 
the insignificance of valence, which might be particularly 
surprising, several further analyses were conducted. A typi- 
cal reason for the lack of significance is that the sample size 
may not have been sufficiently large. Whereas the number 
of movies that can be analyzed is practically constrained by 
the labor-intensive coding process, the sample size is still 
comparable to those of extant studies. For example, Godes 
and Mayzlin (2004b) analyze 41 television shows; 
Lehmann and Weinberg (2000) use 35 movie and video 
titles; Eliashberg and Shugan (1997) analyze 56 movies; 
and Dellarocas, Awad, and Zhang (2004) use 25 movies in a 
ratings survey and then 80 movies for box office analysis 
(40 movies for estimation and 40 for prediction). Neverthe- 
less, a pooled regression was conducted to examine whether 


11A question here is whether the volume of WOM is merely 
picking up the effects of advertising. To examine this issue, data 
on the advertising budget for each movie were collected and 
included in weekly regressions with WOM. The estimation results 
remain similar, and the volume of WOM is still highly significant. 
Thus, novie WOM offers explanatory power for weekly box 
office revenue beyond that of advertising. 
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TABLE 4 
The Effects of WOM Communication on Weekly Box Office Revenue 


Week (t) 1 2 3 4 5 6 7 8 
CONST 7.273 9.346 6.449 8.171 6.469 4.981 .594 4.067 
(.000)** (.000)** (.000)** (.000)** (.000)** (.015)** (.000)** (.817) 
LNSCHN, .566 .401 .673 .985 973 1.050 1.379 1.185 
(.000)** (.006)** (.000)** (.000)** (.000)** (.000)** (.000)** (.000)** 
LNMSG; -1 .592 345 .366 275 387 .169 —.088 .144 
(.000)** (.000)** (.000)** (.005)** (.000)** (.0103) (.449) (.362) 
POSIPER,. , .784 —,300 .812 .186 342 045 —.185 118 
(.257) (.674) (.133) (.603) (.312) (.910) (.497) (.715) 
NEGPER,. ; .059 —1.369 —.610 .385 —.173 —.265 = 
(.952) (.092)* (.264) (.309) (.136) (.620) (.399) (.655) 
LNCHITIC ./59 1.106 .894 343 421 .657 .358 .561 
(.018)** (.001)** (.020)** (.323) (.137) (.192) (.228) (.369) 
CRPRO .695 .398 .688 ./28 ./36 .340 .386 307 
(.019)** (.235) (.076)* (.047)™* (.009)"* (.336) (.288) (.454) 
LNNEW —.111 .005 .153 —.251 —.136 —.086 .073 —.112 
(.505) (.984) (.494) (.127) (.423) (.738) (.711) (.682) 
LNAGE .146 —.339 —1.378 —.681 _ —1.510 060 
(.760) (.516) (.972) (.045)** (.140) (.964) (.097)* (.963) 
Model fit F 44.87 16.86 22.70 27.50 50.37 23.58 69.04 22.45 
(p value) (.000) (.000) (.000) (.000) (.000) (.000) (.000) (.000) 
Adjusted R2 .902 ./65 .820 .855 .925 .862 .954 .905 
Joint F test of 14.01 9.64 10.87 4.01 10.48 1.07 .43 53 
WOM (.000) (.000) (.000) (.017) (.000) (.382) (.735) (.672) 
measures 
*p « .10 
**p « .05. 


Notes: The dependent variable is LNREV;. p values are in parentheses. 


the insignificance of WOM valence can be attributed to 
sample size. To do so, all the weekly data of the 40 movies 
were pooled to regress weekly box office revenue on the 
explanatory variables. With 378 observations, the pooled 
regression achieved a high level of model fit (adjusted R? = 
.05). However, the results remain the same; that is, the vol- 
ume of WOM is highly significant, but valence is not.!2 

It is also possible that the valence of WOM works dif- 
ferently for movies that receive mostly positive WOM and 
those that receive mostly negative WOM. A relevant argu- 
ment is the diminishing impact of incremental WOM mes- 
sages. If the overall WOM information about a movie is 
positive, the addition of another positive message may not 
be as influential as the addition of a negative one. The 
reverse is true for a movie that has overall negative WOM. 
Thus, estimating all movies together might have neutralized 
the impact of the valence measures. This argument moti- 
vated an examination of the impact of WOM by conducting 
separate regressions on different movie groups (especially 
those that received mostly positive WOM and those that 


I2Note that a pooled regression, though it offers a greater num- 
ber of observations and thus increases the chance of a parameter to 
be significant, has its limitations. It estimates the average effect 
across all periods. Thus, it is not particularly useful if there is a 
significant difference of the estimated effects in different periods. 
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received mostly negative WOM). The results of WOM vol- 
ume and valence remain unchanged. 

In addition to the examination of weekly box office 
revenue, an analysis was conducted for aggregate box office 
revenue using prerelease WOM and other control variables; 
valence measures were again found to be insignificant. 
Finally, various model specifications, such as linear and 
semilog, were examined, and all lead to qualitatively simi- 
lar results. 

Therefore, the results confirming H, appear to be fairly 
robust. They suggest that though WOM is a useful informa- 
tion source in the movie market, it correlates with box 
office revenue mainly through an informative role on 
awareness but not a persuasive path of attitude changes. In 
addition to the theoretical reasons discussed previously 
regarding the weak correlation between attitude and behav- 
ior, whether a person likes a particular movie is (to a large 
extent) an issue of personal taste, which lends additional 
support to this finding. It is clear that WOM increases 
awareness among potential moviegoers, which is confirmed 
by the highly significant effect of volume, but when it 
comes to deciding whether to watch a particular movie, to 
what extent should another person's subjective assessment 
be reflective of the WOM recipient's own taste? If the belief 
that a bad (good) movie for one person may be enjoyable 
(not enjoyable) to another is sufficiently strong, or if the 
WOM receiver attributes the valence of WOM to the taste 
of the message sender (see Eagly, Wood, and Chaiken 


[1978] and other studies on attribution theory), the valence 
will lose much of its influence. This explanation is consis- 
tent with several previous studies on WOM. For example, 
Russell, DeCarlo, and Ramaswami (2001) find that when 
message receivers attribute the negativity of WOM to the 
communicator, brand evaluations actually increase. 

The finding that WOM operates mainly through 
increased awareness and the argument that the experiential/ 
intangible nature of movie consumption enhances the need 
for WOM lead to an interesting prediction; namely, the 
effect of WOM on to-be-released sequel movies should be 
less than that on nonsequels because moviegoers have more 
information about and experience with sequels as a result of 
the mostly successful prequels. This difference should be 
the most obvious in the opening week when box office reve- 
nue is the greatest and when the WOM effect is highly 
Significant. 

To examine this prediction, regressions similar to those 
reported in Table 4 could be conducted for sequel and non- 
sequels separately. However, the small number of sequels in 
the data (there were only 3) prevents estimations for com- 
pletely separated samples. Thus, a different test was used. 
In particular, the opening-week regression was conducted 
for nonsequels only (37 movies), and whether the effect of 
volume was greater than that based on all 40 movies was 
observed. The parameter LNMSG, _ ; was .6044 for non- 
sequels but .5917 for all movies. Because the estimations 
are based on double log specifications, this seemingly small 
difference actually implies that the average: volume of 
WOM during prerelease yields 6% more box office revenue 
in the opening week for nonsequels than for all movies 
combined (i.e., 84.6 6044 - 5917 = 1.06). For the average 
opening-week revenue of $27.01 million in the sample, this 
is a $1.62 million difference. This difference is especially 
significant considering that the test is less powerful than a 
direct comparison between sequels and nonsequels and that 
it is based on merely 3 of the 40 movies. 

For the early weeks, inclusion of the WOM measures 
produced noticeably better model fit than the previous stud- 
ies that used only the remaining explanatory variables. 
From Week 1 to Week 5, the average enhancement of model 
fit measured by adjusted R2 is 17%. 


WOM and Aggregate Box Office Revenue 


Table 5 presents the results of regressing aggregate box 
office revenue (LNGROSS) on prerelease WOM and other 
covariates. The opening strength of movies is measured as 
the total number of screens in the first four' weeks; this 
accommodates the industry practice of adopting a minimum 
number of screens in exhibition contracts for the first few 
weeks (for similar issues, see De Vany and Walls 1999) and 
reduces any possible biases associated with the movies that 
do not achieve the widest distribution in the opening week. 
The LNSCRN, 4 turns out to be highly significant; the 
wider distribution a movie has in the first four weeks, the 
greater the aggregate box office revenue will be. The per- 
centage of positive reviews (CRPRO) has a significant, 
positive impact. The volume of critical reviews 
(LNCRITIC) is also significant, but only marginally. These 


TABLE 5 
Prerelease WOM and the Aggregate Box Office 


Revenue (LNGROSS) 

Independent Parameter t-Value 
Variables Estimates (p Value) 
CONST 8.875 3.95 (.000)** 
LNSCHN,. 4 516 2.50 (.018)** 
LNPROD —.015 —.10 (.921) 
LNMSGg .589 4.65 (.000)** 
POSIPERg 342 .44 (.667) 
NEGPER —.515 —.44 (.665) 
LNCRITIC .579 1.74 (.093)* 
CRPRO 1.069 3.21 (.003)** 
STAR .168 .96 (.346) 
GSCIFI —.194 —.84 (.409) 
GACADV —.164 —,86 (.394) 


Joint test (LNMSG, = POSIPER, = NEGPER, = 0): F = 
9.49 (p « .001) 
Model fit: F = 13.79 (p « .001), adjusted R2 = .77 


*p « .10. 
**p < .05. 


results are similar to those that Eliashberg and Shugan 
(1997) report. Movie stars (STAR) or movie genres are not 
found to be informative of aggregate box office revenue. 
The three measures of WOM are highly significant as a 
group, confirming the value of using prerelease buzz to 
explain overall box office performance. As with the weekly 
analysis, whereas the volume of WOM (LNMSGp) is the 
most significant among all explanatory variables, the 
valence measures (POSIPER$ and NEGPER») are not sig- 
nificant. Thus, H3 is strongly supported. An important 
managerial value of this finding is that the amount of pre- 
release buzz can be a useful indicator of the ultimate box 
office performance. The effectiveness of box office forecast 
models (e.g., Sawhney and Hliashberg 1996) can be 
improved by including actual prerelease WOM information. 


Potentlal Antecedents of WOM 


As discussed previously, prerelease WOM can be related to 
three groups of factors: movie genres, MPAA ratings, and 
whether major stars are in the cast. Beginning with the 
opening week, WOM can also be related to critical reviews 
and the WOM in the previous week. Because volume is the 
most critical factor that correlates with box office revenue, 
it is the focus of the subsequent analyses. Analyses for the 
valence measures (POSIPER, and NEGPER,) were also 
conducted, but they could not be consistently explained by 
various models in almost all the periods. Table 6 summa- 
rizes the results. 

All the regressions for prerelease and the eight weeks 
achieved significant model fit. During the prerelease period, 
it appears that action and adventure movies tend to receive 
higher-than-average WOM communication, whereas R- 
rated movies tend to receive less. Beginning with the open- 
ing week, the only explanatory variable that is consistently 
significant 1s the volume of WOM in the previous week 
(LNMSG, _ |). This shows the carryover effect such that a 
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TABLE 6 


Potential Antecedents of the Volume of WOM 





O(Pre- 1 (Open- 
Week (t) release) ing Week) 2 3 4 5 6 7 8 
CONST 3.817 615 —243 1.380 —456 -4.320 4.086 4.351 .066 
(.000)** (.680) (.868) (.323) (.838) (.164) (.133) (.326) (.911) 
GACADV 555 —.143 —.075 —448 —.152 .295 —.181 .340 — 
(.045)* (.615) (.764) (.046)**  (.688) (.444) (.549) (.467) 
GCOMEDY —.330 ~.358 343 —.633 .458 .136 .140 .601 = 
(.279) (.278) (.231) (.008)* (.284) (.712) (.734) (.496) 
MPAAPG 668 —.684 343 418 —.303 618 095 -1.823 —.834 
(.104) (.141) (.364) (.192) (.597) (.258) (.892) (.237) (.197) 
MPAAR —913 —.322 758 —,346 .408 —.047 —.098 .225 —452 
(.009)** (.870) (.028)* | (.165) (.377) (.913) (.811) (.720) (.463) 
STAR 408 343 —.109 .469 .136 —.145 .806 .558 — 
(.162) (.269) (.694) (.068)* (.765) (.762) (.095)* (.489) 
LNMSG,. , = 1.169 .929 .715 .986 1.037 .625 .915 .595 
(.000)** (.000)* | (.000)*  (.000)*  (.000)* (.000)** (.001)*  (031)* 
POSIPER, .. , = 1.997 —1.148 1.149 .102 1.984 -1.888 1.206 2.067 
(.125) (.309) (.114) (.917) (.048)**  (014)*  (.340) (.049)** 
NEGPER,. ; m 2.405 —2.085 . -—1.988 —.735 1.889 —.698 2.603 512 
(.190) (.098)* (.008)* | (.602) (.041)** — (236) (.349) (.781) 
LNCRITIC — —,834 —115 ~.442 .039 808 -1.375 -2.211 = 
(.132) (.810) (.367) (.963) (.476) (.203) (.275) 
CRPRO = .792 .302 .897 ~.965 .929 1.099 1.759 — 
(.165) (.573) (038)* (214) (.182) (.220) (.373) 
Model fit F 4.36 7.53 9.78 16.22 3.69 5.78 5.75 3.41 2.89 
(p value) (.004) (.000) (.000) (.000) (.006) (.001) (.003) (.094) (.067) 
Adjusted R2 301 626 698 813 .464 .648 .683 .617 .371 
*p « .10. 
"*p < .05 


Notes: The dependent varlable is LNMSG;. p values are In parentheses. In Week 8, movie genre, star power, and critical reviews are excluded 


because of high multicollineartty. 


week of active WOM is likely to be followed by another 
week of active WOM. However, consistent patterns linking 
WOM valence in a previous week to WOM volume in the 
current week were not found. Table 6 also indicates that star 
power is insignificant in explaining the volume of WOM 
during the prerelease period and most of the subsequent 
weeks. In addition, WOM is not significantly influenced by 
critical reviews, confirming the previous result that WOM 
and critical reviews are more of complements to each other 
than substitutes.1? 

Regarding H4, consistent support was found for the car- 
ryover effect. The effects of other factors (e.g., movie genre, 
MPAA ratings) are mixed and mostly insignificant. Man- 


I3Although the existence of movie stars and the measures of 
critical reviews do not significantly predict WOM, there are 
caveats regarding the roles of stars and movie critics. For example, 
some movie stars and critics may be on television or radio shows 
to talk about a to-be-released movie, which may lead to greater 
WOM. Furthermore, some stars may have popular Web sites, 
which also may influence WOM. An indicator variable of star 
power, albeit a typical approach ın the literature, may not com- 
pletely capture the effects of these scenarios (e.g., whether a par- 
ticular movie has been promoted by its stars on popular television 
shows). 
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agerially speaking, these results point to the challenge that 
studios may face when trying to create and manipulate 
WOM by working on factors such as star power and movie 
genres (Rosen 2000). They also point to the importance of 
managing WOM early on, preferably beginning in the pre- 
release period. After WOM is created, a high volume of 
WOM, which is beneficial to sales, tends to positively affect 
the volume of WOM in subsequent periods. 


Summary, Managerial Implications, 
and General Discussion 


This study contributes to the movie-marketing and WOM 
literature by examining movie WOM. Among other things, 
the mechanism through which WOM functions and the 
extent to which WOM helps explain box office sales are 
investigated. The data and analyses show that movie WOM 
is active during a movie's prerelease period and that WOM 
is more of a complement to other information sources than 
a substitute. Movie audiences tend to hold comparatively 
high expectation before a movie is released but become 
more critical in the opening week. The volume of WOM 
(but not the valence) offers significant explanatory power 
for box office revenue, both in the aggregate and for the 
early weeks, in support of a theory that WOM functions in 
the movie market primarily through an informative effect 
on awareness. 


The finding that WOM is very active during the pre- 
release period implies that at least for entertainnient goods, 
such as movies, WOM communication about a to-be- 
released new product may not depend on actual experience. 
Many potential users talk about the product on the basis of 
speculations. In this sense, managing buzz should be treated 
as a multidimensional task that extends beyond the new 
product itself. That is, management needs to be conscious 
about additional things that influence speculations, such as 
the firm’s other products (both new and extant), the firm’s 
reputation, various events, and possibly “rumors” that occur 
during the new product development process. . 

These findings highlight the need for studio and theater 
managers to observe and respond to WOM communications 
actively, especially during a movie's early weeks when 
most of the box office revenue is generated. However, what 
ultimately matters is the volume of WOM. It may not be 
critical to track information similar to the valence measures. 
Practically, volume can be measured either directly by a 
precise number, as this study does, or indirectly by proxies, 
such as the degree of public interest a movie receives, how 
widespread the communication occurs across demographic 
groups, and so on. These findings offer considerable sup- 
port to the idea that firms should try to create active WOM 
communication among potential users (Godes and Mayzlin 
2004a; Rosen 2000). 

Another important managerial value of these findings is 
that WOM can be a useful measure to help forecast box 
office sales. Among other things, good forecasts help stu- 
dios and theaters better plan screening capacity and poten- 
tially optimize exhibition contracts. However, it is not the 
purpose of this study to develop a full forecasting system. 
Rather, this study can be used to illustrate how the perfor- 
mance of a forecasting model can be improved by including 
actual WOM information. To do so, this study used the set 
of independent variables and adopted the standard 
calibrating-forecasting approach. Because the sample size 
makes it difficult to split the data into an estimation sample 
and a prediction sample, a rotating method was used to con- 
duct the calibration and forecasting tasks. Specifically, each 
of the 40 movies 1s taken in turn as the movie to be pre- 
dicted, while the other 39 movies are used in the calibration 
process to generate parameter estimates. The absolute per- 
centage errors of the 40 predictions are then averaged as the 
indication of prediction effectiveness. The calibration pre- 
diction is conducted for both aggregate box office revenue 
and the opening-week box office revenue. The WOM mea- 


sures are included in the proposed model but not in the 
benchmark model. Other independent variables are the 
same as those that appear in Tables 4 and 5. 

This analysis shows that including WOM in the fore- 
casting system significantly reduces forecasting errors by 
31% for a movie's opening week (from a 55% absolute 
forecasting error with the benchmark model to 3896 with 
the proposed model) and 2396 for the aggregate box office 
revenue (from 61% to 47%). Different from most extant 
studies, these forecasts are made purely on the basis of the 
prerelease information of a new movie. This is probably the 
most difficult forecast task for movies (Sawhney and 
Eliashberg 1996). It is similar to Sawhney and Eliashberg's 
(1996) BOXMOD-I model when no revenue data for the 
new movie are available and a meta-analysis procedure is 
employed. Sawhney and Eliashberg report an average 
71.1% prediction error for the aggregate box office revenue. 
Although the actual improvement through incorporating 
WOM depends on what forecasting system is eventually 
used, the added value of WOM appears to be significant. 

As in previous studies (Basuroy, Chatterjee, and Ravid 
2003; Eliashberg and Shugan 1997; Godes and Mayzlin 
2004b), the current analysis focuses on statistical correla- 
tions between variables and uses these temporal relation- 
ships to infer how WOM may influence box office sales. 
Although these correlations offer interesting insights and 
useful managerial implications, further research might 
investigate the degree of causality, perhaps with more disag- 
gregated data and different methodologies. In this regard, 
lab experiments could be a useful approach to examine the 
relationships such as those that Figure 2 illustrates. 

Online user-generated information offers unique oppor- 
tunities for research in marketing. Such data come in differ- 
ent formats, which render them suitable for different 
research purposes. Some data, such as the WOM messages 
used herein, are actual individual postings. Although their 
unstructured nature provides great flexibility for the analy- 
sis, coding them can be a difficult task. Other data, such as 
online user ratings, contain less flexibility but make it easier 
to conduct large-scale cross-sectional studies. Another issue 
involved with this growing literature is how similar the 
online information is to what occurs in the physical world. 
There is, evidence that the degree of such similarity can be 
very high (Dellarocas, Awad, and Zhang 2004). Neverthe- 
less, further research examining the difference between 
online user data and that in the physical context appears to 
have potential. 


————————————————————Á—————— —!A————— Á—— A JÓ————— mH 
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Jan B. Heide & Kenneth H. Wathne 


Friends, Businesspeople, and 
Relationship Roles: A Conceptual 
Framework and a Research Agenda 


This article develops a conceptual framework of roles in marketing relationships. Drawing on emerging theory from 
economic sociology and March’s (1994) notion of decision “logics,” the authors discuss two prototypical relation- 
ship roles, namely (1) a “friend” who uses a “logic of appropriateness” and follows established rules, and (2) a 
“businessperson,” whose decisions are guided by utility-maximizing considerations under a “logic of conse- 
quences.” Next, they use extant theories of interfirm governance to suggest that firms’ relationship strategies can 
be used both to create different relationship roles in the first place and to activate them over time. The authors 
posit that activation can have several different outcomes, including reinforcement of an existing dominant role or 
actual switching to a new one. Theoretically, the conceptual framework allows for integration of different perspec- 
tives on interfirm relationships, some of which have provided seemingly inconsistent accounts of firm behavior. 
From a managerial perspective, the framework identifies specific matches between particular relationship roles on 
the one hand and firms’ governance strategies on the other hand. In general, the framework suggests that an 
in-depth understanding of roles is a prerequisite for the deployment of relationshlp management initiatives toward 


resellers, customers, and suppliers. 


orientations that exchange partners bring to bear on 
their relationships. In branches of the new institu- 
tional economics literature, decision makers’ assumed prin- 
ciple of action is utility maximization, sometimes to the 
point of “self-interest seeking with guile” (e.g., Williamson 
1985, p. 47). This assumption has produced an extensive 
body of research on so-called governance mechanisms (e.g., 
monitoring and incentive devices), which have the pre- 
sumed ability to manage parties’ inherent “calculative” ori- 
entations (see, e.g., Rindfleisch and Heide 1997). In con- 
trast, other streams of literature, particularly in sociology, 
have been overtly critical of such an “undersocialized” view 
of exchange, arguing that relationship behavior follows 
from rules or “heuristics,” which are applied regardless of 
their economic consequences (e.g., Granovetter 1985; Uzzi 
1997). 
Notably, although these economic and sociological per- 
spectives provide different accounts of relationship behav- 


Av body of literature exists on the general 
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ior, they actually share a fundamental underlying assump- 
tion, namely, that exchange partners are “unitary actors" 
who play a single role over time and consistently make 
decisions according to a fixed pattern (i.e., in a calculative 
or heuristic manner). However, this assumption has been 
challenged. For example, Montgomery (1998) argues that 
attributing global characteristics to exchange partners, 
either calculative or heuristic, is potentially misleading. 
Using a “prisoner’s dilemma” game as an illustration, he 
suggests that the “players” are not individuals with fixed 
traits or predispositions, as is often assumed, but rather col- 
lections of roles. In his analysis, Montgomery distinguishes 
between two role archetypes: (1) a “businessperson,” whose 
decisions are guided by utility maximization considerations 
and who will “defect” if such a move is individually prof- 
itable, and (2) a “friend,” who will cooperate as a matter of 
principle. ! 

In this article, we build on this general view of relation- 
ship parties and interactions to develop a conceptual frame- 
work of roles in marketing relationships. Our framework 
explicitly accounts for the possibility that different orienta- 
tions (calculative and heuristic) and the associated roles 
(businessperson and friend) can coexist and that switching 
among them is both possible and likely. Moreover, we argue 


I'We acknowledge that these particular labels carry surplus 
meaning and can be misleading. Any given friend is unlikely to be 
devoid of task orientation, and real-life businesspeople frequently 
show exemplary service. Nevertheless, we rely on these particular 
terms for two reasons: First, they have been used in the literature 
(e.g., Montgomery 1998) as “ideal type" (in a Webenrian sense) 
theoretical constructions. Second, field studies have shown that 
these specific labels are used in actual industry settings. 
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that firms’ governance strategies (e.g., socialization and 
monitoring) are capable of (1) creating relationship roles, 
(2) reinforcing existing roles over time, and (3) producing 
role switching (e.g., from friend to businessperson). 

From a theoretical standpoint, our framework provides a 
novel perspective on relationship management. As we dis- 
cuss subsequently, our perspective can explain empirical 
results that run counter to established governance theory; 
for example, certain governance mechanisms actually pro- 
mote rather than suppress opportunistic behavior (e.g., John 
1984). 

The role concept also has important practical implica- 
tions. Ridgway (1957, p. 465) noted this early on, claiming 
that “the establishment and maintenance of the 
manufacturer-dealer system itself—that is, the provision of 
mutually satisfying roles in the relationship—is a major 
administrative responsibility.” Importantly, although indus- 
try observers have since made a case for the importance of 
relationship roles (e.g., Normann and Ramirez '1993), spe- 
cific guidelines for decision making are almost absent from 
the literature. Frazier (1999, p. 235) has argued that (1) "lit- 
tle is known about the specification of channel roles" and 
(2) “many manufacturers make role decisions very poorly." 

We organize the article as follows: We provide a brief 
synthesis of existing theoretical perspectives and propose a 
new decision-theoretic conceptualization of relationship 
roles. Then, we present our conceptual framework, discuss 
the implications, and suggest areas for further research. 


Role Theory: Origins and Emergent 
Views 


According to Biddle (1986, p. 68), role theory began as a 
“theatrical metaphor.” Over time, the general domain of role 
theory has become the "study of behaviors that are charac- 
teristic of actors within contexts" (Biddle 1979, p. 56). 
Although several different approaches to role theory exist 
(e.g., Kahn et al. 1964; Mead 1932; Sherif 1936), the two 
branches, known as functional (e.g., Bates and Harvey 
1975; Parsons 1937) and structural (e.g., Burt 1982; Win- 
ship and Mandel 1983) role theory, have historically 
enjoyed some prominence and have also received attention 
in the marketing literature. For example, applications of 
functional role theory in marketing are evident in the use of 
terms such as “wholesaler” and "retailer" to describe par- 
ticular channel positions with associated functions and 
activities (e.g., El-Ansary and Stern 1972). Structural role 
theory has been used in marketing to describe the properties 
of distribution networks and their interaction patterns (e.g., 
Anderson, Hákansson, and Johanson 1994; Iacobucci 
1996). In general, however, despite Stern's (1969) early 
call, role theory has not been extensively used in research 
on interfirm phenomena.? 

Although conventional role theory has generated impor- 
tant insights, it has also been criticized (e.g., Powell and 


2Role theory has been used more extensively in research on 
intraorganizational phenomena (e.g., Behrman and Perreault 1984; 
Churchill et al. 1985; Singh 2000; Webster and Wind 1972). 


Smith-Doerr 1994; Salancik 1995) for, among other rea- 
sons, not going far enough in terms of articulating the prop- 
erties of the relevant parties and their decision-making pat- 
terns. For example, Blau (1993) notes that a party's actual 
behavior in a relationship is not necessarily isomorphic with 
its position. This is consistent with Larson's (1992) obser- 
vation that standard role designations, such as "principal" 
and "agent," often provide little information about the 
actual properties of a set of parties and their relationship. 

Emerging role theory has attempted to address these 
limitations. In this article, we draw on recent role research 
in economic sociology (e.g., Montgomery 1998) and define 
an organizational role as an organizational "identity" or a 
"collective mind" (Kohli and Jaworski 1990), which pro- 
vides the foundation for shared perceptions and coordinated 
decision making (Albert and Whetten 1985; Messick 1999; 
Tajfel and Turner 1986). Our focus is on two particular 
archetypes of roles or identities—friend and business- 
person—which differ systematically in their decision- 
making patterns. 


Role of a Friend: The Logic of Approprlateness 


Most versions of role theory presume that expectations, 
broadly defined, are major determinants of role behavior 
(Biddle 1986). The term "expectation" is commonly used in 
a prescriptive sense, referring to behaviors that "ought to" 
be performed (Turner 1974). In certain branches of role 
theory (e.g., Montgomery 1998), the term "rule" 1s used to 
describe how expectations guide organizational decision 
making. 

Montgomery's (1998) role-theoretic arguments are 
derived from the work of March (1994), who argues that 
decision makers explicitly or implicitly ask three key basic 
questions. These questions pertain to (1) recognition 
(“What kind of situation is this?" ), (2) identity (“What kind 
of organization is this?"), and (3) rules ("What does an 
organization such as this do in a situation such as this?"). 
Conceptually, the first two questions involve categorization 
tasks (Turner et al. 1987)—for example, the categorization 
of a firm's exchange partner and the firm itself—and the 
third question involves the matching of behaviors to cate- 
gories (Forgas 1982). For example, a distributor that agrees 
to carry additional inventory for a manufacturer may be cat- 
egorized as "cooperative" and, subsequently, may be 
granted "gold partner" status. 

Research in contract law and marketing (e.g., Dwyer, 
Schurr, and Oh 1987; Heide and John 1992; Macaulay 
1963) describes how rules of behavior in interfirm relation- 
ships may manifest themselves as norms. For example, a 
supplier and a buyer may develop a norm of flexibility, 
which specifies how certain aspects of the relationship will 
be modified under changing circumstances. March (1994) 
uses the term “logic of appropriateness” to describe such a 
rule-driven decision-making mode. 

Montgomery (1998) uses March’s (1994) logic of 
appropriateness to coin a prototypical relationship role, 
namely, that of a friend. Using a game-theoretic analogy, 
Montgomery defines a friend role in terms of a tendency to 
play a game based on fixed rules that prescribe cooperative 
moves, even when such moves undermine a party’s indi- 
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vidual payoff. Uzzi’s (1997) study provides several “thick 
descriptions” of such a role. For example, a production 
manager in an apparel company described how her firm 
would forgo self-gain by contracting early for services to 
help a friend suffering from slow business. In the process, 
the company purposely sacrificed potential profits from 
alternative contractor relationships. 


Role of a Businessperson: The Logic of 
Consequences 


Although rules may serve as explanations for behavior, and 
roles themselves (e.g., a friend) can be conceptualized on 
the basis of rule-following behavior, some researchers sug- 
gest that roles are induced through the consequences of 
behavior (Biddle 1979). This principle is the cornerstone of 
rational choice theory (e.g., Becker 1976; Hogarth and 
Reder 1987), which tends to describe decision-making pro- 
cesses in utility maximization terms. March (1994, 1999) 
refers to such processes as involving a “logic of 
consequences.” 

Many of the extant theories of interfirm relationships, 
including transaction cost, agency, and game theory, share 
the idea of anticipatory, consequential choice. For example, 
a common explanation for sustained cooperation between 
firms is “calculative trust" (Williamson 1993).3 For exam- 
ple, a manufacturer may trust a distributor to fulfill its role 
because the known short-term benefits to the distributor 
from opportunism are outweighed by the long-term benefits 
from cooperation. 

The prototypical relationship role that corresponds to a 
logic of consequences 1s that of a businessperson, who is 
not bound by rules per se but whose primary motivation is 
utility maximization. In the terminology of game theory 
(Axelrod 1984), a businessperson plays the game by choos- 
ing the strategy (cooperation or defection) that maximizes 
individual payoffs. A businessperson may even pursue 
opportunistic behaviors (e.g., quality shirking) at the 
expense of a partner, if the gains from doing so are suffi- 
ciently large (Klein 1996). 


Distinctions Between the Two Logics 


Importantly, although rational choice theory does not deny 
the importance of rules, it tends to view them as being 
derived from, or somehow supported by, underlying utility 
maximization considerations (Scott 2000). For example, 
although Gibbons (1999) recognizes the importance of 
informal rules within relational contracts, he also notes how 


3Barney and Hansen (1994) make a distinction between “‘semi- 
strong" and "strong" or "hardcore" forms of trust. Semistrong trust 
corresponds to Williamson's (1993) and Uzzi's (1997) notion of 
calculative trust and reflects a calculative decision logic in line 
with the work of March (1994). Conversely, strong trust corre- 
sponds to Uzzi's (1997) notion of heuristic trust and reflects a 
rule-based decision logic ın line with the work of March (1994). 
As Barney and Hansen note (p 179), hardcore trustworthy 
exchange partners are trustworthy because “that 1s who they are.” 
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such rules must be explicitly enforced in a relationship 
(e.g., through a pattern of repeated interaction). As such, 
rule-following behavior (and, indeed, the very notion of 
relational contracting) is inherently tied to (economic) 
consequences. 

However, other scholars argue that the standard rational 
choice characterization of utility maximization is incom- 
plete. For example, Montgomery (1998, p. 100) argues, 


Rational choice theorists have trouble explaining adher- 
ence to "social norms" not because these rules would be 
difficult to derive from utility maximization, but because 
the associated maximization problems (entailing merely 
“psychic” costs and benefits) often seem vacuous. The 
need to construct nontrivial utility-maximization problems 
have led rational choice theorists to emphasize external 
sanctions (I follow a norm because others are watching 
me) over internalization (I follow a norm because of who 
I am). To the extent that a role has been internalized (i.e., 
the individual’s self-assessed “degree of membership”’ is 
positive), the cost of disobeying the norm is found not 
merely in external sanctions (which are completely irrele- 
vant if the individual knows that she is not being moni- 
tored), but in the violation of self-consistency— 
individuals would no longer recognize themselves. 


Montgomery's account of an "appropriateness" mode of 
decision making is consistent with the “moral imperative" 
view that Durkheim (1915), Etzioni (1975), and Elster 
(1989) describe. For example, Etzioni argues (p. 24) that 
“utilitarian satisfaction should not be confused with the 
sense of affirmation that accompanies discharging one’s 
moral commitments, commitments that are often in them- 
selves taxing rather than pleasurable.”4 Moreover, Elster 
writes (p. 104), "When norms are internalized, they are fol- 
lowed even when violation would be unobserved and not 
exposed to sanctions. People have an internal gyroscope 
that keeps them adhering steadily to norms, independently 
of current reactions of others.” On this note, and in contrast 
with Gibbons's (1999) view, Hackett (1994) provides evi- 
dence of "pure" relational contracts, in which the rules 
themselves are capable of guiding decision making, even in 
the absence of explicit enforcement devices. 

On the basis of the previous discussion, we suggest that 
two distinct decision logics exist and that they can be used 
to delineate two prototypical relationship roles, namely, 
friend and businessperson. We summarize the preceding 
discussion in Table 1. 


4Work within the rational choice paradigm has tried to reconcile 
these perspectives by suggesting that decision makers possess 
multiple utility functions for individual and social welfare, respec- 
tively (e.g., Harsanyi 1955; Margolis 1982). As such, utility may 
exist at different levels. In the marketing literature, Rokkan, Heide, 
and Wathne (2003) describe how solidarity norms prescribe 
behaviors that maximize relationship utility. In addition, it has 
been argued that utility exists in different forms, including that of 
psychological “warm glow" (Andreoni 1989, p. 1448). Still, the 
assumption that decision makers engage in calculations to maxi- 
mize utihty (conceptually, a logic of consequences) 1s consistent 
across all these models. 


i TABLE 1 
Prototypical Relationship Roles 


Key Characteristics 


General theoretical origin of role 


Subfields 


Representative literature 


Role’s underlying decision logica 


Role’s guide to decision making} Rules 


Actual role behavior 


aSee March (1994). 
bSee Montgomery (1998). 


. The Coexistence of Logics: A 
Decision-Theoretic Perspective on 
Relationship Roles- 


Notably, although the bodies of theory that provide the 
basis for the two prototypical relationship roles are quite 
different, as Table 1 shows, they tend to conyerge on the 
same general assumption, namely, that decision makers 
either follow rules or maximize utility or, more generally, 
that they play a single role (friend or businessperson) over 
time. In Gambetta’s (1996) terminology, a decision maker 
either (1) “is pushed from behind” (by preexisting rules) or 
(2) “jumps” (toward the alternative with the highest payoff). 

We wish to challenge this assumption. Although we 
view the two roles as distinct, in the sense that one or the 
other may be applied to an individual decision, we believe 
that the two can also coexist within a given’ party in the 
form of latent predispositions (Friedberg 2000; Greenwald 
and Pratkanis 1984). This is consistent with Powell and 
Smith-Doerr’s (1994, p. 371) argument that “identities are 
forged out of porous, multi-stranded relations in which 
business, reputation, and friendship are entangled.” 

A largely unaddressed issue, however, pertains to the 
specific manner in which roles coexist. Consider again the 
argument that an organizational role represents a form of 
identity or mental model of a relationship (Albert and 
Whetten 1985; Messick 1999). At a given time, based in 
part on a relationship's unique interaction history, a particu- 
lar identity will dominate a party's approach to that rela- 
tionship (Oakes 1987). For example, a manufacturer may 
view a certain supplier as a friend, and the manufacturer's 
general approach to the relationship may be reflected in 
friend types of actions, as in Uzzi's (1997) example, which 
we described previously. 


Friend 
Sociology 
Economic sociology (embeddedness 


perspectivo) 
Pure (self-sustalning) relational contracts 


Granovetter 1985; Etzloni 1975; Elster 
1989; Macaulay 1969 


Appropriateness (heuristic) 


Cooperation In principle 


Role 


Buslinessperson 


Economlcs/rational choice theory 


Transaction cost theory 
Agency theory - 
Game theory 
Extemally enforced relational contracts 


Willlamson 1985, 1996; Jensen and 
Meckling 1976; Axelrod 1984; Gibbons 
1999 


Consequences (calculatlve) 
Incentives 


Defectlon or cooperation - f (incentive 
structure) 


However, although the supplier in question may have 
friend status, the manufacturer can simultaneously possess 
a mental model of businessperson suppliers based on actual 
experiences in other relationships or on general industry 
observation. Over time, depending on the focal supplier's 
actions, the established relationship roles may be subject to 
reassessment. 

Research in organization theory (e.g., Albert and Whet- 
ten 1985) describes how identities are formed by interorga- 
nizational comparison processes that directly affect organi- 
zational action. If a given comparison process produces a 
sufficiently large "identity gap," certain congruence- 
enhancing responses are triggered, such as a reassessment 
of a party's own identity and/or the partner's. For example, 
to the extent that a supplier pursues actions normally asso- 
ciated with a businessperson role, a complex process of role 
switching may be initiated. For example, the former sup- 
plier friend may be assigned to a new category (a busi- 
nessperson), and the manufacturer's own role may receive a 
similar designation. 

Larson (1992, pp. 86-100) provides an example of how 
a role shift can also be triggered by external factors: 


[T]he role shift occurred when "Support Products" (SP) 
was acquired by a much larger corporation. Before it was 
acquired, the company had a reputation for developing 
close supplier relationships, characterized by an ethos of 
friendship and mutual assistance. SP's procurement 
manager described the philosophy behind the firm's key 
supplier partnerships: "These people are extensions of 
your own organization—they are part of your family." 
When SP was acquired, the new parent imposed internal 
sourcing requirements on its newly acquired division that 
dramatically altered the terms and character of SP's sup- 
plier alliances. The partnership philosophy was replaced 
by a short-term, cost-only focus. SP's purchasing manager 
explained that as a consequence, the relationships shifted, 
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and collaboration and special treatment became increas- 
ingly unlikely. 


At a general level, role-switching processes have been 
described as transitions between “primary and secondary 
frameworks” (Goffman 1969) or between “quasi-stationary 
equilibria” (Ashforth 2001). The actual transitions have 
sometimes been described in terms of “unfreezing” and 
"refreezing," suggesting that role switching is subject to 
inertia pertaining to both (1) a partner’s new designation 
and (2) a party’s own role and associated behaviors. 
Regarding the former, a given event or situational cue (e.g., 
a supplier that used to be viewed as a friend refuses to adapt 
to changing circumstances) may trigger doubts about the 
viability of the existing role. However, an actual recatego- 
rization of the supplier need not take place, at least not 
without additional confirmatory evidence. A chief executive 
officer who Uzzi (1997, pp. 44-45) interviewed provided 
the following description: “Sometimes they ask for a favor 
(e.g., a low price) and we'll do it. But if they always do that, 
they’re ripping me off.” 

Similarly, the focal party (i.e., the manufacturer) may 
continue to play an existing role, even in the face of evi- 
dence about a partner's new role, to the extent that the focal 
actions reaffirm the firm's own identity and, in general, 
reduce uncertainty (Turner et al. 1987). In addition, an 
existing role may be "sticky" (Bolton and Reed 2004) to the 
extent that it is shared by other parties or referents (Ash- 
forth 2001). For example, a manufacturer that has a global 
reputation as a friend in a larger industrial network may 
continue to enact a friend role, even toward suppliers whose 
actions belong to a standard businessperson repertoire. 
Decision makers may have a baseline tendency to pursue 
behaviors that reaffirm existing roles. 

Numerous accounts of firms' struggles to change their 
relationship orientations provide evidence of role stickiness. 
For example, Lyons, Krachenberg, and Henke (1990, p. 33) 
report the following statement that a senior executive made 
about his firm's attempted change in supplier strategy: 
"We've had to outplace or retire some of our most experi- 
enced, veteran buyers. It was just too much to expect them 
to change from playing poker with suppliers to cooperating 
with them. The old ways and the new game just didn't 
match." 

Despite the possibility of role stickiness, the general 
conclusion from the previous discussion is that parties may 
possess multiple identities over time, even within the con- 
text of an individual relationship. As such, rather than 
applying fixed characterizations, such as calculative or 
heuristic, we view relationship parties as collections of 
roles. Although the dual role conceptualization of friend 
and businessperson is admittedly somewhat simplistic, it 
nonetheless provides a useful starting point for building a 
theory of relationship roles. 


Roles and Governance Decisions: 
A Conceptual Framework 


In this section, we outline a framework that specifies how 
roles are (1) initially created in relationships and (2) subse- 
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quently activated. We also sketch out possible linkages 
between roles and particular relationship outcomes. As we 
discuss throughout the section, these outcomes pertain both 
to the roles themselves (e.g., reinforcement of current roles 
or switching) and to various economic dimensions. For 
example, the consequence of a firm's decision to begin 
monitoring a supplier "friend," which, by definition, would 
not require external surveillance, may be an economic 
opportunity cost. 

In developing the framework, we integrate the notion of 
a role with extant frameworks of interfirm governance (e.g., 
Cannon and Perreault 1999; Heide 1994; Ouchi 1979; 
Wathne and Heide 2000). The governance literature pro- 
vides a useful building block because it identifies particular 
relationship management mechanisms with different time 
dimensions, which can be expected to serve particular role 
creation and activation purposes. We first review a core set 
of governance mechanisms ànd their general properties. 
Then, we discuss their particular purposes in relation to 
relationship roles. 


General Governance Mechanisms 


Ouchi (1979) discusses two broad relationship management 
strategies available to firms: À firm can (1) identify parties 
that "fit its needs exactly" (p. 840) or (2) design a manage- 
rial system that specifically promotes desired behaviors. In 
practice, as suggested by extant governance typologies 
(e.g., Heide 1994; Wathne and Heide 2000), the former 
approach can be implemented through selection and sociali- 
zation efforts, whereas the latter tends to be based on incen- 
tives and monitoring. Although our primary focus is not on 
these mechanisms per se, a brief review of each, including 
examples of actual industry practice, provides a useful 
foundation for the presentation of our role framework. 


Selection. Relationship governance through selection 
involves identifying exchange partners a priori that exhibit a 
good fit on particular criteria. The selection criteria may 
involve both skills (e.g., product quality and human 
resources) and values (e.g., cultural compatibility). Recent 
industry accounts (Killian 2003) show how companies such 
as Culver's and Home Depot rely on both types of criteria 
to select their franchisees and suppliers, respectively. Ford 
Motor Company has formalized its selection efforts through 
specialized supplier qualification programs, which serve as 
a trial period for prospective suppliers (Purchasing 1995). 
For some companies, such as the food retailers Whole 
Foods Market and Pret A Manger, selection is such an inte- 
gral part of the companies’ overall strategies that their elab- 
orate practices are explicitly communicated to their stake- 
holders. For example, Pret A Manger’s promotional 
materials describe its selection practices as “treasure 
hunting.” 


Socialization. Conceptually, socialization is the process 
by which new parties learn skills and internalize another 
party’s values, goals, and rules (Chatman 1991). Socializa- 
tion can happen informally over the normal course of a rela- 
tionship, or processes can be explicitly designed to promote 
learning and internalization. As an example of the latter, 
many corporations have established their own universities 


as a way to both develop particular skills and promote par- 
ticular values. These efforts range from McDonald’s 
attempt to get franchisees to “March to a single McDrum” 
(The Economist 1999), to Saturn's investments in “Saturniz- 
ing” dealership employees (Meister 1998), to Motorola’s 
training of supplier and distributor “Black Belts” (Corpo- 
rate University Review 2000). Supplier qualification pro- 
grams, as we discussed previously, may also serve distinct 
socialization purposes, in that a party is asked to take on a 
future role in an “anticipatory” way (Biddle 1979; Dubin- 
sky et al. 1986). 


Incentives. A cornerstone of agency theory (e.g., Jensen - 


and Meckling 1976) is relationship management through 
explicit incentives. For example, manufacturers can manage 
their reseller relationships through commissions (John and 
Weitz 1989), margins (Dutta, Bergen, and John 1994), 
bonuses (Gilliland 2003), or some combination thereof. 
Cisco recently launched a reseller program called “Take 
Stock in Cisco,” which provides “share points” in return for 
selling particular Cisco products (Schick 2001). In general, 
the rationale for the use of incentives is to structure a rela- 
tionship in such a way that particular actions are explicitly 
rewarded or punished (Telser 1980). 

Monitoring. The final governance strategy involves 
explicit measurement of partner performance 'relative to 
some preexisting agreement or standard (Anderson and 
Oliver 1987). In principle, a firm's monitoring may include 
a partner's output or behavior, and it can be implemented in 
various ways. For example, industrial buyers may monitor 
their component suppliers using incoming inspection, 
evaluation of statistical process control charts, or actual 
audits of a supplier's plant. Firms such as Sears have for- 
malized their monitoring process by developing "vendor 
report cards" (Aron 1998). Franchisors such as McDonald's 
rely on "undercover" agents who evaluate franchisees on 
attributes such as food quality, speed, and store cleanliness 
(Tatge 2001). Recently, Arby's and 7-Eleven have begun to 
experiment with electronic surveillance systems (Shirouzu 
and Bigness 1997). 

Our conceptual framework (see Figure 1) is based on 
the premise of inherent matches between particular gover- 
nance mechanisms on the one hand and different relation- 
ship roles on the other hand. Specifically, we propose that 
relationship strategies, which emphasize "clanlike" selec- 
tion and socialization, are inherently linked with the role of 
a friend and rule-based decision making under a logic of 
appropriateness. In contrast, the "harder" ,governance 
mechanisms of incentives and monitoring are linked to a 
businessperson role and its logic of consequences. Our 
framework suggests that a firm's governance decisions can 
affect both (1) the creation of relationship roles ex ante and 
(2) the activation of roles over time. 


Ex Ante Hole Creation 


In principle, all the governance mechanisms we discussed 
previously are capable of creating relationship roles. How- 
ever, the specific manner in which the relevant roles are cre- 
ated may differ dramatically. 


Consider selection and socialization as a starting point. 
Such efforts are inherently associated with a logic of appro- 
priateness, in the sense that a firm attempts either (1) to 
identify a friend that already possesses the relevant rules or 
(2) to develop one through social processes. Supplier quali- 
fication programs, which we discussed previously, may cre- 
ate friend roles in both of these manners. Note that the role 
creation process under a qualification program is direct, in 
that one firm (e.g., a buyer) targets its governance efforts 
toward a particular party (e.g., an individual supplier). 
Theoretically, role creation in this manner involves creating 
an "embedded" market through social processes (Granovet- 
ter 1985). We illustrate the relevant processes in Figure 1 
through the top-left vertical arrow. 

Governance mechanisms may also establish roles in an 
indirect way by allowing parties to signal their predisposi- 
tions and self-select into the relationship. Consider the pre- 
viously discussed businessperson role and governance 
efforts in the form of incentives and monitoring. As we dis- 
cuss in further detail subsequently, such strategies may be 
inherently incompatible with a friend role. However, to the 
extent that a given firm has a known reputation ex ante for 
relying on monitoring and incentives, it allows other parties, 
which already possess dominant businessperson roles and 
for which such mechanisms are consistent with their 
decision-making mode (i.e., a logic of consequences), to 
self-select into the relationship. 

As Stryker (1968) describes, people have a tendency to 
seek out opportunities to enact a “salient identity.’ Gam- 
betta (1996) provides an interesting account of how pub- 
licly known incentive systems serve as distinct self- 
selection devices into academic institutions. Similarly, 
franchisors attract franchisees to run their independent 
stores whose mind-sets are consistent with incentive sys- 
tems based on "residual claims" (Brickley and Dark 1987). 
The Swedish furniture company IKEA has a well-known 
system of supplier competition, in which both incumbent 
and integrated suppliers must compete like independent 
contractors with new suppliers on an ongoing basis (Mar- 
gonelli 2002). This system creates a particular form of 
incentive structure, which allows “appropriate” suppliers 
(i.e., those that possess compatible orientations) to self- 
select into the manufacturer relationship. The (indirect) 
creation of a businessperson role in this manner involves the 
establishment of a conventional "market" rather than an 
"embedded" relationship (Baker 1990; Williamson 1975). 
The top-right vertical arrow in Figure 1 depicts these 
processes.’ 


5For simplicity, in the previous discussion (and in Figure 1), we 
associated direct governance efforts with selection and socializa- 
tion and a friend role, but firms may also rely on such efforts to 
create a businessperson. For example, Ryan and Deci (2000) 
describe how one party may internalize another’s economic goals. 
Conceptually, however, the ongoing behavior that follows from 
this form of internalization is guided by utility-maximizing con- 
siderations under a logic of consequences. Moreover, although we 
discussed the possibility of (indirect) self-selection in the context 
of monitoring and incentives and a businessperson role, self- 
selection can also be associated with other governance mecha- 
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Ex Post Role Activation certain linkages exist between (1) the decision “logics” that 
underlie particular roles and (2) the inherent nature of par- 
ticular governance mechanisms. 

These linkages may manifest themselves in two differ- 
ent ways: First, the role that currently dominates a party’s 
approach to a certain relationship may be reinforced by the 
other party’s governance choices. The reinforcement sce- 
narios, which involve “matches” between roles and gover- 


In addition to creating roles ex ante, as we discussed previ- 
ously, governance mechanisms also play important ex post 
functions in relation to relationship roles. As was the case 
previously, our framework is based on the assumption that 


nisms (e.g., socialization). For example, Camerer and Vepsalainen 


(1988) discuss how companies that make their cultures visible cre- nance mechanisms, appear in Cells 1, 5, and 6 in Figure 1. 
ate opportunities for self-selection. Alternatively, to the extent that a particular governance 
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mechanism is incompatible with the current role’s decision 
logic, a mismatch scenario is created, as we show in Cells 2, 
3 and 4. If the current role is sticky, as we previously dis- 
cussed, the immediate outcome of a mismatch scenario may 
be maintenance or perhaps dilution of the current role. 
However, there is also the possibility that a mismatch will 
produce actual role switching. 

As we discuss subsequently, the upshot of our frame- 
work 1s that the different ex post scenarios produce immedi- 
ate. outcomes with respect to the roles themselves (e.g., 
reinforcement or switching). In addition, the different sce- 
narios have long-term consequences of both a behavioral 
and an economic nature. We first discuss the possible ex 
post scenarios for an existing friend role, and then we 
sketch out the parallel scenarios for a businessperson role. 


Existing friend role. Assume that a firm's initial gover- 
nance efforts toward a particular exchange partner involved 
selection or socialization and that these efforts succeeded in 
establishing a friend role for the partner. Given these initial 
efforts, the relationship will end up in Cell 1, Cell 2, or Cell 
3, depending on the subsequent governance choices. If the 
firm continues to rely on socialization (Cell 1), the partner's 
current role will be reinforced. For example, a supplier that 
has achieved friend status may sustain and possibly increase 
the original level of rule-based cooperation. Given the evi- 
dence that initial socialization effects tend to decay over 
time (Harrison and Caroll 1991), this form of role reinforce- 
ment may be an important relationship initiative. As a spe- 
cific example, Tupperware organizes regular *homecoming 
jubilees" for its top sellers, which include the singing of an 
anthem called "I've got that Tupper Feeling" (Brooks 
2004). 

However, if the firm in question deploys explicit incen- 
tives (Cell 2) or monitoring (Cell 3), there is a distinct pos- 
sibility of a mismatch between (1) the decision logic inher- 
ent in the established role (i.e., a friend's logic of 
appropriateness) and (2) the use of governance mechanisms 
that are inherently tied to a logic of consequences. From a 
purely descriptive perspective, the available evidence sug- 
gests that such mismatches are not unlikely. For example, 
Gardiner (2003, p. 18) argues that firms "are not good at 
judging motivations" and, consequently, aré prone to 
"heavy and misplaced" reliance on incentive-based 
strategies. 

What are the potential outcomes of a mismatch? It is 
possible that the existing role will continue to be enacted by 
the partner, though in a weakened or diluted state. Specifi- 
cally, a new governance regime may create an "identity 
gap, as we previously discussed, and trigger doubts about 
the viability of the existing relationship roles. For example, 
a supplier that historically has categorized a given buyer as 
a friend but subsequently becomes subjected to a stringent 
monitoring system may reclassify the buyer as a busi- 
nessperson.Ó It is conceivable that the supplier itself, as a 
result of role stickiness, will continue to enact the friend 
role. However, going forward, the supplier's behaviors are 


6Technically, such determinations are made on the basis of styl- 
ized "if-then" statements (Hayek 1945). For example, if a buyer 
monitors, then he or she will be recognized as a businessperson. 


likely to be in the form of perfunctory compliance with the 
buyer's strategy (Moschandreas 1997). For example, the 
supplier may continue to deliver components according to a 
fixed delivery schedule but refuse to accommodate special 
requests during unusual demand conditions (Wathne and 
Heide 2000). In this scenario, the buyer's governance 
efforts are actually dysfunctional and associated with 
opportunity costs because costly initiatives are deployed 
toward a party that has already internalized a friend role. 
Perhaps more seriously, it is also possible that the 
buyer's use of market-based governance mechanisms in 
such situations produces actual role switching. The identity 
gap may lead to a recategorization for the supplier, resulting 
in a matching businessperson role with an entirely new 
behavioral repertoire. As a specific example of role switch- 


‘ing, consider 7-Eleven’s decision to impose an electronic 


monitoring system on its franchisees. As various sources 
(e.g., Coughlan et al. 2001; Lal and Han 2005) describe, 
7-Eleven franchisees are frequently attracted to the system 
in the first place because of perceived compatibility 
between the company's operating philosophy and their own 
mind-sets. However, the implementation of a “draconian” 
electronic surveillance system was viewed as such a funda- 
mental shift in management philosophy that it caused 
severe reactions among the franchisees, some of which 
were subsequently described as "militant radicals" (Shi- 
rouzu and Bigness 1997). 

In addition to the direct effects on roles themselves, the 
mismatch scenarios may produce various long-term out- 
comes. For example, the new behavioral repertoire that fol- 
lows from a role switch may include retaliatory actions, 
which are "characteristically opportunistic" in nature (John 
1984, p. 280). Over time, this may have economic conse- 
quences in the form of high direct transaction costs in the 
ongoing relationship. 

Importantly, both the general possibility and conse- 
quences of role switching are well documented. Kreps 
(1997, p. 363) notes that a firm that historically has not 
monitored closely in a given relationship and that has not 
relied on strong extrinsic incentives may create “muddi- 
ness" and a shift in relationship orientation. Tenbrunsel and 
Messick (1999) show empirically that surveillance and 
sanctioning systems can change both (1) a party's orienta- 
tion toward a partner and (2) the actual choice rule used. 
Specifically, they show that the use of such hard relation- 
ship features increased the likelihood that a business type of 
frame was evoked, which in turn induced a calculative deci- 
sion rule. Notably, Tenbrunsel and Messick also show that 
after a business frame was evoked, a weak surveillance and 
sanctioning system that was intended to increase coopera- 
tion actually decreased the likelihood of cooperative 
behaviors. 

The preceding discussion implies that a perceived role 
switch by an exchange partner may affect a party's own 
decision logic and that the shift may have nonintuitive (and 
negative) consequences. In Nagin and colleagues' (2002) 
terminology, these consequences follow from applying a 
"rational cheater" model and its underlying assumption 
(i.e., logic of consequences) to situations in which decision 
makers are currently working under a "conscience model" 
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or under a logic of appropriateness. Notably, and somewhat 
disturbingly, Anderson and Jap (2005) cite fieldwork and 
psychological research that document that close interfirm 
relationships are particularly vulnerable to showing a “dark 
side.” 


Existing businessperson role. We now consider a situa- 
tion in which a party already makes decisions as part of a 
businessperson role and in which the focal behavioral reper- 
toire is based on a logic of consequences. In Figure 1, the 
possible scenarios appear in Cells 4, 5, and 6. The matches 
are represented by Cells 5 and 6, in which incentives and 
monitoring are deployed toward a party whose current role 
is consistent with the use of such mechanisms. We expect 
that the outcome of these scenarios is reinforcement of the 


current role. Notably, there is evidence that suggests that . 


such matches have distinct performance implications. For 
example, Tenbrunsel and Messick (1999) show that when a 
business type of decision frame was evoked in a relation- 
ship, strong sanctions indeed increased cooperation. 

The mismatch scenario for an existing businessperson 
role appears in Cell 4. A party whose current mental model 
is based on a logic of consequences is subsequently sub- 
jected to socialization. Although this scenario involves a 
mismatch, we do not necessarily predict a role switch. 
Instead, relying on socialization in such a situation may 
simply cause a resource waste, to the extent that the relevant 
efforts fail to convert the businessperson to a friend. The 
magnitude of the socialization efforts needed to induce such 
a switch is likely to be substantial, as evidenced by Mit- 
subishi Motor Corporation’s efforts to reverse its adver- 
sarial supplier relationships (Smitka 2001). To regain its 
suppliers’ support, Mitsubishi Motor Corporation went so 
far as to establish a formal supplier association and a tech- 
nical support center staffed with 70 full-time engineers. 

Notably, to the extent that a firm’s relationship efforts 
involve dedicated investments, as in the preceding example, 
a failure to bring about a role switch in a businessperson 
may produce a mismatch scenario with substantial eco- 
nomic implications. To illustrate, consider Jackson’s (1985) 
analysis of the risks associated with using “relationship 
marketing” initiatives toward customers that operate under 
a “transaction” orientation. For example, a supplier that 
develops a dedicated product and delivery system for a 
buyer that is looking for only a temporary source of 
capacity may not only fail to reap the returns from the 
investments but also subsequently face a significant lock-in 
condition (Klein, Crawford, and Alchian 1978). This sug- 
gests that explicit attention to roles can be an important pre- 
requisite for relationship investment decisions. 


Discussion 


Theoretical Implications and Relationships with 
Prior Research 

Much of the extant research on interfirm relationships has 
focused, explicitly or implicitly, on the orientations that par- 
ties bring to bear on their interaction. In some of this litera- 
ture, a sharp distinction has been drawn between different 
types of orientations. For example, one particular body of 
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research emphasizes calculative orientations, which corre- 
spond to March’s (1994) logic of consequences. Many of 
the theoretical frameworks used to study interfirm relation- 
ships (e.g., Axelrod 1984; Eisenhardt 1989; Williamson 
1985) belong to this category. At the same time, there are 
theoretical accounts and empirical evidence, primarily from 
sociology and contract law (e.g., Macaulay 1963; Uzzi 
1997) of heuristic orientations, which are consistent with a 
logic of appropriateness. Overall, however, the extant litera- 
ture tends to take an "either-or" type of approach, implic- 
itly assuming that relationships can be described in terms of 
a single orientation. 

Our central argument is that such categorical distinc- 
tions may be too simplistic. On the one hand, we subscribe 
to the general distinction between the logics of conse- 
quences and appropriateness, as they pertain to individual 
relationship episodes. As such, our framework is consistent 
with prior work that describes relationships in terms of 
locations on particular continuums, such as the degree of 
trust, commitment, and long-term orientation. Specifically, 
a high "score" on such a dimension in a given situation 
would most likely reflect the strength of a prevailing friend 
role. On the other hand, the upshot of our framework is that 
descriptions along single dimensions may not fully capture 
the true nature of relationship interactions. We posit:that 
exchange relationships may simultaneously involve differ- 
ent types of orientations. Importantly, this goes beyond 
acknowledging that relationships are multidimensional, in 
the sense of joint descriptions on intercorrelated attributes 
(e.g., different relational norms). Although we endorse this 
view as it applies to individual transactions, our framework 
is based on the assumption that actual shifts among funda- 
mentally different relationship orientations are both possi- 
ble and likely. 

Theoretically, our framework begins to provide an inter- 
active perspective on relationship formation and manage- 
ment. For example, a firm may rely simultaneously on 
(direct) selection processes and on (indirect) opportunities 
for self-selection through various incentive and monitoring 
devices. The notion of self-selection suggests that though a 
particular governance process can be initiated by one party, 
its ultimate success depends on the other party's tendency 
to seek out a salient role. As such, our framework suggests 
that certain governance mechanisms, which have often been 
described primarily from the perspective of a single firm 
(i.e., the principal), actually involve and, in a broader sense, 
are available to both relationship parties. Moreover, interac- 
tivity follows from the notion that firms judge a partner's 
role on the basis of the particular governance choices made, 
and these judgments ultimately influence the focal firm's 
own role and relationship decisions. Our framework con- 
ceptualizes relationship roles as the products of complex 
processes that involve multiple parties, governance reper- 
toires, and judgments. 

In a recent article, Stout and Blair (2001) criticize cer- 
tain branches of extant social science research for treating 
decision making as a "black box" and for being limited to 
observing particular antecedents and outcomes without 
accounting for the underlying process. Similar critiques 
have been voiced by others, who have asked for greater 


attention to what lies behind choices (Hodgson 1997) and 
for “mechanism-based explanations" (Hedström and Swed- 
berg 1998, p. 9). In a broad sense, our framework is an 
attempt to shed some light on this issue. We propose that 
the concept of a role may yield some insight into the black 
box and that it may help integrate discrepant theoretical 
perspectives on relationships and interactions. 

Finally, we believe that our role framework has the 
potential to generate new insights into the unit-of-analysis 
issue in relationship research. Consider the literature on 
transaction costs as an example. In this literature, the indi- 
vidual transaction has been suggested as the key unit of 
analysis (Williamson 1996). Rindfleisch and Heide (1997) 
note how this assumption has manifested itself in empirical 
work in the form of a focus on dyadic relationships. 
Recently, however, this particular focus has been described 
as being too restrictive, insofar as it ignores the effects of 
related relationship dyads. As a consequence, researchers 
have begun to adopt larger units of analysis, such as multi- 
ple dyads and networks (e.g., Wathne and Heide 2004). 

For all practical purposes, adopting a network perspec- 
tive means broadening the unit of analysis and taking a 
macro perspective on relationship research. Although we 
acknowledge the importance of such a perspective, our 
framework represents a call for disaggregation, that is, for a 
micro-level account of relationships and parties. Specifi- 
cally, our role framework suggests that relationships cannot 
be properly understood without explicit recognition of both 
(1) the set of roles with which firms approach relationships 
in the first place and (2) the ability of relationship manage- 
ment strategies to reinforce or dilute those roles or to cause 
actual switching. In certain respects, our framework recog- 
nizes the "decision premise" as an important unit of analy- 
sis, consistent with Simon's (1947) early recommendation. 

Importantly, this does not diminish the need to study 
larger marketing networks. However, accounting for roles 
suggests that certain phenomena are more complex than fre- 
quently assumed. For example, it is possible that the use of 
particular strategies in one relationship influences not only 
the pattern of roles in that particular relationship but also 
the possibility of role switching in related ones. Heide 
(2003) recently showed that market relationships that coex- 
ist with an integrated operation within a so-called plural 
system exhibit more hierarchical characteristics than singu- 
lar ones, as a result of spillover effects across connected 
relationships. In general, we believe that a role-theoretic 
model of relationship governance may be useful in docu- 
menting processes at both a micro and a macro level and 
both within and across parties and relationships. 


Managerial Implications 


A considerable body of literature has emerged in marketing 
on the general topic of governance and relationship man- 
agement strategies. Both academic research and industry 
observation suggest that firms' governance decisions are 
important parts of their overall marketing strategies (Ghosh 
and John 1999). Moreover, firms' governance choices have 
been shown to have distinct performance popneeuons (e.g., 
Buvik and John 2000). 


However, the focus in prior research has often been 
strictly on the governance mechanisms themselves. For 
example, Baker and Faulkner (1991) develop a set of gen- 
eral recommendations for relationship management, which 
include the explicit use of incentives (“link pay with perfor- 
mance”), monitoring (“increase accountability"), and 
related initiatives. Although we do not dispute the possible 
usefulness of such efforts, we suggest a caveat about the 
generic use of any governance program. Our conceptual 
framework accounts for the notion that inherent linkages 
exist between roles and governance mechanisms and that 
firms' governance decisions must be made in a context- 
sensitive manner and with explicit attention to the ability of 
governance mechanisms both to create roles initially and to 
activate them over time. In general, understanding relation- 
ship roles is an important prerequisite for governance 
deployment. 

In managing relationships, firms must also carefully 
consider both roles and functions. Consistent with recent 
role theory, we draw an explicit distinction between the 
two. For example, in a supply chain context, there is a cru- 
cial difference between functions such as inventory mainte- 
nance and forecasting and the specific manner in which 
they are carried out (i.e., as a friend or businessperson). As 
Kahn (2003) notes, the implementation of efficient- 
consumer-response and quick-response types of initiatives 
requires fundamental changes in relationships between sup- 
ply chain partners with respect to both the allocation of 
functions and overall relationship orientation. 

To some extent, the focus of our conceptual framework 
is on mismatches between governance decisions and roles 
and on their potential outcomes. For example, we suggest 
that incentives and monitoring, which have been recom- 
mended as generic relationship management initiatives 
(e.g., Baker and Faulkner 1991), are intrinsically linked 
with only one type of role, namely, that of a businessperson. 
Crucially, deploying such strategies in other contexts (1.e., 
toward friends) not only may be ineffective or cause 
resource waste but also may cause role switching and sub- 
sequent activation of behavioral repertoires that may 
include opportunism. 

Notably, although the concept of relationship roles has 
received little systematic attention in extant research, it is 
frequently suggested as a key explanation of relationship 
problems. For example, Solomon and colleagues (1985, p. 
105) note how a "failure to read from a common script" is a 
typical cause of difficulties in relationships between clients 
and service providers. Similarly, Uzzi (1999, p. 502) con- 
cludes that it is inherently difficult to form ties between par- 
ties that "use alternate scripts." Although more fine-grained 
guidelines for managerial decision making require a body 
of empirical evidence, our general conclusion is that firms' 
governance decisions are inherently more complex than is 
frequently assumed and that such decisions may hinge cru- 
cially on detailed assessments of relationship roles. 


Issues for Further Research 


Our conceptual framework in Figure 1 involves a series of 
research issues and hypotheses that can be tested empiri- 
cally. In general, the underlying philosophy of the frame- 
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work is that linkages exist among (1) ex ante roles or “ini- 
tial conditions” (Cowen 1998), (2) governance strategies, 
and (3) outcomes. Clarification of the specific nature of 
these linkages requires empirical testing, including efforts 
to measure directly and account for relationship roles. We 
echo Kreps’s (1997, p. 364) sentiments that such empirical 
efforts are “likely to be messy.” Nonetheless, increasing the 
understanding of roles and their interplay with established 
relationship constructs seems to be an important research 
priority. 

In addition to the general research agenda inherent in 
our conceptual framework, there are additional, more spe- 
cific research issues that should be systematically exam- 
ined. We discuss some of them next. 


Relationship investments and resource allocation deci- 
stons. Important issues pertain to the linkage between roles 
and profitability. Most likely, the two particular roles we 
discuss herein differ substantially in terms of their cash- 
flow patterns. For example, creating a friend role may 
require significant up-front investments, for example, in the 
form of selection and socialization efforts. At the same 
time, after a friend is established, it may be cheaper to 
maintain than a businessperson. However, note that certain 
types of exchange partners that may fit a friend profile can 
become demanding and costly to serve (Reinartz and 
Kumar 2003). 

It is also important to document how strongly roles are 
held and how easily they can be switched or “unfrozen.” For 
example, if certain customers have undesirable cost impli- 
cations, aS suggested in Reinartz and Kumar’s (2003) 
research, and if established roles are indeed sticky, it sug- 
gests that a firm’s role creation decisions must be 
approached in a deliberate and forward-looking manner. 

At the same time, although existing sticky roles may 
have nontrivial cost implications, they may also represent 
sources of competitive advantage. For example, consider an 
industrial supplier that enjoys sticky friend relationships 
with its buyers. From the standpoint of a competing sup- 
plier, this raises important questions about both (1) the spe- 
cific sources of role stickiness and (2) the particular “cor- 
rective procedures” (Bolton and Reed 2004) that may 
undermine the incumbent supplier’s relationships. 

Finally, a related profitability issue pertains to resource 
allocation decisions across relationship types within an 
overall portfolio Johnson, Sohi, and Grewal 2004; Johnson 
and Selnes 2004). At a given time, a firm may possess a mix 
of roles in its portfolio, and decisions must be made about 
how to allocate scarce resources among them. Dickson and 
colleagues (2004) note how firms may need to make funda- 
mental allocation choices between (1) using scarce 
resources to reinforce prior behavior (e.g., in a current 
friend relationship) and (2) using them to solve problems in 
others (e.g., trying to convert a businessperson). These 
types of issues require firms to consider both initiation and 
maintenance costs carefully, at the relationship and the port- 
folio levels. 


Relationship perceptions. As we noted previously, 
deploying governance mechanisms without a careful con- 
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sideration of existing relationship roles may have 
unintended consequences, such as resource waste and role 
switching. Given the well-documented finding that firms 
have divergent perceptions about their interaction (e.g., 
Wathne, Biong, and Heide 2001), such scenarios are not 
unlikely. Indeed, on the basis of previous research, it might 
be expected that the potential for mismatches between gov- 
ernance deployment and prevailing roles is high and that 
this may be a possible explanation of relationship failure. 
Research could be directed usefully toward documenting 
(1) the manner in which role perceptions are established 
and (2) whether particular roles are uniquely held by an 
individual party or are shared in some way. A related ques- 
tion is whether a party may misrepresent a given role. In 
The Republic, Plato discusses how Socrates defines a friend 
as "one who both seems and is an honest man." It is clear 
that distinct opportunities exist, at least in the short run, for 
“seeming” and for the misrepresentation of roles. This 
raises important relationship management challenges in its 
own right. 

In general, role theory suggests that a distinction can be 
made between perceptions that pertain to (1) a specific 
exchange partner, as per our framework, and (2) certain 
positions or organizational forms (Biddle 1979; Forman and 
Whetten 2002), as per conventional role theory. For exam- 
ple, a manufacturer may have a perception of retailers in 
general, which will influence its approach to a particular 
partner. As a specific example, a manufacturer whose prior 
interaction has primarily involved smaller retail clients may 
perceive retailers in general as friends and therefore 
approach new retail relationships with this particular orien- 
tation. Our framework highlights the risks associated with 
such mental shortcuts, insofar as they produce costly mis- 
matched governance initiatives. 

Issues such as those we present in the preceding discus- 
sion illustrate how roles can be conceptualized at multiple 
levels of analysis. With increasing demand for cross-level 
constructs and theories (e.g., Klein and Kozlowski 2000; 
Rousseau 1985), role theory represents a promising avenue 
for further research. Our discussion also reinforces the 
importance of continuing to direct relationship research 
toward improving the understanding of human behavior. 
Notably, Williamson (1996, p. 266), who is frequently criti- 
cized for the particular behavioral assumptions that underlie 
transaction cost economics, explicitly suggests that “more 
microanalytic attention to the processes through which trad- 
ing relationships evolve is indeed a rewarding research 
enterprise.” Similarly, Tenbrunsel and Messick (1999, p. 
706) emphasize the “need to explore assumptions” about 
decision makers and their reactions to the mechanisms that 
are deployed in exchange relationships. Although continued 
theoretical integration and empirical research may turn out 
to be messy, the resultant insights could be considerable. 





7See Plato (2000), The Republic, trans. Tom Griffith. Cam- 
bridge, UK: Cambridge University Press. 
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Human Brands: Investigating 
Antecedents to Consumers’ Strong 
Attachments to Celebrities 


This article explores recent advances in self-determination research to address why consumers develop strong 
attachments to “human brands,’ a term that refers to any well-known persona who is the subject of marketing 
communications efforts. Study 1 uses a survey that is analyzed with structural equation modeling. Study 2 is 
qualitative and offers corroborating evidence for the proposed theoretical model. Study 3 extends the model with a 
more naturalistic sample and tests several alternative hypotheses using hierarchical regression. The results 
suggest that when a human brand enhances a person's feelings of autonomy and relatedness and does not 
suppress feelings of competence, the person is likely to become more strongly attached to it. This article documents 
that strong attachments are predictive of satisfied, trusting, and committed relationships and proposes that 
attachment strength may be a parsimonious proxy for consumer—brand relationship strength. The results imply that 
benefits would accrue to organizations such as entertainment firms and political parties that establish direct and 
routine interaction between human brands and consumers, that human brands to which consumers are attached 
offer significant potential as endorsers, and that organizations should address how to make the human brands they 





manage more authentic. 


“When I go to the grocery store, people hug me. But that's 
OK. People do know me. They've seen me grow up on 
television." 

—Michelle Kwan! 


consumer relationships has flourished (e.g., Sheth and 

Parvatiyar 1995). One approach to studying these rela- 
tionships focuses on feelings of attachment and love (e.g., 
Carroll and Ahuvia 2006), which lay “at the core of all 
strong brand relationships” (Fournier 1998, p. 363). 

This article focuses on consumers’ attachments to 
human brands, a term that refers to any well-known persona 
who is the subject of marketing communications efforts 
(e.g., Rindova, Pollock, and Hayward 2006). The central 
purpose of this article is to address why consumers form 
strong attachments to human brands. From a practical 
standpoint, advancing the understanding of the relationship 
between consumers and human brands is important because 
much of the success of the $190 billion U.S. media and 
entertainment industry hinges on the successful positioning 
of one of its key assets, the celebrity (Plunkett Research 


[: the past decade, academic and practitioner interest in 
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Horovitz (2003). 
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2004). Organizations spend vast sums annually in an effort 
to establish psychological connections between consumers 
and human brands, such as Michael Jordan, Tiger Woods, 
and Oprah (Serwer 2001). Firms even specialize in brand 
management with respect to dead celebrities, such as Steve 
McQueen and John Wayne (Halpern 2005). Similarly, in the 
2004 U.S. presidential election, each of the two main politi- 
cal parties spent in excess of $1 billion dollars in an effort 
to position their candidates favorably with voters (Edsall 
and Grimaldi 2004). That is, organizations are already dedi- 
cated to managing human brands and building emotional 
bonds with consumers. This article is an effort to explore 
this phenomenon methodically; it does so by advancing 
hypotheses and testing them in three studies. Study 1 uses a 
survey that is analyzed with structural equation modeling. 
Study 2 is qualitative and offers corroborating evidence for 
the proposed theoretical model. Study 3 extends the model 
with a more naturalistic sample and tests several alternative 
hypotheses using hierarchical regression. 


Conceptual Foundation 
Human Brands and the Branding Literature 


“Your client, whether they are an athlete or an actor or an 
actress, has intangible assets: a name, a reputation, a cred- 
ibility and an image. All of those attributes may be com- 
bined into something that could be made into a brand.” 

—Brian Dubin (quoted in Towle 2003) 


This article’s underlying premise is that human brands may 
be viewed as one of several operationalizations of the 
broader concept of a brand. In marketing, the term “brand” 
is typically applied to firms, products, and services, and in 
general, marketers accept that brands may be described in 
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terms of perceived quality, image, and so forth. ‘Celebrities 
can also be considered brands because they can be profes- 
sionally managed and because they have additional associa- 
tions and features of a brand. Evidence for this idea can per- 
haps be most readily found in the context of political 
campaigns, in which the candidate's message, public 
appearance, endorsements, and so on, are all controlled by 
consultants and political parties in the hopes of managing 
perceived quality and brand image to increase "market 
share" at the polls (Simon, Gilgoff, and Samuel 2004). 
Likewise, the National Basketball Association recently 
established an off-the-court dress code for its players in an 
effort to improve the league's appeal by managing the 
image of its individual human brands (Lee 2005). However, 
many of the players (e.g., Allen Iverson) had: previously 
established and lucrative images that were misaligned with 
the new dress code; this underscores the importance of 
understanding how to better manage the brand Peay of 
individual human brands. 


Why Attachments Matter 


Considerable research has examined features of the various 
types of relationships that people form with human brands. 
For example, idolatry (Houran, Navik, and Zerrusen 2005), 
fandom (Leets, De Becker, and Giles 1995), and celebrity 
worship (Dietz et al. 1991) have been explored. Much of 
this research has been advanced under the rubric of attach- 
ment theory and its associated literature, which is both well 
articulated and relevant to marketing (Kleine and Baker 
2004). Attachments are a type of strong relationship that 
people usually first experience as children with their par- 
ents; later in life, these attachments routinely develop with 
other "targets," such as human brands (Leets, De Becker, 
and Giles 1995). A person immersed in such an emotionally 
significant relationship normally perceives the relationship 
partner as differentiated and irreplaceable (i.e.,, target spe- 
cific; see Hazan and Shaver 1994). When these types of 
relationships are experienced in reference to human brands, 
they are typically referred to as "secondary object" attach- 
ments and have been described as "intimacy at'a distance" 
(Horton and Wohl 1956). 

This article focuses on the strength of attachment, 
defined as the intensity of a person's target-specific emo- 
tional bond with a human brand. Prior research reveals that 
attachments are differentiated from other constructs. For 
example, the strength of an attachment is orthogonal to 
involvement, satisfaction, loyalty, and attitude favorability 
(Ambler et al. 2002; Thomson, MacInnis, and Park 2005). 

This literature suggests that attachment theory can make 
a contribution to marketing because of the distinctive quali- 
ties of an attachment. For example, much as marketers may 
attempt to create relationships that are trusting, committed, 
and satisfied (Fournier, Dobscha, and Mick 1998), people 
routinely report elevated levels of each in their attachments 
to a variety of objects (Rempel, Ross, and Holmes 2001; 
Spake et al. 2004). Prior literature points to the indepen- 
dence of attachment strength from trust, satisfaction, and 
commitment, but the correlation among these constructs 
suggests that understanding how to create or intensify 
attachments could offer both an effective and an economical 


means of achieving stronger marketing relationships. 
Indeed, Fournier’s (1998) article on consumer-brand rela- 
tionships suggests that feelings linked to attachments are 
fundamental to strong brand relationships. Therefore, 
attachment strength may provide a parsimonious and unidi- 
mensional indicator of “relationship quality” or strength. 

Furthermore, more intense attachments are linked to 
several consequences that are desirable to marketers. For 
example, attachments in a marketing context may prevent 
consumer defections (Liljander and Strandvik 1995), 
increase consumers’ forgiveness in the face of negative 
information (Ahluwalia, Unnava, and Burnkrant 2001), and 
predict brand loyalty and willingness to pay (Thomson, 
MacInnis, and Park 2005). If marketers understand what 
determines the strength of attachments, they may be in a 
better position to foster more durable relationships with 
consumers. 

Finally, research implies that attachments may be piv- 
otal to understanding customer-based brand equity, in 
which certain brands “resonate” with consumers and indi- 
cate financial value for the firm (Ambler et al. 2002; Keller 
2001). For various reasons, therefore, contemplating what 
strengthens attachments seems to be an endeavor that is 
worthwhile from both a practical and a theoretical 
standpoint. 

Relationship similarities. In some respects, relation- 
ships involving human brands are a hybrid of other relation- 
ships. For example, they are pertinent to marketing because 
of the central role of the consumer, but they are also a rele- 
vant extension of interpersonal research because they impli- 
cate a bona fide human being (e.g., David Beckham, Hilary 
Swank), not a comparatively inanimate object (e.g., Tide 
laundry detergent). Prior research has considered celebri- 
ties, well-known figures, and group entities, such as sports 
teams or musical acts, and has documented how they are 
similar to interpersonal relationships in many ways. For 
example, people often experience “seeming face-to-face” 
relationships with human brands who are “met as if they 
were in the circle of one’s peers” (Horton and Wohl 1956, p. 
215). These relationships evince many of the same expecta- 
tions, cognitions, emotions, and behaviors that operate in 
normal interpersonal relationships to the point that a con- 
sumer might view a human brand as a pleasant companion, 
good friend, or romantic mate (Cole and Leets 1999; Rubin 
and McHugh 1987). Many of the same variables that pro- 
vide the basis for interpersonal attachments operate with 
respect to human brands, such as increasing perceptions of 
a shared background and elevated social appeal (Cole and 
Leets 1999; Perse and Rubin 1989). 

Relationship differences. Although research in market- 
ing has embraced the idea that consumer relationships may 
be analogous to interpersonal bonds (Fournier 1998), there 
are differences between the two; however, in general, these 
important difference are underresearched. For example, 
Rubin and McHugh (1987) note that unlike interpersonal 
relationships, people’s relationships with celebrities are less 
likely to be truly interactive (e.g., mutual self-disclosure 
and interrogation are not possible). However, the effect of 
these and other differences in relationship functioning is not 
well understood (Kleine and Baker 2004). Until their 
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impact is addressed, comprehension of how consumers’ 
human-brand attachments form may be advanced by attend- 
ing to theory development that is dependent on similarities 
between these contexts. That is, many of the same causes of 
strong attachments operate when the focal object is a per- 
son, a brand, or some hybrid of the two (i.e., human 
brands). 


What Strengthens Attachments? 


In general, it is construed that the chief function of attach- 
ments is to confer emotional security to the attached party 
by being responsive to a person’s needs (Hazan and Shaver 
1994). This theory is expanded on by recent work on funda- 
mental human needs that suggests that if an object is 
responsive to a person’s needs for autonomy, relatedness, 
and competence (i.e., A-R-C), intense attachments may 
result (La Guardia et al. 2000; Ryan and Deci 2000). 


Defining “autonomy.” Autonomy refers to a person’s 
need to feel that his or her activities are self-chosen, self- 
governed, and self-endorsed (Deci and Ryan 2000; Ryan 
and Deci 2000). Associated with the fulfillment of this need 
is a person's perception that he or she is free from pressure 
to behave 1n certain ways and is able to express him- or her- 
self as he or she wishes. À person whose need for autonomy 
is satiated is likely to report feelings of “volition, agency, 
and initiative" (La Guardia et al. 2000, p. 368). Autonomy 
is similar to what Austin and Vancouver (1996, p. 357) label 
"self-determination," that is, a person's sense of freedom to 
make his or her own choices and to avoid feeling con- 
strained or coerced. 


Defining "relatedness." Relatedness refers to a person's 
need to fee] a sense of closeness with others (Deci and Ryan 
2000). The need for relatedness is a homonomous tendency; 
it is a desire to belong to a social sphere (Ryan and Deci 
2000) and to avoid feeling isolated (Austin and Vancouver 
1996). A person whose need for relatedness is satisfied is 
likely to report feeling “connected with and cared for by 
another" (La Guardia et al. 2000, p. 368). 


Defining “competence.” Competence refers to a per- 
son's innate, life-span tendency to seek feelings of effec- 
tiveness, achievement, and challenge in his or her activities 
(Deci and Ryan 2000). This idea is similar to what previous 
researchers have termed "mastery," which refers to a per- 
son's avoidance of mediocrity and decrements in perfor- 
mance (Austin and Vancouver 1996). A person whose need 
for competence is satiated will report feeling curious and 
skilled (La Guardia et al. 2000). 

The identification of these three variables emerges from 
recent studies on human motivation, and "although motiva- 
tion is often treated as a singular construct, even superficial 
reflection suggests that people are moved to act by very dif- 
ferent types of factors" (Ryan and Deci 2000, p. 69). Rather 
than considering motivation as rising from general human 
needs, the current research attempts to parcel out the spe- 
cihc motivations and needs that are active in particular 
social spheres by "asking what kind of motivation is being 
exhibited at any given time" (Ryan and Deci 2000, p. 69). 
As such, this article proposes that the consumer-human- 
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brand dyad may describe a relationship context in which 
these three needs are potentially important because their 
fulfillment in that context may lead to carefully targeted 
feelings of attachment. 


A-R-C as responsiveness. Autonomy, relatedness, and 
competence are fundamental human needs (Deci and Ryan 
2000). 'They are different from most needs because they are 
universal, innate, and enduring. However, this article is not 
interested in the existence of these needs but rather in the 
notion that specific partners, such as human brands, might 
provide an individual with relational inputs to satisfy these 
needs; this concept is referred to as "responsiveness." 

The idea of responsiveness is foundational to attach- 
ment research, and self-determination theorists propose that 
the fulfillment of the A-R-C needs qualifies as responsive- 
ness (e.g., Deci and Ryan 2000). Responsive relational part- 
ners “are ones who respond in ways that promote a person's 
experienced satisfaction of these basic [A-R-C] psychologi- 
cal needs" (La Guardia et al. 2000, p. 368). People gravitate 
toward relationships that serve their A-R-C needs; social 
experiences that make people feel autonomous, related, and 
competent also promote stronger attachments. 


Hypotheses 


There is conceptual and empirical support for the view that 
a responsive object contributes to the creation of intense 
attachments, but much of this literature provides only indi- 
rect support, has been overlooked by marketing researchers, 
or has not been applied to the realm of human brands. Con- 
ceptually, Ryan and Deci (2000) argue that the satisfaction 
of A-R-C needs leads to the “development of secure attach- 
ments within specific relationships" (p. 326) because "prox- 
imal supports for basic psychological needs in any relation- 
ship ... play a crucial role in predicting feelings of 
attachment" (p. 262). Although Ryan and Deci focus on an 
interpersonal context, they propose that the A-R-C needs 
might be fulfilled in other ways, inviting the possibility that 
human brands qualify as responsive relationship partners. 
Furthermore, La Guardia and colleagues (2000) asked 
participants to rate their levels of attachment security with 
respect to different interpersonal relationships. They also 
assessed the extent to which relationship partners satisfy 
participants’ A-R-C needs. The results show that greater 
A-R-C feelings positively predict attachment security. How- 
ever, because the study uses a composite measure of the 
A-R-C needs, it is not possible to determine the distinct 
contribution of each need in predicting attachment strength. 


Autonomy. Empirical research suggests that there is a 
positive relationship between felt autonomy and attachment 
security in teenagers (Ryan and Lynch 1989). By providing 
an atmosphere in which autonomy is encouraged, parents 
facilitate the strengthening of attachment bonds. Prior work 
also documents a positive correlation between respondents’ 
attachment and the extent to which their relationship part- 
ners make them feel autonomous in both interpersonal and 
human-brand contexts (Giles and Maltby 2004; Leak and 
Cooney 2001). Likewise, Schultz, Kleine, and Kernan 
(1989) argue that strong attachments are linked to the 
theme of individuation, reflecting a person’s desire to feel 


autonomous and in control of the self. They report that 
many objects to which respondents are strongly attached 
show evidence of this theme. Taken jointly, these results 
suggest that the degree to which a relationship partner 
serves the need for autonomy should positively affect the 
intensity of a person’s attachment to that partner. 


Hı: The more a person perceives a human brand:as fulfilling 
his or her autonomy needs, the more intensely attached he 
or she will be to the human brand. 


Relatedness. Similar to Deci and Ryan’s (2000) previ- 
ously discussed arguments, Baumeister and Leary (1995) 
argue that the fulfillment of the need for relatedness by an 
object results in increased positive affect, such as feelings 
of attachment and love. In addition, prior literature has 
demonstrated empirically that many consumers who are 
strongly attached to particular objects show evidence of 
"integration," which, in part, refers to their preference to be 
connected and joined with others (Schultz, Kleine, and Ker- 
nan 1989, p. 360). 


H5: The more a person perceives a human brand,as fulfilling 
his or her relatedness, the more intensely attached he or 
she will be to the human brand. 


Competence. As discussed previously, it is not possible 
to distinguish the independent effects of each A-R-C need 
on attachment strength in all prior empirical work (e.g., La 
Guardia et al. 2000). This is important when considering 
the dearth of empirical work that addresses the independent 
contribution of competence. Conceptually, research sug- 
gests that attachments can develop for reasons tied to self- 
cultivation and self-development (Kleine and Baker 2004), 
themes that are at least consistent with the idea of a person 
feeling competent as a result of consuming a particular 
object. For example, elderly people in nursing homes sur- 
round themselves with special possessions that serve as 
reminders of a lost sense of proficiency (Rubinstein and 
Parmelee 1992). Similarly, consumers who partake in cer- 
tain high-risk consumption activities can develop a sense of 
both mastery and emotional bonds in those contexts 
(Arnould and Price 1993; Celsi, Rose, and Leigh 1993), 
though it is unclear whether these bonds qualify as attach- 
ments because they may not be appropriately target spe- 
cific. This research has not been tested in the context of 
human brands and provides only conceptual or indirect 
empirical support. 

Evidently, the effect of the satiation of the need for 
competence on attachment is “a bit less straightforward" 
(La Guardia et al. 2000, p. 368). Although the satisfaction 
of both autonomy and relatedness needs is likely to occur 
primarily in emotionally significant relationships, a per- 
son's competence needs may be satiated in contexts that are 
unrelated to attachment, for example, at leisure, work, or 
elsewhere (Celsi, Rose, and Leigh 1993). Thus, if a husband 
fails to make his wife feel effective and capable (but does 
not actively make her feel ineffective or incapable), she may 
yet remain intensely attached to him because he satisfies her 
needs for relatedness and autonomy. Althoügh theory 
implies that the fulfillment of the need for competence may 
qualify as responsiveness and thus produce intense attach- 


ments, prior empirical research is less conclusive. Thus, 
there is reason to predict that the linkage between compe- 
tence and stronger attachments may be either positive or 
nonnegative. 


H3,: The more a person perceives a human brand as fulfilling 
his or her competence needs, the more intensely attached 
he or she will be to the human brand. 


Hay: The relationship between how a person perceives a 
human brand to be fulfilling his or her competence needs 
and intense attachments 1s nonnegative. 


Study 1 

Study 1 is designed to test H;—-H3 directly. In addition, 
because prior literature has described how strong attach- 
ments are associated with satisfied, trusting, and committed 
relationships (Feeney and Noller 1996; Simpson 1990) and 
because these variables are also pivotal to marketing rela- 
tionships (Fournier, Dobscha, and Mick 1998), Study 1 
examines whether strong attachments positively predict the 
three variables. This endeavor extends prior work to a novel 
context, permits the testing of the independence of these 
constructs from attachment strength, and highlights the 
managerial implications of consumer brand attachments. 


Method 


One hundred sixty-four undergraduate volunteers, solicited 
in signs posted around a university campus, were paid $5 in 
exchange for filling out a survey that took most respondents 
approximately 15 minutes to complete. Young-adult respon- 
dents may be particularly well suited to studying the 
domain of human brands because they constitute a coveted 
target market for human-brand marketers and because it is 
common for them to develop attachments to human brands 
(Boon and Lomore 2001; Perse and Rubin 1989). Volun- 
teers received one of two versions of an instrument whose 
preamble read as follows: "Many people think of them- 
selves as being attached to certain celebrities or well-known 
figures (e.g., actors, athletes, singers ...). Think about a 
celebrity or well-known figure to whom you are very [not] 
attached.” The two conditions (i.e., “very attached" and “not 
attached") were designed to create variance in responses on 
the dimension of attachment strength. Next, respondents 
answered a series of questions (in order) that mapped onto 
the dependent (i.e., attachment strength), independent (e.g., 
autonomy), and outcome (e.g., satisfaction) variables. 


Moasures 


The measures for the A-R-C needs were taken from the 
work of La Guardia and colleagues (2000), who developed 
metrics appropriate for an interpersonal context (see the 
Appendix). Using two pretests, these measures were 
adapted to suit the current human-brand context, though 
minimal changes were actually required. Respondents were 
asked to indicate the extent of their agreement (anchored by 
] = “strongly disagree" and 7 = "strongly agree") with a 
series of statements about the human brand (denoted as 
"XYZ"). The items for relatedness (œ = .81) were as fol- 
lows: “XYZ makes me feel cared about" and “I feel a lot of 
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closeness with XYZ.” Items for autonomy (a = .80) 
included “XYZ makes me feel controlled and pressured to 
be certain ways" (reversed) and “XYZ makes me feel free 
to be who I am.” Items for competence (à = .70) included 
"Generally, XYZ makes me feel very capable and effective" 
and “XYZ makes me feel inadequate or incompetent” 
(reversed). 

Previous research has argued that the degree to which 
people demonstrate separation distress is a good indicator 
of the strength of their attachment bonds (Berman and Sper- 
ling 1994). That is, with the real, imagined, or threatened 
separation from the object (e.g., death, loss), a strongly 
attached person will experience a negative emotional reac- 
tion that researchers can use as a measure of the intensity of 
the attachment bond. Prior research has advanced multi- 
item measures (Hazan and Shaver 1994; Hazan and Zeif- 
man 1994) that were adapted and extended to the current 
context. The four items (anchored by 1 = “strongly dis- 
agree" and 7 = "strongly agree") show good reliability (œ = 
.89): (1) "I feel better if I am not away from or without 
XYZ for long periods of time"; (2) "I miss XYZ when XYZ 
is not around"; (3) "If XYZ were permanently gone from 
my life, I'd be upset"; and (4) "Losing XYZ forever would 
be distressing to me.” 

Finally, the instrument included several metrics to 
gauge some of the potential positive qualities associated 
with people's stronger attachments and to provide further 
evidence that attachment strength is differentiable from 
prior constructs. Specifically, respondents were adminis- 
tered three metrics that were adapted from the perceived 
relationship quality scale (Fletcher, Simpson, and Thomas 
2000), which taps the degree to which respondents assess 
their relationships to be satisfying, trusting, and committed. 
Although this scale has not been widely used in marketing, 
it shows excellent internal consistency and validity. The 
items (all anchored by 1 = “not at all” and 7 = “very much") 
for satisfaction (a = .97) were as follows: “How satisfied 
are you with the relationship?" "How content are you with 
the relationship?" and “How happy are you with the rela- 
tionship?” Similarly, items for trust (œ = .92) were as fol- 
lows: "How much can you count on XYZ?" *How much do 
you trust XYZ?” and “How dependable is XYZ?” Finally, 
items for commitment (à = .98) were as follows: "How 
dedicated are you to the relationship?” “How committed are 
you to the relationship?” and “How devoted are you to the 
relationship?” 


Results 


There was an enormous range in the types of human brands 
that respondents listed, including actors, musicians, politi- 
cians, and media personalities. These figures represent a 
target of investment by, for example, movie studios, record 
labels, and political parties in terms of promoting the 
human brands and managing their brand associations. A 
manipulation check composed of an alternative measure of 
attachment strength (Thomson, MacInnis, and Park 2005) 
showed a significant difference between respondents in the 
weak- (M - 1.86) and those in the strong- (M - 4.05, p « 
.05) attachment conditions. 


Results of a multivariate analysis of variance. The 
results of a multivariate analysis of variance provide initial 
support for the idea that A-R-C may be important in the 
production of stronger attachments. The mean of autonomy 
in the strong-attachment condition (M = 3.66, SE = .18) 
was higher than that in the weak-attachment condition (M = 
1.98, SE = .17, p « .05). The mean of relatedness in the 
strong-attachment condition (M = 2.94, SE = .14) was sig- 
nificantly different from that in the weak-attachment condi- 
tion (M = 1.30, SE = .13, p « .05), as was the mean of com- 
petence in the weak- (M = 2.71, SE = .18) versus the 
strong- (M = 3.67, SE= .19, p < .05) attachment condition. 


Correlational analysis. A consideration of the correla- 
tion between the A-R-C needs and strong attachments mar- 
shals evidence of a significant linkage. Autonomy (r = .62, 
p < .05), relatedness (r = .70, p < .05), and competence (r = 
.44, p < .05) are each significantly correlated with attach- 
ment strength. In addition, the correlations among the 
A-R-C needs are positive (r's = .59—.75; see Table 1). 

Full structural models. The remaining analysis is con- 
ducted with structural equation modeling (AMOS). It is 
customary to evaluate structural models in reference to 
alternative conceptualizations of the data and by comparing 
fit statistics. As such, the analyses included three models. 

Model 1 is the hypothesized model, with the A-R-C 
indicators loading onto their respective factors, which are 
allowed to correlate and, in turn, predict attachment 
strength. Model 2 is the same as Model 1, except that the 
latent A-R-C needs are modeled as uncorrelated. Finally, 
Model 3 depicts all the indicators for the A-R-C needs load- 
ing onto a single latent factor. 

In each model, the relationships between the indicators 
and the latent variables are positive and significant. Model 1 


TABLE 1 
Correlation Matrix of Study 1 Measures 
1 2 4 5 6 7 
1. Autonomy — 75 69 62 11 58 55 
2. Relatedness — 59 70 30 72 .69 
3. Competence — 44 .12* 49 .45 
4. Attachment strength — .25 .66 .66 
5. Satisfaction — .46 33 
6. Commitment — .68 
7. Trust — 





Notes: All ps < .05, except *p = .14. 
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shows excellent minimum sample discrepancy divided by 
degrees of freedom (CMIN/DF = 1.92), comparative fit 
index (CFI = .96), and normed fit index (NFI = .92) statis- 
tics. Model 2, in which the A-R-C needs are uncorrelated, 
has universally unacceptable fit statistics and therefore is 
rejected. Specifically, the CMIN/DF statistic is inflated 
(6.04), and both the CFI (.77) and the NFI (.74) statistics 
are inferior. The fit statistics for Model 3 are mixed, with 


acceptable CMIN/DF (2.98) and CFI (.91) values but a mar- 
ginal NFI value (.87). Because of its better fit and because it 
has a significantly different chi-square (%2 = 92.27, d.f. = 
48) value from Model 3 (p < .05), Model 1 is superior (see 
Figure 1). 

H; proposes that when a human brand fulfills autonomy 
needs, a person is more likely to become strongly attached 
to it. The path between autonomy and attachment strength 
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FIGURE 1 
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Notes: From Study 1, Model 1 shows all tems for the Independent variables loading onto their respective latent factors, which predict attach- 
ment strength (x2 = 92.26, d.f. = 48; CMIN/DF = 1.92; CFI = 96; and NFI = .92). Model 2 Is Identical to Model 1, except that the A-R-C 
latents are uncorrelated (x? = 307.95, d.f. = 51; CMIN/DF = 6.04; CFI = .77; and NFI = .74). Model 3 shows all tems for the Independent 

, varlables loading onto one latent factor, which predicts attachment strength (x? = 158 07, d.f. = 53; CMIN/DF = 2.98; CFI = .91; and - 
NFI = 87). All path estimates are standardized and shown above the respective path. 


in Model 1 is positive (y = .41, p < .05), in support of Hj. 
The path between relatedness and attachment strength is 
also positive (y= .49, p < .05), in support of H;. Finally, Hs, 
is rejected, but H4; is supported; there is an insignificant 
relationship between competence and attachment strength. 
This finding allows for the possibility that a human brand 
must not negatively affect competence for a strong attach- 
ment to exist. 


Outcome variables. Although increasingly intense 
attachments may have positive qualities, these are not due 
to a lack of construct orthogonality. An exploratory factor 
analysis that included the attachment strength items and the 
satisfaction, trust, and commitment items was conducted 
(see Table 2). The results show a clean four-factor solution; 
no inappropriate loadings exceed .3, suggesting that these 
constructs are distinguishable. 

Furthermore, stronger attachments are characterized by 
consumers' perceptions of their relationships with the 
human brand as satisfied, trusting, and committed. The cor- 
relation matrix (Table 1) shows that attachment strength is 
positively correlated with satisfaction (r = .25, p « .05), 
trust (r = .66, p « .05), and commitment (r = .66, p « .05). In 
addition, a multivariate analysis of variance of the differ- 
ences in satisfaction, commitment, and trust between weak- 
and strong-attachment conditions suggests that there are 
benefits associated with strong attachments. In the strong- 
versus the weak-attachment conditions, respondents 
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reported having greater satisfaction (M = 4.46, SE = .18 
versus M = 3.14, SE = .19; F = 25.17, p < .05), commitment 
(M = 3.59, SE = .16 versus M = 1.66, SE = .17; F = 71.56, 
p < .05), and trust (M = 3.68, SE = .16 versus M = 1.90, 
SE = .17; F = 60.16, p < .05). 


Discussion 


The main contribution of Study 1 is that the fulfillment of 
autonomy and relatedness needs explains attachment 


' strength, but the need for competence does not. Although 


theory and prior research suggests that an object that makes 
a person feel competent may bolster attachments (La 
Guardia et al. 2000; Rubinstein and Parmelee 1992), the 
results suggest that competence does not play a role in 
strengthening attachments to human brands. Although Table 
1 reveals that competence is correlated with stronger attach- 
ments, its contribution is limited when the variance of 
autonomy and relatedness is taken into account. However, 
these same results suggest that the relationship between 
competence and attachment strength must be nonnegative. 
That is, an object that suppresses a person's competence 
needs may also diminish the strength of a person's attach- 
ment. This issue is considered again in the subsequent 
studies. | 

Although the results of Study 1 are encouraging, the 
choice of survey design dictates that some of the affective 
richness evident in consumers' attachments to human 


TABLE 2 
Study 1 Exploratory Factor Analysis with Relationship Constructs 


Item Satisfaction 
Satisfaction 1 .99 
Satisfaction 2 95 
Satisfaction 3 .94 


Attachment Strength 1 

Attachment Strength 2 

Attachment Strength 3 

Attachment Strength 4 

Trust 1 

Trust 2 l 
Trust 3 ' 
Commitment 1 | 
Commitment 2 

Commitment 3 


Factor 
Attachment 
Strength Trust Commitment 
.98 
.87 
75 
.67 
.96 
.88 
.72 
.97 
.93 
.91 


Notes: Principal axis factoring with Promax rotation; all loadings >.3 are reported. 


brands is lost (e.g., Rubin and McHugh 1987). It is clear 
from the literature that respondents use the same words to 
describe their attachments to human brands as they do to 
describe their strong interpersonal relationships (a result 
found by Fournier [1998] in reference to product brands). 
Study 1 builds primarily on the reliability and validity of the 
measures, whereas Study 2 employs a qualitative approach 
to corroborate the results of Study 1 and to flesh out some 
of the affective richness of consumers' attachments to 
human brands. In the process, Study 2 is intended to garner 
evidence that relationships characterized by strong attach- 
ments lie between and are comparable with both person-to- 
person (Hazan and Zeifman 1994) and pronaci -brand 
(Fournier 1998) attachments. 


Study 2 


Study 2 is a qualitative study that evaluates evidence of the 
A-R-C needs in the context of human brands to comple- 
ment the empirical results of Study 1. The use of qualitative 
procedures (Study 2) in conjunction with quantitative 
approaches (Studies 1 and 3) may be particularly well 
suited to research on attachments (Belk 1992) because, 
together, they richly describe the focal variable while statis- 
tically establishing the linkages among variables. 


Method 


Twenty-five undergraduate respondents were recruited on a 
university campus; each was paid $5 in exchange for 
answering questions about human brands in writing. No 
limitations were provided about which or what type of 
human brands to discuss; respondents were instructed, to 
discuss any human brand (“celebrity or well-known per- 
son”) to whom they felt “attached.” In addition to docu- 
menting the emotional quality of these stronger attach- 
ments, the aim was to determine whether the responsiveness 
of human brands, revealed by the A-R-C needs, was evident 
in the context of these attachments because prior research is 
not conclusive on this issue. A series of questions were 
posed; for example, respondents were asked to tell the story 


of their first experience with the human brand, to describe 
their best memory involving the human brand, and to con- 
vey what the human brand meant to them. Thus, attach- 
ments were examined from the consumer’s first-hand per- 
spective on the basis of relatively unstructured descriptions 
of salient experiences with the brand (Fournier and Mick 
1999). Two independent raters, who were blind to the pur- 
pose of the study, coded respondent feedback for evidence 
of strong attachments and the A-R-C needs. Disagreements 
between coders were resolved in discussions with the 
author. 


Results 


All responses showed evidence of the types of feelings typi- 
cally associated with attachments of moderate to intense 
strength. In addition, 48% of responses indicate evidence of 
autonomy, and 60% show evidence of relatedness. Only one 
response shows evidence of the respondent experiencing 
feelings of competence. Incorporated into the subsequent 
sections are excerpts from the responses that tended to be 
extensive and thoughtful. In each, the respondent self- 
selected a human brand as the object of feelings of attach- 
ment. For example, one woman described a musical act in 
the following terms: 


I can’t really remember when [ started loving them. At 
least 15 years ago. I think probably I fell in love with the 
song “Rose Coloured Glasses,” and then they played a 
rink in a town close to me and a friend and I went to see 
them. It was a great place to see them play because you 
could dance and get really close to the stage They had all 
these table and floor lamps on the stage. A really intimate 
setting, and there weren't that many people there. I think 
we paid $7 a ticket. They absolutely rocked. I've been 
addicted to them since then. 


In this excerpt, the respondent describes a "loving" rela- 
tionship that has endured for years. This pattern was com- 
mon; many respondents referred to powerful emotions 
while describing facets of the relationships that were inter- 
preted as evidence of specific A-R-C needs. For example, a 
respondent wrote the following about a band: 
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I bought all of their albums without hearing them (I had 
only had the live one previous) and became obsessed with 
their talent. I went to see them for the first time ever and 
was so incredibly moved. I often found myself starring 
[sic] in awe at them during the show. I was even moved to 
tears on a few songs. After, I met Jim [lead singer] and got 
a picture taken with him, and I just couldn't believe at 
how nice he was. I had become from that experience, as 
other fans like to say, a Rodeohead. This was less than a 


year ago. 


It is evident that this person's response to the band is 
affect laden. Furthermore, she now considers herself a 
“Rodeohead,” a group of enthusiastic followers of the band 
(Blue Rodeo). Although this group appears to be fairly 
unstructured, many of its members gather in online forums 
(e.g., www.discussions.bluerodeo.com) to share stories, 
news, and opinions about the band. This excerpt demon- 
strates that a Rodeohead (and this respondent) enjoys social 
rewards that are linked to the band and that are interpreted 
as serving a need for relatedness. The following provides 
additional evidence of feelings of relatedness: 


Ah, now this is a better story. Another live show. This was 
my 3rd of 3 concerts;... it was in April of 2000 I think (I 
haven't gotten to a show since then—starving student and 
all) at the Grand Theatre.... It was really a fantastic night, 
my dad and I went together. He's a bit of a fan himself, 
though not quite as die-hard as myself. It's something that 
has sort of brought us together. We didn't have the great- 
est of seats because we bought our tickets late, but it 
didn't matter too much because the venue was fairly 
small. It was really a pretty magical evening. I found 
myself getting lost in the music now and then, and I just 
couldn't keep my eyes from glossing over... Anyway, 
I've rarely felt so emotionally connected to anything. 


Here, the respondent's relationship with the band is 
explained by the notion that it brings her closer to her 
father, a fairly clear representation of relatedness, even 
while her feelings of attachment are also directed at the 
band itself. A similar result with respect to feelings of relat- 
edness is evident from the following excerpt in reference to 
a sports team, the Detroit Pistons: 


I used to attend games with my family as a child. I grew 
up watching Joe Dumars and Grant Hill all throughout 
high school. Basketball was a family event—a sanctu- 
ary—and I did my first internship with the company. The 
first game we went to was against the Portland Trailblaz- 
ers—I was about 11 years old. Dennis Rodman grabbed 
18 rebounds, and we cheered in the mid tier seats as they 
beat the Blazers by about 10 points. We had so much fun 
that game that going to the contests became a regular fam- 
ily event. 


Other excerpts described people's adopting the manner- 
isms of various human brands, such as a male respondent 
who wrote, “I’m even a fan in their club. I own all of their 
CDs plus bootlegs. I even write and sign my name to reflect 
my love for them." This excerpt reveals evidence of related- 
ness ("fan in their club") and the type of self-endorsed 
behavior lying at the core of autonomy, reflected in the 
decision to model his signature after the style of print asso- 
ciated with the band's (Metallica) album covers. 

Otber passages showed similar modeling behavior that 
signifies a freedom to express the self. For example, a male 
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respondent wrote about a band, "I strive to be like them 
because they haven't sold out and they are nice people who 
treat others well," and another respondent indicated that a 
particular performer (DM X) "inspires me to be a true man 
who will love my family and give back to my community. 
He isn't just a rapper. He's a legend and a great role model. 
He's one crazy hard-knocks guy, but he's also normal cuz 
[sic] he has kids and a wife and I respect that." Perhaps no 
example better captures the intersection of self-expression, 
self-endorsed behavior, and the feeling of freedom associ- 
ated with autonomy as the following excerpt provided by a 
male respondent in relation to a musical band: 


As a musician myself (guitar, banjo, accordion, bass) I 
enjoy their folky-jazzy-rock vibe mixed with lyrics that 
hit home to me. Their songs are intelligent, well-crafted, 
and presented with impeccable taste. I think the fact that 
the band 1s accessible before and after many shows adds 
to their appeal. They sound like rock stars on stage—but 
act like normal guys off-stage. They provide songs that 
speak to me. They provide a middle-agreed [sic] family 
guy with the responsibilities of a family and mortgage and 
stressful career, an outlet to cut loose. They also inspired 
me to pick up my guitar and write songs again. My wife 
adores Jim Cuddy. Back in the mid-80's/early 90's the 
band was unique—now there is a whole industry 
(alt.country) dedicated to this blend of music. I am 
unique—as I follow a band that most everyone I meet has 
never heard of. 


Finally, many respondents discussed objects they per- 
ceived as possessing a high degree of skill. For example, 
one respondent wrote about tennis player Martina Hingis 
having “a pretty solid game.” Another respondent discussed 
Tori Amos as having “a spectacular singing voice, plays the 
piano masterfully, and is an amazing lyricist.” One thought 
that Matthew McConaughey was “really good. We’re talk- 
ing Oscar here,’ whereas another described the World 
Wrestling Federation as follows: “The acting is superb and 
not just movie actors. These athlete/actors have to get each 
detail right because they are in front of a life [sic] audience 
5—6 days a week.” However, in all but one case, respondents 
attributed skill to the human brand. With one exception, 
none of the human brands appeared to make the respon- 
dents themselves feel competent. The single exception was 
a respondent who wrote the following about a band: 


I like to emulate what they have raised me on (so I feel). 
They were a positive role model for me in my teen years, 
providing a degree of sanity and a beacon of relief in a 
turbulent world I dealt with during that time. I admire all 
these traits in them and try to make them a part of my life 
in my own, different ways. Even when I meet a goal. I 
then set out for a new one to challenge me. 


Whereas this excerpt shows evidence of self-expression 
(“make them part of my life"), the reference to being chal- 
lenged is interpreted as fulfilling his need for competence. 
In contrast to the needs for autonomy and relatedness, there 
was only limited evidence for respondents' feeling attached 
to a human brand because of feelings of competence. 


Discussion 


The juxtaposition of feelings indicative of the A-R-C needs 
and feelings of attachment provides correlational support 


for the first and second hypotheses. The relative absence of 
feelings of competence experienced by the respondents 
does not contradict the results of Study 1. Both suggest that 
competence may be present but does not explain strong 
attachments in the context of human brands. However, there 
is also another possible explanation for this result, namely, 
that the nature of the human brands selected by respondents 
(primarily musical performers, athletes, and actors) or the 
age (18—25 years) of most respondents ruled out!discussion 
of competence, which, in relation to other human brands 
(e.g., politicians, religious figures), might actually be linked 
to stronger attachments. However, it is clear that younger 
adults develop human-brand attachments (Boon and 
Lomore 2001; Cole and Leets 1999), suggesting that the 
choice of sample was appropriate (this issue is reexamined 
in Study 3). The instructions to respondents allowed them 
to discuss any human brand they desired. The predomi- 
nance of actors, musicians, and sports figures in the sample 
corresponds to prior research that suggest that these are the 
most common categories of human brands that people typi- 
cally describe (North, Bland, and Ellis 2005). Furthermore, 
respondents were clearly familiar with and actively contem- 
plating the idea of skill and competence (e.g., Martina 
Hingis, Tori Amos), except that the attachment object was 
almost universally the target of such attributions. Study 3 
further addresses the role of competence. 

Study 2 presents evidence that respondents use the same 
words (e.g., love, adoration, emotional connection) to 
describe their attachments to human brands as they use in 
the context of other attachments (e.g., romantic). Thus, it 
would be premature to consider consumers’ human-brand 
attachments trivial or unimportant; the meaningfulness and 
intense emotions ascribed to these relationships argue 
against this conclusion. Furthermore, although much of the 
literature cited in laying out the theoretical model originates 
in the interpersonal domain, the results suggest that the 
human-brand context is sufficiently similar to apply the 
A-R-C framework constructively. 

In conjunction with Study 1, the results of Study 2 are 
encouraging, but additional support for the hypotheses and 
for the role of competence can be obtained by assessing 
attachment strength in a more natural environment. This is 
pursued in Study 3. In addition, consistent withithe efforts 
of Study 2 to describe the phenomena more fully, Study 3 
uses expanded measures of the A-R-C needs to ensure that 
the domain of these constructs is adequately captured. 


Study 3 ! 


The hypotheses tested in Study 1 pertain to the role of the 
A-R-C needs in reference to human brands. Each is speci- 
fied by self-determination theorists as potentially important 
to strengthening attachments. However, to address several 
alternative hypotheses, the analysis is expanded beyond the 
A-R-C needs. Specifically, there are other important human 
needs that are at the same high-order level of analysis as the 
A-R-C needs. For example, a recent publication that 
reviews and consolidates prior research on important human 
needs (see Sheldon et al. 2001) advances measures of vari- 


ous needs discussed in the overlapping research of scholars 
such as Maslow, Epstein, and Atkinson; these needs are 
referred to as (1) "self-actualization," (2) “pleasure— 
stimulation,” (3) “money—luxury,” (4) “self-esteem,” and (5) 
“popularity—influence.” This background is important in the 
current context because the extant literature seems to 
advance indirectly the more expansive view that servicing 
needs unrelated to the A-R-C needs may also bolster attach- 
ments. Although the articles cited subsequently support or 
are consistent with the view that the A-R-C needs are 
important, they also propose that through the course of an 
object’s serving additional needs, a consumer may develop 
feelings that are emblematic of stronger attachments. 

For example, Keller (2001) indicates that when a con- 
sumer gains feelings of fun, excitement, and self-respect 
from a brand, he or she may be more likely to become 
attached it. These feelings seem to mirror the needs for 
pleasure-stimulation and self-esteem (Sheldon et al. 2001). 
Therefore, a positive relationship between fulfillment of 
these two needs and increasingly strong attachments to 
human brands might be expected. 

Similarly, Escalas and Bettman (2005) propose that a 
consumer will form self—brand connections when the brand 
helps him or her achieve goals, such as providing self- 
esteem, particularly when the brand is symbolic and when a 
person’s in-group has already embraced the brand. This rea- 
soning indicates that a person’s needs for self-esteem, 
money-luxury (i.e., the brand’s symbolic stature), and 
popularity—influence (i.e., the brand as a tool for a person to 
maintain position within a group) may be active in the 
process of attachment formation. In turn, this suggests that 
as an object services a person’s needs for self-esteem, 
money-luxury, and popularity-influence, attachments 
become more intense. 

Finally, Pimentel and Reynolds (2004) describe con- 
sumers who become more affectively committed to sports 
teams after undergoing a transcendent process of “sacraliza- 
tion," in which the team takes on meaningful status in the 
consumer's mind. This process seems consistent with Shel- 
don and colleagues' (2001) description of self-actualization, 
which refers to a person's identification of important 
aspects of life's meaning and purpose. Accordingly, it might 
be expected that the relationship between attachment 
strength and self-actualization, as well as the other basic 
needs, will be positive and significant. 

H4: The more a person perceives a human brand as fulfilling 
his or her needs for (a) self-actualization, (b) pleasure— 
stimulation, (c) money-duxury, (d) self-esteem, and (e) 
popularity-influence, the more he or she will be intensely 
attached to that human brand. 


This hypothesis does not undermine the role of the 
A-R-C needs in strengthening attachments. It is possible 
that these additional needs are complementary. Study 3 
retests the hypotheses with a nonstudent sample in a more 
naturalistic setting and considers these five alternative needs 
in an effort to address the rival hypothesis that the satisfac- 
tion of any important need explains stronger attachments. 
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Method 


Adult respondents were recruited by posting a letter of 
solicitation on Web sites organized as online communities 
for or by fans of various celebrities. These sites provide 
forums in which people gather electronically to share their 
thoughts and feelings about a celebrity with like-minded 
people. These sites were targeted because they are accessi- 
ble and populated by people who are communicative and 
familiar with (though not necessarily attached to) at least 
one celebrity. Along other dimensions, such as age, geo- 
graphic location, and gender, there is enormous heterogene- 
ity in membership within and among sites. 

A soliciting message was posted at each forum, which 
directed volunteers to send an e-mail to a university- 
affiliated address that was set up to for this study. To pro- 
vide an incentive, the soliciting letter informed people that 
respondents who completed the survey would be entered 
into a random drawing for one of five prizes of $50 and 
would be eligible to receive a summary of the research. 
Those who sent an e-mail were directed to the location (i.e., 
a URL) of the survey, which was constructed for the Web 
using the nTreePoint software package. The study descrip- 
tion, instructions, and order of presentation for the mea- 
sures were nearly identical to those of Study 1, and respon- 
dents were randomly assigned to list a human brand to 
whom they either were or were not strongly attached. 


Measures 


The metric used to assess attachment strength was com- 
posed of the same four items used in Study 1 and showed 
good reliability (a = .93; see the Appendix). The A-R-C 
needs were each measured with the items employed in 
Study 1; additional items were adapted from the work of La 
Guardia and colleagues (2000) and Sheldon and colleagues 
(2001). Compared with Study 1, these showed improved 
internal consistency (as = .83—.89). 

The preamble for each metric that assessed needs read, 
*When I see, hear, think about or interact with this 
celebrity,..." The need for relatedness was measured with 
the two items from Study 1 plus four additional items: “T 
feel loved and cared about,’ “I often feel a lot of distance 
from this celebrity,” “I feel like I am in contact with some- 
one who cares for me and who I care for," and "I feel close 
and connected to a person who is important to me." The 
need for autonomy was measured with three items in addi- 
tion to those used in Study 1: “I feel that my choices are 
based on my true interests and values," “I feel free to do 
things my own way," and "I feel that my choices express 
my ‘true’ self.” The need for competence included three 
additional items: “I feel that I can successfully complete 
difficult tasks and projects,” “I feel that I can take on and 
master hard challenges,” and “I feel very capable in what I 
do.” 

Measures of other needs were included to assess the 
rival scenario that, in general, need fulfillment leads to 
strong attachments; these measures were adapted from the 
work of Sheldon and colleagues (2001). Respondents were 
asked in relation to each need the extent of their disagree- 
ment (“strongly disagree") or agreement (“strongly agree") 
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with certain statements. Items for self-actualization (à = 
.95) were as follows: “I feel like I am “becoming who I 
really am,” “I feel a sense of deeper purpose in my life,” 
and “I feel a deeper understanding of myself and my place 
in the universe.” Pleasure-stimulation (& = .88) was mea- 
sured with the following items: “I feel I am experiencing 
new sensations and activities,” “I feel intense physical plea- 
sure and enjoyment,” and “I feel I have found new sources 
and types of stimulation for myself.’ The three items tap- 
ping money-luxury (& = .91) were as follows: "I feel I am 
able to buy most of the things I want,” “I feel that I have 
nice things and possessions,” and “I feel that I have plenty 
of money.” The measure of self-esteem ( = .84) was com- 
posed of the following: "I feel I have many positive quali- 
ties," “I feel quite satisfied with who I am,” and "I feel I 
have a strong sense of self-respect.” Finally, people's need 
for popularity-influence (à = .85) was assessed by the fol- 
lowing: “I feel that I am a person whose advice others seek 
out and follow," “I feel that I strongly influence other 
people's beliefs and behavior, and "I feel that I have a 
strong impact on what other people do." 


Results 


There were 107 respondents (26 males and 81 females), and 
they ranged in age from 18 to 60 years, with an average age 
of 33 years. Although gender and age were initially consid- 
ered covariates in the subsequent analysis, neither predicted 
attachment strength and thus were dropped from the analy- 
sis. Although the instrument did not explicitly collect data 
on nationality, it was apparent from respondents' e-mail 
addresses and signatures that the sample was geographi- 
cally diverse. For example, respondents appeared to be 
located in Australia, Canada, Hong Kong, New Zealand, 
South Africa, Spain, Sweden, Uruguay, and the United 
States. 

There was an enormous range in the types of human 
brands selected by respondents, including actors, comedi- 
ans, models, directors, radio personalities, writers, singers, 
athletes, musicians, politicians, and royalty. For example, 
respondents listed Ben Affleck, King Albert II, Lucinda 
Williams, Mario Lemieux, Viggo Mortensen, Michael 
Biehn, and Joan Crawford. Respondents also varied signifi- 
cantly in their popularity, activity level (e.g., retirement sta- 
tus), and profession. 


Correlation. Each of the three A-R-C needs was posi- 
tively correlated with the measure of attachment strength 
(see Table 3). The correlations between attachment and 
autonomy (r = .37, p < .05), relatedness (r = .31, p « .05), 
and competence (r = .20, p « .05) were all positive and sig- 
nificant. In general, the correlation between attachment 
strength and the other needs was weak or nonsignificant: 
self-actualization (r = .16, p = not significant [n.s.]), 
pleasure-stimulation (r = .19, p < .05), money—luxury (r = 
.15, p — n.s.), self-esteem (r = .09, p = n.s.), and popularity— 
influence (r = .19, p « .05). 


Hierarchical regression. Using hierarchical regression 
allows variables entered in prior stages to account for vari- 
ance in the dependent variable (i.e., attachment strength) 


TABLE 3 
Correlation Matrix of Study 3 Measures 








Variable 1 2 3 4 5 6 7 8 9 

1. Attachment strength — .38 .31 .20 .16* .19 .15* .09* .19 
2. Autonomy — .37 .60 .47 .20 .35 .33 .J3 
3. Relatedness — .52 .67 52 .45 42 .36 
4. Competence — 56 .35 .46 .69 45 
5. Self-actualization — 76 .56 .48 .37 
6. Pleasure-stimulation — .33 38 .27 
7. Moneyuxury — .50 .A4 
8. Self-esteem — .66 


9. Popularity-influence 





Notes: All ps < .05, except *p = n.8. 


before the independent contribution of variables entered in 
subsequent steps is assessed. Thus, it provides a relatively 
conservative test of the contributions of the variables 
entered last. 

Considering the weak relationships between attachment 
strength and the non-A-R-C needs that were revealed by 
the previous correlation analysis, it might be expected that 
there is little or no relationship between an object’s fulfill- 
ing these alternative needs and attachment strength. A con- 
sideration of the hierarchical regression with the alternative 
(i.e.; non-A-R-C) needs entered in the first step and the 
A-R-C needs entered second shows that this is true. 

In Regression 1 (see Table 4), in which the A-R-C needs 
are not entered, none of the alternative needs significantly 
predicts attachment strength. The coefficients for self- 
actualization (B = .01, p = n.s.), pleasure-stimulation (B = 
20, p = n.s.), money—luxury (B = .13, p = n.s.), self-esteem 
(B = —.06, p = n.s.), and popularity-influence (B = .10, p = 
n.s.) are uniformly insignificant. In light of (1) the weak or 
insignificant correlations between attachment and the inde- 
pendent alternative needs and (2) the weak to moderate cor- 
relations among the alternative needs, it is unlikely that this 
result is due to multicollinearity. 

Likewise, in Regression 2, which includes the A-R-C 
needs in the second step, none of the alternative needs is a 
significant predictor of attachment strength. Combined with 


the results from Regression 1, this demonstrates a lack of 
support for H4. However, the links between two of the three 
A-R-C needs and the dependent measure are significant. 
Specifically, autonomy (D = .31, p < .05) and relatedness 
(B = .29, p < .05) are significant predictors, but competence 
(B = —.03, p = n.s.) is not. Thus, the results of Study 3 sup- 
port Hi, Hp, and H3p. 


Discussion 


Study 3 mirrors Study 1, insofar as both reveal that the sat- 
isfaction of autonomy and relatedness predicts strong 
attachments. In interacting with human brands, people who 
are made to feel capable of self-expression and self- 
governance and those who are made to feel a sense of close- 
ness and intimacy are likely to develop strong attachments. 
In Studies 1 and 3 and in support of Hap, the relationship 
between competence and attachment strength was not sig- 
nificant, suggesting that at least in a celebrity context, feel- 
ings of efficacy are not fundamental to promoting stronger 
attachments. Furthermore, competence and five additional 
needs were considered but were found to be insignificant 1n 
Study 3; this suggests that general need fulfillment does not 
explain the creation of strong attachments. Although other 
predictors of attachment strength may be identified in fur- 
ther research, the results suggest that autonomy and related- 
ness play an important role. 


TABLE 4 








Study 3 Hierarchical Regression Results 
Step In Regression 1 Regression 2 
Hierarchical Standardized Standardized 
Regression Predictor i Beta p Value Beta p Value 
(Constant) .00 .01 
Step 1 Self-actualizatlon .01 .94 —.31 .15 
Pleasure—stimulation .20 .24 .26 .12 
Money-luxury 13 43 .14 .37 
Self-esteem —.06 .72 —.13 .50 
Popularity-influence .10 52 10 52 
Step 2 Autonomy — — .31 .03 
Relatedness — — .29 .05 
Competence — — —.03 .89 


Notes: Dependent variable = attachment strength 
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General Discussion 


Study 1 revealed that consumer attachments to human 
brands result in elevated levels of satisfaction, trust, and 
commitment. Study 2 revealed the affective richness of the 
experience for consumers attached to human brands. Study 
3 offered additional support for the theoretical model using 
a separate sample. On the basis of this and other research 
(e.g., Ambler et al. 2002; Thomson MacInnis, and Park 
2005), it is reasonable to conclude that attachment strength 
is separable from trust, satisfaction, commitment, and a 
variety of other constructs, such as favorable attitudes and 
brand loyalty, that have been linked to strong relationships. 
The promise of attachment strength to marketers is that the 
construct is predictive of these and other relationship fea- 
tures (e.g., Ahluwalia, Unnava, and Burnkrant 2001; Liljan- 
der and Strandvik 1995), is theoretically rigorous and well 
articulated, and is achievable through two means (i.e., creat- 
ing feelings of autonomy and/or relatedness). Few, if any, 
relationship constructs appear to offer the same comprehen- 
sive benefits as attachment strength, which may provide a 
useful means of conceptualizing and an efficient metric for 
assessing the strength of consumer-brand relationships. 


Managerial Implications 


If consumers can be made to feel autonomous and related, 
organizations may be able to foster strong attachments and 
thus enjoy the benefits of superior relationships with con- 
sumers. In general, human brands that make consumers feel 
appreciated, empowered, and understood succeed in creat- 
ing feelings of autonomy. Similarly, feelings of relatedness 
are likely to be produced when a human brand promotes 
acceptance, openness, and belonging. 


General conditions. For an attachment to form, the con- 
sumer must initially be minimally attracted in some way to 
the human brand, whether socially or in some other manner 
(Boon and Lomore 2001). In other words, an attachment is 
unlikely to develop if the starting point of the relationship is 
characterized by intense negative feelings or thoughts. This 
suggests that managers must introduce a human brand to 
the world carefully and deliberately and choose a position- 
ing that is appealing and sustainable over time. 


Need for interaction. The importance of human brands 
interacting with consumers was underscored by the qualita- 
tive results from Study 2, in which one respondent spoke of 
having his picture taken with the human brand. Repeated 
interaction between a consumer and the human brand 
reduces uncertainty and provides the basis for an attach- 
ment to grow (Berman and Sperling 1994), but in the con- 
text of human brands, it is not just the quantity but also the 
quality of interaction that matters. 

With direct interaction, consumers are more likely to 
view the human brand as accessible, increasing the opportu- 
nity for feelings of autonomy and relatedness to be pro- 
moted. Although "being seen" (e.g., public appearances) 
may be acceptable as a form of indirect interaction, more 
direct means offer a richer, more rewarding choice. Firms 
should focus on determining how to make their human 
brands approachable. For example, technological alterna- 
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tives for direct interaction show potential not only because 
they expand the human brand's reach and exposure. If the 
human brand had a meaningful presence online through the 
use of blogs or chat rooms, attached consumers would 
gather to ask questions or to learn more about the human 
brand in his or her own words. An example of this direct 
approach is Mark Cuban, owner of the Dallas Mavericks 
basketball team, whose immense popularity is linked to his 
spending several hours daily answering e-mails from fans 
(Fielding 2005). The direct approach not only improves the 
quality of interaction but also may encourage feelings of 
autonomy and relatedness by reducing hierarchy, making 
consumers feel appreciated and understood, and reducing 
the prominence of economic motives by the human brand 
(Silvera and Austad 2004). 

Interaction should also occur on a consistent basis 
(Beech, Chadwick, and Tapp 2000). Consumers may 
respond better to a human brand who has a routine that con- 
sumers may view as a guarantee of interaction. For exam- 
ple, Ellen DeGeneres is known for a daily segment on her 
show in which she invites the audience (both in the studio 
and watching on television) to dance with her. This is an 
especially astute move because of its regularity and because 
it simultaneously promotes feelings of autonomy (what 
DeGeneres calls “an expression of freedom") and closeness 
(dancing "with" a celebrity; see Elber 2005). Ensuring 
regular interaction between a consumer and a human brand 
helps create the conditions under which an attachment is 
likely to grow. 

Role of authenticity. Human brands that are perceived as 
authentic are more readily embraced (Cole and Leets 1999; 
Perse and Rubin 1989). Research has elaborated on authen- 
ticity (Rose and Wood 2005), but what the term means in 
reference to a human brand remains an underresearched 
topic. However, several ideas are promising (see Tolson 
2001). For example, authenticity is best developed slowly 
and deliberately, thus signaling a long-term view, and by 
avoiding perceived opportunism by the human brand, which 
can signal that the human brand has “sold out.” Likewise, 
when human brands are seen publicly in a nonperforming 
role, they should not be “perceived as acting” (Tolson 2001, 
p. 445), because this may lead consumers to feel manipu- 
lated and to view the human brand as lacking credibility. In 
study 2, one consumer spoke appreciatively of the band to 
which he was attached acting “like normal guys off-stage.” 
Away from their trade, human brands should not be viewed 
as trying too hard to convey an image, or they will risk 
being viewed as “pretenders.” 

The role of competence. In all three studies, competence 
was not effective in bolstering attachments in the context of 
human brands. If the popular press is any indication, con- 
sumers can form attachments to human brands who lack 
any appreciable skills (e.g., Paris Hilton). However, human 
brands who make consumers feel incompetent are likely to 
weaken attachments or prevent them from forming in the 
first place. For example, when comedian Dennis Miller 
joined the Monday Night Football broadcast team, his addi- 
tion was intended to halt a slide in ratings. However, on the 
show, Miller used indecipherable analogies, complex dic- 
tion, and obscure references that alienated his audience and 


made them feel inadequate and unintelligent. In other 
words, many viewers were made to feel incompetent; this 
not only caused a weakening of any possible preexisting 
attachments but also prevented any new attachments from 
forming. Although consumers do not become attached to a 
human brand through feelings of competence, the results 
suggest that managers must nonetheless be careful not to 
permit the degradation of competence to be linked to their 
human brand. 

However, caution must be taken in extending these 
results to other domains in which attachment. and self- 
determination theories may provide a useful road map to 
building attachment bonds. For example, with respect to 
product functional brands that are intended; to solve 
consumption-related problems (Park, Jaworski, ‘and Mac- 
Innis 1986), consumers may experience feelings of compe- 
tence as a result of brand use. Similarly, consumer relation- 
ships that implicate particular service providers, such as 
coaches or psychologists, may be explicitly designed to 
develop particular skills, and it is common for strong 
attachments to develop in such contexts (Colin 1996). How- 
ever, only further empirical study of the issue can ascertain 
whether attachments in these contexts become strong 
because of the servicing of competence needs or whether 
competence is merely a correlate without any meaningful 
effect when the variance of autonomy and relatedness are 
taken into account. 


Potential as endorsers. Marketing research that has con- 
sidered human brands has tended primarily to address the 
choosing and measuring of the effectiveness of celebrity 
endorsers to enhance the appeal of a brand (Agrawal and 
Kamakura 1995). Although the current article, does not 
directly address the use of human brands as endorsers, prior 
research and this article imply that human brands to whom 
consumers are attached may be unusually effective. One 
way to assess the appeal of a particular celebrity is the Q 
ratio, a commonly used but sometimes “misleading” mea- 
sure of popularity indicated by the celebrity’s familiarity 
and likeability (Knott and St. James 2002, p. 1428). Com- 
pared with the likeability dimension, an attachment is more 
significant and profound to an individual (Thomson, 
MacInnis, and Park 2005). Although consumers may like 
many celebrities, they will be strongly attached to only a 
few and be willing to sacrifice for and invest in these human 
brands to a greater extent. 

In a related vein, research suggests that consumers who 
have positive feelings toward a particular brand are more 
likely to embrace a brand extension because of a more auto- 
matic and direct transfer of positive affect to the affiliated 
product (Yeung and Wyer 2005). This result occurs even 
when the extended brand has comparatively poer fit with 
the core brand, which implies that attachment may facilitate 
a highly effective transfer of feelings from the human brand 
to the sponsored object. In the context of celebrity 
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endorsers, this suggests that there does not necessarily need 
to be a high degree of fit between an endorser and a particu- 
lar brand if there is a strongly attached target market. A firm 
employing a human brand to whom consumers are strongly 
attached as an endorser may anticipate greater returns and 
insulation from transgressions (e.g., celebrity involvement 
in scandals; see Knott and St. James 2002) than a firm using 
a human brand who is merely well liked. 


Limitations and Further Research 


A result of these studies that has not yet been addressed is 
the overall weakness of attachments to human brands. In 
study 2, respondents used language that suggested that their 
attachments were strong and emotionally significant. This 
result mirrors prior qualitative work on different types of 
consumer-brand relationships (Fournier 1998; Schultz, 
Kleine, and Kernan 1989). However, respondents in Study 
I's strong-attachment condition reported attachment scores 
that, on average, hovered around the scale midpoint. The 
attachment scores in Study 3 were stronger than those in 
Study 1. Both studies used the same measure of attachment 
strength, but respondents in Study 3 may have been evan- 
gelical about the human brand they selected. After all, 
Study 3 respondents were recruited from online communi- 
ties revolving around human brands. Although many of the 
participants in the low-attachment condition reported weak- 
attachment scores, (e.g., Orlando Bloom = 1.8/7, Matt 
Crane — 1.0/7), most respondents in the high-attachment 
condition reported scores greater than 6.5/7 (e.g., Ben 
Affleck, Julie Andrews, Bobby Valentine). These human 
brands seemed to play a more central role in the Study 3 
respondents’ lives than might otherwise be normally 
expected. Furthermore, although the same language might 
be used to describe interpersonal and consumption attach- 
ments (e.g., Study 2), relationships implicating the latter 
may be weaker in general. Human-brand attachments are 
probably best viewed as supplementary and subsidiary 
(Colin 1996). The implication for qualitative research on 
marketing relationships is that quantitative methods provide 
a useful perspective that may not be evident from qualita- 
tive approaches alone (Belk 1992). 

Systematic research into what tactical choices are likely 
to promote autonomy and relatedness is needed. For exam- 
ple, do self-service models of service delivery or product 
customization promote autonomy by making consumers 
feel more self-expressive or self-governing? Do autonomy 
and relatedness need to be created from scratch, or can 
firms draw on existing feelings targeted elsewhere? Does 
possessing one or more strong consumption-related attach- 
ments mean that a person experiences elevated life satisfac- 
tion, or as alluded to by prior research (Kleine and Baker 
2004), are there other, negative effects? Given the power of 
the strong attachments, answering these questions would be 
a worthwhile undertaking. 
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APPENDIX 
Summary of Measures in Studles 1 and 3 








Study 1 Study 3 

Measure Items a M ltems Qt M 

Attachment strength 4 .89 2.33 4 93 5.72 
Autonomy 3 .80 2.69 6 85 5.68 
Relatedness y ne 81 1.90 6 89 4.23 
Competence 3 70 3.10 6 83 5.19 
Self-actualization 3 95 4.42 
Pleasure—stimulation 3 88 4.49 
Money-luxury 3 91 3.46 
Self-esteem 3 84 5.18 
Popularity-influence 3 85 4.45 


Notes. All measures use seven-polnt Likert scales anchored by "strongly disagree" and "strongly agree." 
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Dual Distribution and Intangible Firm 
Value: Franchising in Restaurant 
Chains 


Dual distribution systems in which firms use vertical integration and market governance simultaneously are widely 
used across diverse marketing contexts (e.g., restaurants, retailing, industrial selling). A prominent example of dual 
distribution includes business format franchising, in which firms (i.e., the franchisors) license the operation of some 
of their units to franchisees while owning and operating some units themselves. Despite the widespread prevalence 
of dual distribution, there are few insights into its performance implications. In this article, the author examines the 
relationship between a firm's dual distribution strategy and its intangible value. Franchising in restaurant chains 
serves as the empirical context for the study. It is proposed that a firm's dual distribution strategy affects its 
intangible value both independently and jointly with a set of firm characteristics. The author considers the firm's age, 
scope of vertical integration, advertising, financial leverage, and financial liquidity as firm characteristics that 
influence the relationship between dual distributlon strategy and intangible value. The author measures the firm's 
dual distribution strategy by the proportion of its franchised units to its total units and intangible value by its Tobin's 
q. The author estimates the proposed model using panel data on 55 publicly listed U.S. restaurant chains for the 
period 1992—2002. Unobserved firm heterogeneity is accommodated using latent class regression analysis. The 
results support a four-segment model. Dual distribution increases intangible value for some firms but decreases 


intangible value for others, both independently and in conjunction with other firm characteristics. 


integration and market governance are widely used 

across diverse marketing contexts, including retail- 
ing, restaurants, and industrial selling. Although the nature 
of dual distribution systems varies by marketing context, in 
principle, dual distribution implies that the firm contracts 
with market agents to manage parts of its distribution sys- 
tem while owning and managing other parts of it. Given the 
widespread prevalence of dual distribution, some observers 
suggest that it has become the dominant channel design 
with important implications for firms’ marketing strategies 
and performance (Anderson, Day, and Rangan 1997; Stern 
and Weitz 1997). 

In this article, I examine the relationship between dual 
distribution strategy and intangible value using the context 
of franchising in restaurant chains. In what is known as 
business format franchising, the chains (i.e., the fran- 
chisors) own and operate some restaurant units and simulta- 
neously license the operation of other restaurant units to 


D: distribution systems in which firms use vertical 
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market agents (ie., the franchisees). Franchisors provide 
marketing and operations support to franchisees in return 
for royalty income, which is a fixed percentage of fran- 
chisees' revenues. 

Dual distribution has been extensively studied by mar- 
keting scholars as hybrid channels, multiple channels, or 
plural governance. However, prior research has focused pri- 
marily on its various antecedents, including information 
asymmetry (Heide 1994, 2003), transaction costs (Ander- 
son 1988; Anderson and Schmittlein 1984; Bergen, Dutta, 
and Walker 1992; Dahlstrom and Nygaard 1994, 1999; 
Rindfleisch and Heide 1997; Weiss and Anderson 1992), 
and uncertainty (Dutta et al. 1995; John and Weitz 1988). 
There are fewer insights into the performance implications 
of dual distribution. As Heide (2003, p. 27) notes, "the spe- 
cific performance implications of plural systems remain 
unanswered;... establishing a link between particular gover- 
nance approaches and outcome variables seems an impor- 
tant research priority." 

Relating dual distribution strategy to intangible value is 
also valuable from a marketing metrics perspective (Bolton 
2004; Rust et al. 2004). Recent developments in accounting 
suggest that a firm's accounting data alone cannot ade- 
quately explain its value because they do not capture its 
intangible assets (Amir and Lev 1996). Accordingly, there 
is interest in identifying value-relevant, nonaccounting mea- 
sures, including business strategies that may affect share- 
holder value (e.g., Barth et al. 1998). Relating marketing 
strategies to firm value is especially pertinent because most 
marketing investments create intangible market-based 
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assets that are not reflected on balance sheets (Srivastava, 
Shervani, and Fahey 1998, 1999).! 

Consistent with conceptual advances in the marketing 
metrics area (Srivastava, Shervani, and Fahey 1998, 1999), 
empirical research has identified a value-relevant role for 
various aspects of marketing strategy, including branding 
(e.g., Rao, Agarwal, and Dahloff 2004; Simon and Sullivan 
1993), product management (Aaker and Jacobson 2001; 
Pauwels et al. 2004), and customer relationship manage- 
ment (e.g., Anderson, Fornell, and Mazvancheryl 2004). 
However, there are few insights into the relationship 
between distribution strategy, a key aspect of a firm's mar- 
keting strategy, and its value. An exception to this, however, 
is the work of Banerji and Simon (1991), who use survey 
data from 34 restaurants and report a positive relationship 
between the proportion of firms' franchised units to their 
company units and their intangible value. More recently, 
two studies (Geyskens, Gielens, and Dekimpe 2002; Lee 
and Grewal 2004) show that a firm's adoption of the Inter- 
net as a distribution channel positively influences its intan- 
gible value. These developments raise questions about the 
value relevance of distribution strategy. 

A study relating dual distribution strategy to firm value 
is also important for marketing practice because of the 
increasing scrutiny of the financial accountability of mar- 
keting investments from various stakeholders (Rust et al. 
2004). Testifying to the potential managerial significance of 
this issue, an empirical survey of restaurant chains, the 
industry context for this article, suggests an intriguing rela- 
tionship between chains' dual distribution strategies and 
their intangible values (see Figure 1). 

Notably, the scatter plot in Figure 1 indicates that 
restaurant chains using given dual distribution strategies 
have different intangible values. For example, in 2002, 
McDonald's Corporation and Panera Bread Company, 
which had similar dual distribution strategies (.72 of their 
system units were franchised), had intangible values of 
1.290 and 5.300, respectively. Likewise, Benihana and Cali- 
fornia Pizza Kitchen, which had similar dual distribution 
strategies (.18 of their system units were franchised), had 
intangible values of .990 and 2.780, respectively. Similar 
patterns were observed in other industries in which dual 
distribution is prevalent (e.g., automotive repair services, 
hotels), suggesting a complex and perhaps contingent rela- 
tionship between dual distribution strategy and intangible 
value. Insights clarifying this relationship will be useful to 
managers whose business strategies must be informed by 
their performance implications. In this article, I examine 
two résearch questions: Does a firm's dual distribution 
strategy affect its intangible value? and Do the effects of 
dual distribution on intangible value vary across firms? 

Firms are urged to leverage synergy among elements of 
their marketing strategy to optimize resource allocation 
(Capon, Farley, and Hoenig 1990, pp. 1144—59; Gatignon 





II use the terms “firm,” “restaurant chain," and "chain" inter- 
changeably to denote the restaurant chain and the terms “intangi- 
ble value," “value,” and "intangible firm value" to denote the 
firm's intangible value as measured by its Tobin's q. 


and Hanssens 1987, p. 257). Surprisingly, however, prior 
studies relating marketing to firm performance have 
focused on one aspect of marketing, such as new product 
introductions (e.g., Bayus, Erickson, and Jacobson 2003) or 
brand management (e.g., Rao, Agarwal, and Dahloff 2004), 
without consideration of potential synergies among them. 
Indeed, the considerable variability in intangible values of 
firms for a given dual distribution strategy, as observed in 
Figure 1, suggests that a contingent relationship between 
dual distribution strategy and other firm characteristics 
affects intangible value. 

Accordingly, I apply contingency theory (Hambrick 
1983; Zeithaml, Varadarajan, and Zeithaml 1988), which 
proposes complementary effects of various aspects of a 
firm's profile, strategy, and resources on its performance, to 
relate a firm's dual distribution strategy to its intangible 
value. I propose that a firm's dual distribution strategy 
affects intangible value both independently and jointly with 
other firm characteristics. The firm's age, scope of vertical 
integration, advertising, financial leverage, and financial 
liquidity ‘are considered characteristics that influence the 
effects of its dual distribution strategy on its intangible 
value. 

I estimate the model using panel data on 55 publicly 
listed U.S. restaurant chains in the period 1992-2002, 
resulting in 394 firm years. Unobserved firm heterogeneity 
is accommodated using latent class regression analysis that 
derives latent segments (for a review, see Wedel and 
Kamakura 2000). I use the proportion of a chain’s fran- 
chised units to the total number of its units in its distribution 
system as the indicator of its dual distribution strategy. I 
measure the chain’s intangible value by its Tobin’s q, a 
forward-looking, stock market-based measure (e.g., Bha- 
radwaj, Bharadwaj, and Konsynski 1999). 

The results support a four-segment model and indicate 
that a firm’s dual distribution strategy influences its intangi- 
ble value both independently and jointly with other firm 
characteristics. The effect of dual distribution on intangible 
value varies across firms in the four segments, in support of 
firm heterogeneity. The proposed model explains a firm’s 
intangible value significantly better than its accounting 
data, in strong support of the value relevance of its dual dis- 
tribution strategy. The model also predicts a firm’s intangi- 
ble value well, given its characteristics. 

I organized the article as follows: In the next section, I 
define dual distribution and provide an overview of it. Then, 
I develop the hypotheses. In the subsequent sections, | 
describe the data and the model estimation procedure, and 
then I present the results. The article concludes with a dis- 
cussion of the study's contributions, its limitations, and 
opportunities for further research. 


Theory 


Definition of Dual Distribution 

I conceptualize a firm's dual distribution as a hybrid sys- 
tem, involving the simultaneous use of vertical integration 
(known as "vertically integrated channels" or "company 
units" in franchising) and market governance (also called 
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“market-based channels" or “franchised units").2 In the 
restaurant industry, most chains use dual distribution, 
though some (e.g., Planet Hollywood International, Lone 
Star Steakhouse & Saloon) use only vertically integrated 
channels and others (e.g., Noble Roman’s, Triarc Compa- 
nies) use only market-based channels. 

I define the proportion of a chain's franchised units to 
the total number of its system units in its distribution sys- 
tem as the indicator of its dual distribution strategy for three 
reasons. First, this definition captures the chain’s emphasis 
on market-based channels relative to its system size, reflect- 
ing the conceptualization of dual distribution as a hybrid 
system. Second, the definition also captures the chain's 
` relative emphasis on the creation of intangible assets (1.e., 
franchised units, franchisee relationships, and knowledge) 
in its distribution system; this is consistent with the study's 
focus on firm value, which is achieved through intangible 
market-based assets (Srivastava, Shervani, and Fahey 
1998). Third, the proportion of a chain's market-based 
channels to its system size is a strategic choice variable 
because chains first determine their overall distribution 
strategy (e.g., system size, location of units) and then deter- 
mine which units should be franchised (Lafontaine 1992). 


Overview of Dual Distribution 


Vertically integrated channels and market-based channels 
embody different economic, structural, and managerial 
characteristics and serve different marketing functions 
(Dutta et al. 1995; Heide 1994, 2003). For.example, firms 
own, operate, and manage vertically integratéd channels 
through hierarchical, corporate control, whereas market 
agents own and operate market-based channels, which are 
managed primarily through provisions and incentives in a 
relational contract between the firm and its agents 
(Williamson 1975, 1985). 

Marketing scholars have examined the conditions in 
which plural governance in general and dual distribution in 
particular lead to greater efficiency using the theoretical 
lenses of agency theory (Bergen, Dutta, and Walker 1992; 
Dutta et al. 1995) and transaction cost economics (Heide 
1994, 2003; John and Weitz 1988; Rindfleisch and Heide 
1997). The emergent consensus is that under some condi- 
tions, dual distribution eliminates problems associated with 
either exclusive vertical integration or market governance 
(for discussions of agency theory and transaction cost eco- 
nomics applicable to dual distribution, see Bergen, Dutta, 
and Walker 1992, pp. 13-16; Heide 1994, pp. 74—78; Rind- 
fleisch and Heide 1997, pp. 32-43). 


2The terms “multiple channels” and “dual channels" have also 
been used to denote the situation in which a firm owns and oper- 
ates more than one channel (e.g., J.C. Penney has catalog opera- 
tions, an online store, and bricks-and-mortar stores). I use the term 
“dual distribution” in this article to denote a situation in which one 
channel is owned and operated by the firm and the other channel is 
market-contracted to external agents. In addition, the term 
“hybrid” has been used in various ways in the literature. Some 
authors (e.g., Moriarty and Moran 1990) use it to describe a com- 
bination of institutional forms, as in this article, but others (e g., 
Williamson 1991) use the term “hybrid” to describe a governance 
form that 1s intermediate between market governance and hierar- 
chical governance. 


Advantages of Dual Distribution 


Both vertical integration and market governance have dis- 
tinctive economic and structural characteristics and market- 
ing functions. Thus, the advantages and disadvantages of 
dual distribution arise from the aggregation of benefits and 
costs of each channel and the synergy across the dual 
channels.3 


Greater control over operations in vertically integrated 
channels. In the presence of dual distribution, vertical inte- 
gration enhances the credibility of the firm's contract termi- 
nation safeguard, curtails agents' opportunism, and reduces 
the firm's vulnerability (Dutta et al. 1995). Vertical integra- 
tion also provides a firm with greater control over its busi- 
ness processes, products, and profits (Heide 1994). Consis- 
tent with greater control over company units, the chief 
executive officer (CEO) of McDonald's stated, “We main- 
tain a global base of company-operated restaurants to gen- 
erate profits, link our interest with franchisees, develop 
management talent, gather research, and test ideas for better 
restaurant execution" (McDonald's Corporation 1995, p. 
11). 


Lower costs and higher returns in market-based chan- 
nels. Market agents are an important source of scarce finan- 
cial and managerial resources that are bundled efficiently in 
the contract (Alchian and Demsetz 1972; Grossman and 
Hart 1986; Jensen and Meckling 1976). As Anderson and 
Weitz (1986, p. 13) note, "administrative costs associated 
with external agents are quite low;... the ‘invisible hand’ of 
the market places substantial pressures on independent 
agents to provide a marketing activity efficiently." 

For example, in restaurant chains, owner-managers of 
franchised units, with rights to residual profits from their 
units, are more motivated and work more efficiently than 
salary-based employees in company units, thus lowering 
operating costs. Chains monitor franchised units infre- 
quently, relying instead on provisions in the contract, which 
lowers monitoring costs as well (Lafontaine and Shaw 
2005). Furthermore, chains make no capital investment in 
franchised units, even as they earn income, which is set as a 
fixed percentage of franchisees' revenues. As the executive 
vice president at a leading fish-and-chips chain noted, "with 
franchise units, you receive a smaller margin but the return 
on investment approaches infinity because there is no 
investment" (Bradach 1998, p. 68). 


Market-based channels are more responsive to market 
needs. Owner-managers of market-based channels follow 
complex strategies and are more responsive to market 
changes than are bureaucratic, vertically integrated chan- 
nels (Yin and Zajac 2004). As a Pizza Hut franchisee noted 
(Bradach 1998, p. 127), "Franchisees have a faster reaction 
time. With all the layers in the company, it takes a long time 
to get things done." Because they are responsive to market 
needs, franchised units are an important source of innova- 
tions (Lewin-Solomons 1998). For example, three of 





3I gratefully acknowledge the guidance of two anonymous 
reviewers 1n the development of this section. 
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McDonald’s most successful products—the Big Mac, Filet- 
O-Fish, and Egg McMuffin—were originally introduced by 
its franchisees (BusinessWeek 1997). 


Market-based channels provide low-cost, low-risk 
access to new markets. Market-based channels provide 
firms with low-risk, low-cost access to markets that are not 
economically worthwhile to pursue with vertically inte- 
grated channels (Dahlstrom and Nygaard 1994). Indeed, 
franchised units are frequently located in remote, low- 
potential markets that, on average, achieve lower sales per 
unit than company units (Brickley and Dark 1987). Further- 
more, the franchise contract is designed such that the upside 
benefit of increased revenues accrues to chains, whereas the 
downside risk of poor performance is borne solely by the 
franchisees. As Robert Nugent, the CEO of Jack in the Box, 
noted, “The economic advantage to franchising is that there 
is less risk to our income statement because we’re not 
affected by cost pressures” (Bradach 1998, p. 68). 


Synergy between dual channels. In addition to the dis- 
tinct advantages of vertically integrated channels and 
market-based channels, synergy between the dual channels 
provides advantages (Lafontaine and Kaufmann 1994; Mar- 
tin 1988). Chains leverage their experience in company 
units, over which they have control, to model responses in 
franchised units, over which they have much less control. 
Bradach (1997) calls this the “ratcheting process.” In addi- 
tion, Bradach (1997) finds evidence of two-way, “mutual- 
learning” processes between company units and franchised 
units. Testifying to synergy between the dual channels, the 
CEO of a restaurant chain noted (Bradach 1998, p. 2), “The 
chain [company units] gives you a system perspective, 
while franchisees give you a local perspective. We are con- 
stantly working to balance both perspectives. By having 
both company and franchised units, we are able to do that.” 


Disadvantages of Dual Distribution 


Moral hazard problems in market-based channels. 
Because agents invest only in their business (i.e., the fran- 
chised unit), they experience higher risk than well- 
diversified investors and may invest suboptimally in their 
unit, creating moral hazard problems for the firm (Blair and 
Kaserman 1994; Fama and Jensen 1983). For example, 
franchisees may attract customers on the basis of the fran- 
chisor’s reputation but deliver inferior products (e.g., Pizza 
Hut franchisees can reduce the quality and quantity of 
cheese in pizzas), thus profiting from such actions with lim- 
ited negative consequences to them while hurting the firm’s 
reputation. Such moral hazard problems are especially com- 
mon when franchised units are located in remote markets 
characterized by high monitoring costs (Oxenfeldt and 
Kelly 1969). 


Uncertainty about income from market-based channels. 
Restaurant chains that use dual distribution rely on market- 
based channels that are not under their control for a portion 
of their income, resulting in uncertainty about this income 
stream (Bradach 1997). Indeed, the forward-looking state- 
ments in chains’ annual reports identify uncertainty in 
income from franchised units as an important risk factor 
(e.g., Papa John’s International 2004). However, as I dis- 
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cussed previously, this risk is mitigated partly by vertical 
integration. 


Hypotheses 


The considerable variability in chains’ intangible values for 
a given dual distribution strategy, as evident in Figure 1, 
suggests that a contingent relationship between dual distrib- 
ution strategy and other firm characteristics affects intangi- 
ble value. Accordingly, I use the contingency-based 
approach, which argues for a complementary effect of a 
firm’s resources and strategy on its performance (Hambrick 
1983; Zeithaml, Varadarajan, and Zeithaml 1988). Extend- 
ing the contingency theory, I suggest that a firm’s dual dis- 
tribution strategy affects its intangible value both indepen- 
dently and jointly with its other characteristics. Thus, in 
addition to the main effect of dual distribution strategy on 
intangible value, I consider the effects of its interactions 
with the firm’s age, scope of vertical integration, advertis- 
ing, financial leverage, and financial liquidity on its intangi- 
ble value. Note that though age, financial leverage, and 
financial liquidity are not marketing variables in the tradi- 
tional sense, developments in agency theory (e.g., Bergen, 
Dutta, and Walker 1992) and transaction cost economics 
(e.g., Rindfleisch and Heide 1997) imply that they may 
influence the relationship between governance mode (i.e., 
dual distribution strategy) and firm performance. 

Extending theoretical developments in market-based 
assets theory (Srivastava, Shervani, and Fahey 1998), I pro- 
pose that under some conditions, defined by these interac- 
tion effects, the stock market will anticipate increasing cash 
flows, higher speed of cash flows, and lower risk of cash 
flows, which should increase the firm's intangible value. 
Under other conditions, the stock market will anticipate 
decreasing cash flows, lower speed of cash flows, and 
higher risk of cash flows, which should decrease the firm's 
intangible value. Because dual distribution offers both 
advantages and disadvantages, I hypothesize opposite 
effects when appropriate. The latent class regression 
approach, which I describe subsequently, accommodates 
opposite effects across firms in the different segments. 


Main Effect of Dual Distribution 


As I discussed previously, dual distribution offers several 
advantages, including lowering a firm's capital require- 
ments, operating costs, monitoring costs, and risk. From a 
shareholder value perspective, market-based channels cre- 
ate relational, knowledge, and intellectual assets that 
increase future cash flow while reducing risk (Srivastava, 
Shervani, and Fahey 1998, 1999). From a real-options per- 
spective (Bowman and Hurry 1993; Kulatilaka and Trigeor- 
gis 1994), market-based channels are “options” investments 
that are useful for managing demand growth and uncer- 
tainty. Thus, an increasing proportion of market-based 
channels should increase a firm's intangible value. 
However, agency theory accounts suggest some disad- 
vantages of dual distribution as well, including potential 
underinvestment and free riding by market agents (Bergen, 
Dutta, and Walker 1992). Thus, an increasing proportion of 
market-based channels in such firms may decrease their 


future cash flows and increase the uncertainty of these cash 
flows, thus negatively affecting the firm’s intangible value. 

Given these opposing arguments, I expect that there is a 
positive effect of the proportion of market-based channels 
on intangible value for some firms and a negative effect for 
others. Thus: 


Hj, (Hib): The greater the proportion of market-based chan- 
nels in a firm that uses a dual distribution strategy, 
the higher (lower) is its intangible value. 


Dual Distribution and Firm Age 


Over time, most high-potential markets are covered by a 
firm’s existing distribution system so that its expansion may 
be limited to smaller, remote markets, which are more effi- 
ciently served by market-based channels (Lafontaine and 
Shaw 2005). In such a situation, the stock market may view 
an increasing proportion of market-based channels as a 
cost-effective way to grow, thus strengthening the firm’s 
cash flows and increasing its intangible value. : 

However, other arguments from the strategy and fran- 
chise literature suggest an opposite effect. The life cycles of 
firms influence their resource profiles and strategies (Pen- 
rose 1959). Franchising is important in the early years of a 
firm when its resources are scarce (Carney and Gedajlovic 
1991). Over time, as resources increase, franchising may 
become less important, resulting in the well-documented 
reversion to company ownership of franchised units in older 
firms (e.g., Lafontaine and Kaufmann 1994; Oxenfeldt and 
Kelly 1969). Thus, the older the firm and the greater the 
proportion of its market-based channels, the lower is its 
future cash flows, the higher is the risk of these, cash flows, 
and the lower is its intangible value. Given these opposite 


effects, | 


Hə, (Hy): The interaction between the proportion of market- 
based channels in a firm that uses a dual distribu- 
tion strategy and its age increases (decreases) its 
intangible value. 


Dual Distribution and Scope of Vertical 
Integration 


In the current context, the scope of a firm's vertical integra- 
tion is the proportion of the firm's sales from its vertically 
integrated channels to its total sales (John and Weitz 1988). 
Several factors suggest a positive effect of the interaction 
between the scope of a firm's vertical integration and dual 
distribution on its intangible value. 

A firm that depends more on its vertically integrated 
channels for sales will have higher fixed investments in its 
integrated channels, offering a credible termination safe- 
guard against agents’ opportunism and improving both the 
agents’ and the system’s performance (Heide 1994), Fur- 
thermore, a firm that depends more on its vertically inte- 
grated channels for sales will have higher marketing expen- 
ditures (Lafontaine 1992; Lafontaine and Shaw 2005), 
creating positive marketing spillovers for its market-based 
channels and, again, strengthening its performance. 

In addition, when a firm has an increasing proportion of 
sales from its vertically integrated channels andja large pro- 
portion of market-based channels, its market-based chan- 


nels will have lower sales per unit than its vertically inte- 
grated channels, suggesting that they are covering smaller 
markets with lower potential. In such a situation, the firm’s 
market-based channels are a cost-effective way to reach 
these smaller markets without attendant capital investment 
and risk. Thus, the interaction between the scope of a firm’s 
vertical integration and an increasing proportion of market- 
based channels may strengthen its future cash flows, reduce 
the risk of these cash flows, and increase intangible value. 


H3, The interaction between the proportion of market-based 
channels in a firm that uses a dual distribution strategy and 
the scope of the firm’s vertical integration increases its 
intangible value. 


Dual Distribution and Advertising 


Empirical studies indicate that the effects of advertising on 
performance are industry specific, varying in magnitude, 
timing, and persistence (Comanor and Wilson 1967; 
Ravenscraft and Scherer 1982). The interest here is in the 
effect of the interaction between a firm’s advertising and the 
proportion of its market-based channels on intangible value. 
The firm’s and the market agents’ incentives are closely 
aligned when brand equity, which is consistent with high 
advertising, is high (Arrow 1985; Mathewson and Winter 
1984). For example, when a firm has high advertising, its 
market agents may be motivated to perform better to extract 
higher rents from the higher brand reputation, improving 
system performance. Furthermore, when a firm with high 
advertising also has a high proportion of market-based 
channels, its advertising investment may be efficiently 
recovered over the larger market, increasing the firm’s cash 
flows and reducing their uncertainty. Thus, the higher the 
firm’s advertising and the higher the proportion of its 
market-based channels, the higher is its intangible value. 
However, other arguments suggest the opposite. Agency 
theorists (Brickley and Dark 1987) argue that market agents 
of firms with higher advertising may have an incentive to 
free ride on the firm’s reputation, especially in remote mar- 
kets with high monitoring costs. For example, franchisees 
of well-known brands located in remote markets may 
reduce product quality, inferring that such actions bear no 
negative consequences for them or for the firm. However, 
such actions can negatively affect the firm’s product percep- 
tions, reputation, and performance. Thus, when a firm with 
high advertising also has a high proportion of market-based 
channels, the increased moral hazard threat may lower its 
future cash flows and increase the risk of these cash flows, 
lowering the firm’s intangible value. Given these opposite 
effects, 
Ha, (Hap): The interaction between the proportion of market- 
based channels in a firm that uses a dual distribu- 


tion strategy and its advertising increases 
(decreases) its intangible value. 


Dual Distribution and Financlal Leverage 


In the current context, financial leverage is the extent to 
which a firm uses debt to finance its assets (Jensen and 
Meckling 1976). A firm’s financial leverage affects its strat- 
egy and performance through agency problems between the 
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firm and its bondholders and stockholders (Bradley, Jarrell, 
and Kim 1984). As I discuss next, these agency problems 
may result in an effect of the interaction between a firm’s 
financial leverage and dual distribution strategy on its intan- 
gible value. 

Stock and bond markets cannot differentiate between 
“good-quality” and “bad-quality” firms. Thus, good-quality 
firms with high financial leverage signal their quality with 
actions that are too costly for bad-quality firms (Myers and 
Majluf 1984). For example, good-quality firms may invest 
in fixed assets, increasing the security for their bondholders 
and reducing their cost of capital (Myers 1977). 

By the reverse logic, when a firm with high financial 
leverage has a high proportion of market-based channels, an 
inexpensive source of financial resources (Carney and 
Gedajlovic 1991), the bond market may interpret this as a 
signal of low funds, inferring the firm’s inability to service 
its debt, lower its debt rating, and increase its cost of capital 
(Lang, Ofek, and Stulz 1996). In such a situation, the stock 
market may expect the firm’s future cash flows to decrease, 
even as its risk increases, thus negatively affecting its intan- 
gible value. Thus: 


Hs: The interaction between the proportion of market-based 
channels ın a firm that uses a dual distribution strategy 
and its financial leverage decreases its intangible value. 


Dual Distribution and Financlal Liquidity 


Firms vary in their financial liquidity, or acid ratio (i.e., the 
extent to which current assets cover current liabilities); a 
higher acid ratio suggests a larger proportion of current 
assets than current liabilities. Financial liquidity affects a 
firm’s strategy and performance through investment con- 
straints (e.g., Bond and Meghir 1994), creating agency 
problems between the firm and its shareholders (Fazzari, 
Hubbard, and Petersen 1988; Hsiao and Tahmiscioglu 
1997). A high acid ratio suggests cash availability to fund 
investments with superior earnings potential. Indeed, the 
stock market prefers a lower acid ratio to a higher one 
because it views current assets as less productive than fixed 
assets (Myers and Rajan 1998). 

When a firm has a high proportion of market-based 
channels and a high acid ratio, the stock market may infer 
that the firm is forgoing income generation opportunities by 
not investing in vertically integrated channels. Thus, the 
stock market may expect a firm's increasing liquidity and 
an increasing proportion of its market-based channels to 
lower its intangible value. Thus: 


Hg: The interaction between the proportion of market-based 
channels in a firm that uses a dual distribution strategy 
and its financial liquidity decreases its intangible value. 


Method 


Data 


I used unbalanced panel data of publicly listed U.S. restau- 
rant chains for the period 1992-2002, which I obtained 
from various sources, including Technomic (an industry 
market research firm), Compustat, and the firms’ annual 


126 / Journal of Marketing, July 2006 


reports. The data included 55 firms observed for 394 firm 
years. Some firms entered after 1992 or exited before 2002, 
contributing fewer firm years. The average number of firm 
years was 7.164 (minimum = 1 year; maximum = 11 years). 

The firms in the study constituted 2896 of publicly listed 
restaurant chains. The profile of firms in the study did not 
differ from the population of firms in Compustat (N — 198) 
in terms of profit (tprofitabuity) = 1.012, not significant 
[n.s.]), but they were larger (tates) = 1.912, p < .10; 
t(employees) = 1.987, p < .05). The firms in the study are rep- 
resentative of the population, though they are slightly larger. 
This is not surprising, because larger firms are favored in 
public data sources. 


Measures 


Tobin's q is the measure of a firm's intangible value. Chung 
and Pruitt's (1994) method was used to calculate Tobin's q: 


eoa 


(1) TA 


where MVE is the closing price of shares at the end of the 
financial year x number of common shares outstanding; PS 
is the liquidation value of outstanding preferred stock; 
DEBT is the sum of book value of inventories, long-term 
debt, and current liabilities less current assets; and TA is the 
book value of total assets. 

As I discussed previously, the proportion of the restau- 
rant chain’s franchised units to the total number of its sys- 
tem units, a continuous measure bounded between 0 (only 
vertically integrated channels) and 1 (only market-based 
channels), was the indicator of its dual distribution strategy. 
The trends in dual distribution for the period 1992-2002 
suggested variability in dual distribution across chains and 
over time. The firm’s age was measured by the number of 
years since its incorporation. The scope of vertical integra- 
tion was measured by the proportion of the firm’s sales 
from its company units to its total sales. The firm’s sales 
from its company units and its total sales were obtained 
from Technomic and Compustat, respectively. The firm’s 
advertising was measured by its advertising stock, using 
advertising expenditures reported in Compustat.4 To 
account for the persistence of advertising, advertising stock 
was measured using one-, two-, and three-year lags with 
annual discount rates of .45 (Hirschey and Weygandt 1985). 
Through the use of information criteria, the three-year lag 
was identified as the appropriate lag.? The firm's financial 


4None of the firms in the study reported research and develop- 
ment expenditure. 

5I] thank an anonymous reviewer for this suggestion. The 
Schwarz Bayesian information criterion (BIC) for the four- 
segment model (reported subsequently) without profiling variables 
is 784.682, and it is 786.029 for models with advertising stock 
with a one-year and two-year lagged term for the advertising 
Stock, respectively, compared with 778.354 for the model with a 
three-year lag for the advertising stock. Thus, the three-year 
lagged term for advertising stock was selected as the appropriate 
lag term. In addition, following the suggestions of a reviewer, I 
reestimated the model with advertising stock adjusted for the 
firm's sales. In general, the results were consistent, but the model 
with the advertising adjusted for sales performed worse (BIC = 


t 


leverage was measured as the ratio of its long-term debt to 
its total assets (Jensen and Meckling 1976), and the firm’s 


financial liquidity was measured as the ratio of its cash, 


marketable securities, notes receivable, and accounts receiv- 
ables to its current liabilities (Schmidgall, Hayes, and Nine- 
meier 2002). Table 1 contains the descriptive statistics and 
the correlation matrix of the measures. 

The correlations were within acceptable limits (highest 
correlation = —438 between firm age and cial lever- 
age). Following the work of Belsley, Kuh, and Welsch 
(1980), I assessed potential threats from multicollinearity. 
The variance inflation factor and condition numbers were 
lower than 10 (average — 1.238; maximum - 3. 456) and 15 
(average — 1.875; maximum - 2.893), respectively, suggest- 
ing that multicollinearity is not a threat to the validity of the 
study's findings. 


Latent Class Regression | 


To account for unobserved firm heterogeneity, I|used latent 
class regression that simultaneously classifies observations 
into latent segments and estimates regression models within 
each segment (see Wedel and Kamakura 2000). The latent 
class regression approach directly identifies laterit segments 
on the basis of the inferred relationship between|a response 
variable (i.e., Tobin's q) and the set of explanatory variables 
(i.e., dual distribution and related interactions) within each 
homogeneous group (i.e., a given latent segme it). An out- 
line of latent class regression models appears in the 
Appendix. 





Results 


Correction for Sample Selection Bias 
Some firms (n = 20) were not observed for the entire period. 
If the data were not missing randomly and the model was 
estimated with the observed data, the parameter estimates 
may be biased (Heckman 1979). I corrected for potential 


801.932) than the proposed model (BIC = 718.354). I obtained 
similar results using the consistent Akaike’s eee criterion. 


sample selection bias by including Lee’s lambda (Inverse 
Mill's Ratio), which I obtained from a Heckman selection 
model in the regression model for Tobin's q.6 

I first estimated the Probit selection model, including 
the firm's age, sales; and return on assets as factors that 
influence its exit from the data set. The results support the 
selection model (%2 = 20.342, d.f. = 1, p < .01). The firm's 
age (b = —.063, p < .01), size (b = —.005, p < .01), and return 
on assets (b = —.965, p « .01) had negative effects in the 
selection model. Consistent with the liabilities-of-newness 
arguments in organizational sociology (Brüderl and Schus- 
sler 1990), newer, smaller, and less profitable firms were 
more likely to exit. 


Model Estimation and Selection 


Following the procedure outlined in the Appendix, I esti- 
mated two sets of latent class regression models (1) with the 
restaurant chain's class (i.e., whether it was casual dining, 
quick service, and limited service) and the number of 
brands in its portfolio (e.g., Yum! Brands had four brands, 
including KFC, A&W, Pizza Hut, and Taco Bell) as profil- 
ing variables and (2) with only a constant term (i.e., no 
restaurant characteristics) as the profiling variable. I 
included all main effects of explanatory variables used to 
create the interaction effects to ensure complete model 
specification. I also included the firm's return on its share- 
holders’ equity, measured as the ratio of net income to 
shareholders' equity, and a quadratic term of the proportion 
of its market-based channels as control variables in the 
regression model. 

To determine the number of latent segments, I used the 
Bayesian information criterion (BIC) to compare the model 
with s segments with a model with s + 1 segments V s = 1, 
2, ..., until the model fit stopped improving (Schwarz 
1978). The four-segment: model, without the restaurant 
characteristics term for profiling, outperformed models with 


6For details on the statistics of sample selection correction, see 
Heckman (1979). 

7I included the quadratic term for dual distribution, following 
the suggestion of an anonymous reviewer. 





TABLE 1 
Descriptive Statistics and Correlations (N z 394) 
| 
Variable M (SD) | 1 2 3 4 5 6 7 
1. Tobin’s q | 1.548 | 1.000 
(1.046) |! 
2. Dual distribution (proportion -390 | .025 1.000 
of market-based channels) (.326) 
3. Firm age (years) 26.980 = —.052 .229** 1.000 
(18.000) 
4. Scope of vertical integration 654  |—.105* —.079 —.116* 1.000 
' (.562) 
5. Advertsing stock (in millions 58.132 | am .273** .347** —.222** 1.000 
of dollars) (152.693) 
6. Financlal leverage .259 | —.185"* .123* —.488** —.199** .207** 1.000 
(.237) 
7. Financial liquidity 741 | .185** 221" | —231" .013 —.099* —225** . 1.000 
(978) | 
*p < .05 
**p « .01. 
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one, two, three, and five segments both with and without 
profiling variables, as well as the four-segment model with 
profiling variables.3 The entropy of separation for the 
selected four-segment model was .972, which compares 
well with the results in the literature (Wedel and Kamakura 
2000). 


Tests of Hypotheses 


Table 2 contains the results of the four-segment model with 
the constant term for profiling. First, Lee’s lambda signifi- 
cantly affected a firm’s intangible value in two segments 
(Segment 2: b = —379, p < .01; Segment 4: b = 1.405, p < 
.01), in support of the correction for sample selection bias. 
There were several unhypothesized main effects of the 
explanatory variables. Specifically, the negative effect of 
advertising stock on intangible value was counterintuitive 
(Segment 2: b = —.001, p < .01; Segment 4: b = —.031, p< < 
.01). I examine this result in detail subsequently. 


8For the model formulation with restaurant class and number of 
brands as profiling variables, the BIC is 1131.655 for the one- 
segment model, 861.773 for the two-segment model, 846.560 for 
the three-segment model, 822.564 for the four-segment model, 
and 808.100 for five-segment model. Similarly, for model formu- 
lation with only the constant term as the profiling variable, the 
BIC is 1131.655 for the one-segment model, 843.380 for the two- 
segment model, 828.113 for the three-segment model, 778.354 for 
the four-segment model, and 786.839 for the five-segment model. 
I obtained similar results using the consistent Akaike’s informa- 
tion criterion. 


As I predicted in H4, and Hip, respectively, the results 
support both the positive (Segment 2: b = .694, p « .01) and 
the negative (Segment 1: b = —321, p « .05) effects of the 
firm's proportion of market-based channels on its intangible 
value. In addition, the square of the proportion of market- 
based channels had an unhypothesized positive effect on 
intangible value in Segment 1 (b = 1.190, p « .01). As I pre- 
dicted in H5, and H»,, respectively, the results support the 
positive (Segment 1: b = .022, p « .01) and negative (Seg- 
ment 2: b = —.034, p < .01; Segment 4: b = —.091, p < .01) 
effects of interaction between the firm's age and the propor- 
tion of its market-based channels on its intangible value. As 
I predicted in H}, the interaction between-the scope of a 
firm's vertical integration and the proportion of its market- 
based channels had a positive (Segment 2: b = .634, p < .05) 
effect on its intangible value. Somewhat unexpectedly, the 
interaction between the scope of a firm's vertical integration 
and the proportion of its market-based channels also had a 
negative effect on its intangible value (Segment 1: b = 
—.413, p < .01). I examine this result in detail subsequently. 

The results also support H4, and Hay, which predict 
positive (Segment 1: b = .005, p « .05; Segment 2: b = .005, 
p < .01) and negative (Segment 4: b = —.073, p < .01) effects 
of the interaction between a firm's advertising stock and the 
proportion of its market-based channels on its intangible 
value. As I predicted in Hs, there was a negative effect of 
the interaction between a firm's financial leverage and the 
proportion of its market-based channels on its intangible 
value (Segment 2: b = —1.634, p « .01; Segment 3: b = 
—1.207, p < .05). Finally, as I predicted in Hg, there was a 
negative effect of the interaction between financial liquidity 


TABLE 2 
Results for the Model Relating a Firm's Dual Distribution to Its Intangible Value (N = 394) 

Varlable Segment 1a Segment 2 Segment 3 Segment 4 
Intercept .853 (.049)*** 1.393 (.071)*** 1.891 (.254)*™ 593 (.592) 
H44/H45: Dual distribution? —.321 (.135)** .694 (.149)*** 564 (.605) —1.110 (1.009) 
HH: Dual distribution x 

firm age .022 (.007)*** —.034 (.009)*** .028 (.028) —.091 (.031)'** 
Ha: Dual distribution x scope 

of vertical integration —.413 (.160)*** .634 (.253)** .100 (.781) —140 (.206) 
Hag/H4p: Dual distnbution x 

advertising stock .005 (.002)** .005 (.001)*** —,003 (.010) —073 (.025)*** 
Hg: Dual distribution x financial 

leverage —.275 (.471) —1.634 (.449)*** —7.207 (3.215)™ 7.175 (5.218) 
Hg: Dual distribution x financial 

liquidity —.507 (.072)*** —.138 (.153) —.367 (.474) —2.144 (1.043)** 
Firm age —.004 (.002)** .006 (.003)** 009 (.004)** —038 (.014)*** 
Scope of vertical integration .001 (.000)*** —.001 (.000)* —.001 (.000)*** .001 (.000)*** 
Advertising stock .001 (.002) —.001 (.000)*** —.002 (.003) —.031 (.007)'** 
Financial leverage .947 (.135)*** —.661 (.142)*** —3.150 (.864)'** —3.019 (1.384)** 
Financial liquidity .323 (.033)*** —.192 (.050)*** .199 (.179) 585 (.405) 
Dual distnbution? 1.190 (.358)*** —.695 (.498) —3.151 (2.657) 1.328 (1.486) 
Heturn on shareholders' equity .035 (.025) .013 (.013) 884 (.348)** —.582 (.235)™ 
Lee's lambda .011 (.049) —.379 (.113)*** .111 (111) 1.405 (541)*** 
Segment size (number of firms) 36% (20) 29% (16) 20% (11) 15% (8) 
*p « .10. 
**p « .05 
«ep < .01. 


aParameter estimates and standard errors. 


bDual distribution is measured by the proportion of market-based channels in the firm. 
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| 
and the proportion of a firm’s market-based channels on its 
intangible value (Segment 1: b = —.507, p < .01; Segment 4: 
b = -2.144, p < .05). Thus, the results — QUE the 
hypotheses. 


Latent Segment Profiles 


In this section, I report the post hoc segment profile of the 
four segments. The results of an analysis of variance (see 
Table 3) suggest that there are significant differences across 
firms in the four segments. The following is a summary of 
the Scheffé multiple comparisons of means of variables 
across the four segments (Pedhazur 1997):9 ! 


*Intangible value. Firms in Segment 4 had higher intangible 
value than firms in the other segments (p « .01), firms in Seg- 
ment 3 had higher intangible value than firms ını Segments 1 
and 2 (p « .01), and firms in Segment 2 had higher intangible 
value than firms in Segment 1 (p « .05). 


«Return on assets. Firms in Segment 1 had a much lower 
return on assets than other firms (p « .01). 


eSize. Firms in Segment 2 were larger in terms of system size, 
total assets, and sales than other firms (p « .01), whereas 
firms in Segment 4 were smaller in system size than other 
firms (p « .01). 

eNumber of brands. Firms in Segments 3 had more brands 
than other firms (p « .01). 


eAdvertising stock. Firms in Segment 2 had higher advertising 
stock than other firms (p « .01), whereas firms 1n Segment 4 
had lower advertising stock than other firms (p s .05). 


l 
! 


9There are no significant differences across firms in the four 
segments on other variables. 


eFinancial leverage. Firms in Segment 4 had much lower 
financial leverage than firms in the other segments (p « .05). 


J integrate the segment profiles with the regression 
results to explain the findings across the segments. Firms in 
Segment 1 (e.g., Backyard Burgers, Nathan's Famous, Car- 
rols Corporation) performed poorly in this period. For these 
firms, the proportion of market-based channels had a 
U-shaped relationship (D¢near) = —321, p < .05; Dequadratic) = 
1.190, p < .OL) on their intangible values. Perhaps the stock 
market infers that given their poor overall performance, 
these firms have limited ability to manage market-based 
channels, though notably, this negative effect disappears at 
very high proportions of market-based channels. For firms 
in Segment 1, the scope of vertical integration has a positive 
main effect on intangible values (b = .001, p < .01), but this 
effect is reversed in conjunction with the proportion of their 
market-based channels (b = —413, p < .01). The positive 
main effect of vertical integration suggests that for these 
poorly performing firms, increasing their sales from the 
company-based channels reassures the stock market of their 
future cash flows. Conversely, the negative main effect of 
the proportion of market-based channels, both indepen- 
dently and in conjunction with the scope of vertical integra- 
tion, implies that the stock market may view these poorly 
performing firms as not being well positioned to manage 
market-based channels. However, as these firms aged, the 
proportion of their market-based channels positively 
affected their intangible values (b = .022, p < .O1), suggest- 
ing that, over time, market-based channels are an effective 
mechanism for these firms to achieve growth. Likewise, as 
firms’ advertising stock increased, the proportion of market- 





TABLE 3 
Summary of Results Across the Four Latent Segments 
Variable Segment 1 Segment 2 Segment 3 Segment 4 
Intangible value (22.192**)a 1.048 1.300 1.903 2.735 
Return on assets (8.345) | .006 .018 .016 .028 
System size (units) (5.698**) | 2229 1064 643 
Total assets (in millions of dollars) (10.082**)  ! 398.451 1450.611 443.613 343.658 
Sales (In millions of dollars) (9.178**) | 653.684 1091.030 630.852 518.781 
Number of brands (3.184*) 1.400 1.375 2.545 1.375 
Advertising stock (in millions of dollars) (9.231 "ub 42.363 83.832 32.557 23.388 
Financlal leverage (6.792**) .323 .315 .205 .155 
California 
Backyard IHOP.Corp., Brinker Pizza Kitchen, 
Burgers, McDonald’s International, Panera Bread 
Nathan’s Corp., CBRL Group, Company, 
Famous, Wendy’s AFC Diedrich 
Exemplar Members Carrols Corp. International Enterprises Coffee 
Dual distribution (H44/H45)(47—) | -14 (quadratic) + n.s. n.s. 
Dual distribution x firm age (Ha—/Ho,) (+/-) + — n.s. — 
Dual distnbution x scope of vertical integration 
(Ha) (+) + n.s. n.s. 
Dual distribution x advertising (H44/H45) (+/-) + n.s. -— 
Dual distribution x financial leverage (Hs) (—) — — n.s 
Dual distribution x financial liquidity(Hg) (—) n.s. n.s. — 


p< .05. | 
"p < .01. 


aF|gures in parentheses are the overall test of means across the four segments (Fi, 51). 
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based channels positively affected their intangible values 
(b = .005, p < .05), suggesting that market-based channels 
also enable effective leverage of advertising stocks. 

Firms in Segment 2 (e.g., IHOP Corporation, McDon- 
ald’s Corporation, and Wendy’s International) were large 
firms with large advertising stocks. For these firms, the pro- 
portion of market-based channels positively affected their 
intangible values (b = .694, p < .01), suggesting that per- 
haps because of market saturation, market-based channels 
that tap small and low-potential markets are an efficient 
way to grow. The effects of the interactions between pro- 
portion of market-based channels and both advertising (b = 
.005, p « .01) and scope of vertical integration (b — .634, 
p « .05) positively affected firms' intangible values. 
Notably, the main effects of both advertising (b = —001, p < 
.01) and scope of vertical integration (b = —.001, p « .10) for 
these firms are negative. For large firms with large advertis- 
ing stocks, perhaps the ceiling effects of advertising result 
in the negative main effect. However, the stock market may 
view these large firms as being able to leverage their large 
advertising stocks effectively over a larger market using 
market-based channels, resulting in the observed positive 
effect of the interaction between advertising stock and the 
proportion of market-based channels on intangible values. 
The negative main effect of the scope of vertical integration 
and the positive effect of the interaction between the pro- 
portion of market-based channels and the scope of vertical 
integration suggest that the stock market views expansion 
through market-based channels as the most effective way 
for these large firms to achieve growth. 

Firms in Segment 3 (e.g., Brinker International, CBRL 
Group, AFC Enterprises) were small firms with a larger 
portfolio of brands than other firms. Notably, most hypothe- 
sized effects of dual distribution on intangible value were 
not supported for these firms (the one exception was the 
interaction between financial leverage and proportion of 
market-based channels: b = —7.207, p < .05). Overall, the 
intangible values of these firms appeared to be impervious 
to their dual distribution strategies, perhaps because the 
stock market views their brand portfolio diversification as a 
sufficient hedge against market uncertainty. 

Firms in Segment 4 (e.g., California Pizza Kitchen, Pan- 
era Bread Company, Diedrich Coffee) were very small 
firms with low advertising stocks and low financial lever- 
age. For these firms, there was no main effect of the propor- 
tion of market-based channels (b = —1.110, n.s.) on their 
intangible values, but advertising had a negative main effect 
(b = —031, p < .01) and the scope of vertical integration had 
a positive main effect (b — .001, p « .01) on their intangible 
values. However, there were negative effects of the interac- 
tions between the proportion of market-based channels and 
advertising stock (b = —073, p < .01), age (b = —091, p < 
.01), and financial liquidity (b = —2.144, p < .05) on their 
intangible values. The negative effect of advertising stock 
on intangible value, both independently and in conjunction 
with dual distribution, is intriguing. Indeed, after the large 
firms in Segment 2, the firms in Segment 4 have the highest 
proportion of advertising stock to system size. I conjecture 
that the stock market may perceive these firms as having 
adequate advertising stock and thus need to achieve system 
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growth. In addition, for these firms, dual distribution has a 
negative effect on intangible value. Integrating this pattern 
of findings suggests that perhaps the stock market views 
these firms as good custodians of intangible value and 
rewards them for organic growth through company-based 
channels (i.e., lower emphasis on dual distribution). 


Additional Analyses 


Model comparisons. First, I compared the proposed 
model with a model with only the accounting data of finan- 
cial leverage, liquidity, and return on shareholders' equity. 
The hypothesized model (BIC = 778.354) outperformed 
this baseline model (BIC = 823.358). Second, I compared 
the hypothesized model with a model that included the 
firm’s age, scope of vertical integration, advertising stock, 
and financial accounting data, but I excluded dual distribu- 
tion and the related interaction effects. Again, the hypothe- 
sized model (BIC = 778.354) outperformed the model that 
excluded dual distribution (BIC = 798.567). Third, I com- 
pared the proposed model with a model that included main 
effects of dual distribution and other explanatory variables, 
but I excluded the interaction effects. Again, the hypothe- 
sized model (BIC = 778.354) outperformed the model that 
excluded the interaction effects (BIC = 810.187). Thus, the 
proposed model with the inclusion of dual distribution and 
related interactions explained firm value better than models 
that included only accounting data or models that excluded 
dual distribution; this shows strong support for the value 
relevance of the dual distribution strategy. 


Predictive validity. I evaluated the model’s predictive 
ability using a jackknifing technique, holding out a target 
firm, reestimating the model on other firms, and then using 
the estimated parameters to predict the target firm's Tobin 
q. I computed the mean absolute deviation (MAD), defined 
as (I/NT)ZT. EA. ,ITob qq; — Tob qu, where Tob go 
and Tob_q, denote observed and predicted Tobin's q for 
firm i in year t, N denotes the number of firms, and T, 
denotes the number of years for each firm i, respectively. 
The MAD using the proposed model was .473, compared 
with a MAD of .713 with the average value, representing an 
improvement of 3496. Thus, the proposed model predicts 
the firm's intangible value well. 


Discussion 

Despite the growing importance of dual distribution chan- 
nels in practice, there are few insights into their perfor- 
mance implications. In this article, I explore the relationship 
between dual distribution strategy and intangible value. I 
conclude with a discussion of the theoretical contributions, 
managerial implications, and limitations and opportunities 
for further research. 


Theoretical Contributlons 


Marketing channels. By relating dual distribution strat- 
egy to intangible value, this article addresses the call for 
research on the performance implications of plural gover- 
nance in general and dual distribution in particular (Heide 
2003). The findings indicate that a firm's dual distribution 


strategy significantly affects its intangible valüe. Further- 
more, the differential main and interaction effects of dual 
distribution across the four latent segments suggest firm 
heterogeneity in the value relevance of dual distribution. 

For some firms (Segments 1 and 2), an emphasis on 
dual distribution contributes both independently and jointly 
with other firm characteristics to their intangible values. For 
others (Segment 3), their intangible values are mostly 
impervious to their dual distribution. Still others (Segment 
4) are punished by the stock market for emphasizing dual 
distribution, albeit in conjunction with other character- 
istics. This evidence of firm heterogeneity in the value rele- 
vance of dual distribution empirically validates| prior theo- 
retical developments (e.g., Bergen, Dutta, and Walker 1992; 
Heide 1994) that indicate both advantages and disadvan- 
tages of dual distribution. The generalizability of the contin- 
gent nature of the performance effects of dual distribution 
to other performance metrics (e.g., profit, market share) 
emerges as an area for further research. 

Importantly, from a shareholder value perspective, dual 
distribution does not offer unilateral benefits to all firms, as 
implied by some observers (e.g., Bradach 1998; Moriarty 
and Moran 1990). Instead, it is contingent on their other 
characteristics, including marketing and financial strategies. 
This finding, which stresses synergies between aspects of 
firms' distribution and business strategies, represents a use- 
ful extension to marketing strategy research in general and 
to marketing channels research in particular, which, for the 
most part, has not examined such synergies. 


Marketing metrics. This article's findings also con- 
tribute to the marketing metrics literature, in which there is 
a gap With respect to the relationship between a firm's dis- 
tribution strategy (dual distribution, in particular) and its 
performance. By providing strong evidence of the value 
relevance of dual distribution strategy, this study takes a 
first step toward addressing this gap. 

In a departure from prior studies in both marketing and 
accounting literature, which have used a functional silo- 
based approach (i.e., using either marketing or accounting 
measures, respectively) to study the value relevance of strat- 
egy, this article's findings support interactions between 
aspects of marketing and finance strategies. These interac- 
tions suggest a previously unexamined potential inter- 
changeability between marketing and financial strategies in 
maximizing intangible value. For example, a firm with high 
financial liquidity (firms in Segment 1 and Segment 4) can 
decrease its emphasis on dual distribution to increase its 
shareholder value. Exploring the boundaries of such inter- 
changeability across marketing and other aspects of busi- 
ness strategies (e.g., finance, accounting, human resources) 
is a worthwhile area for further research, given the growing 
importance of the financial accountability of marketing 
(Rust et al. 2004). 

Furthermore, the value relevance of dual distribution 
suggests that intangible, market-based channels not 
accounted for on firms' balance sheets but rather on their 
market agents' balance sheets significantly affect their 
intangible values. This finding implies the existence of 
boundaries beyond firms' balance sheets that are critical for 
an accurate assessment of their intangible values. The 


demarcation of such extended boundaries for inventorying 
value-relevant, intangible, market-based assets emerges as 
an important area for further research. 

Finally, the unhypothesized negative effect of advertis- 
ing stock on intangible value (Segment 2 and Segment 4) is 
counterintuitive and merits discussion. For the large firms 
(Segment 2), perhaps ceiling effects on their large advertis- 
ing stocks produce this negative effect. For the small firms 
(Segment 4), perhaps the stock market views advertising as 
an exploratory, risky, value creation strategy and an empha- 
sis on market-based channels as an exploitative, value 
appropriation strategy (March 1991). These findings, com- 
bined with those of Mizik and Jacobson (2003), who report 
a value creation role for research and development and a 
value appropriation role for advertising, imply that whether 
the stock market views a given strategy as creating or 
appropriating value may be context specific. 


Managerial Implications 


The study's findings also generate useful implications for 
marketing practice. For example, the significant and differ- 
ent main and interaction effects of dual distribution on 
firms' intangible value across the four latent segments indi- 
cate that the value relevance of dual distribution is contin- 
gent on other firm characteristics. This clarifies the puzzling 
relationship between chains’ dual distribution strategies and 
their intangible values reported in Figure 1. For example, in 
2002, the differences in intangible values between McDon- 
ald's Corporation (ticker: MCD: Tobin's q = 1.290) and 
Panera Bread Company (PNRA: Tobin’s q = 5.300), both 
with similar dual distribution strategies (.72 of their system 
units are franchised), may be partly due to the differences in 
their characteristics, including age (MCD = 47 years; 
PNRA = 15), system units (MCD = 31,108; PNRA = 478), 
advertising stock (MCD = 1088.112; PNRA = 13.445 [in 
million of dollars]), financial leverage (MCD = .400; 
PNRA = .000), and financial liquidity (MCD = .489; 
PNRA = 1.348), which influence the relationship between 
their dual distribution and their intangible values. 

Given the increasing pressures on marketing executives 
to demonstrate the financial accountability of marketing 
and the limited prior research on the financial accountabil- 
ity of distribution strategy, the value relevance of dual dis- 
tribution is both a novel and a useful insight. Marketing 
managers can use the current findings to convince various 
stakeholders (e.g., top management and finance executives, 
some of whom may be skeptical of the financial account- 
ability of marketing) of the value relevance of marketing in 
general and dual distribution strategy in particular. Man- 
agers can use the study’s specific findings to develop 
investor communications programs to increase their firms’ 
intangible value. In addition, given the model’s predictive 
ability, investment analysts can use it to assess a chain’s 
intangible value, given a set of characteristics. 

At an operational level, the study’s findings generate 
action points for managers of firms (in the four segments) to 
maximize their firms’ intangible values: 

Segment 1. For the poorly performing, medium-sized firms in 

Segment 1, a reduced emphasis on market-based 
channels, especially at moderate levels and when 
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they have high financial liquidity, and an 
increased scope of vertical integration will reduce 
the negative effects of dual distribution on their 
intangible values. Firms in Segment 1, with high 
financial liquidity, will be well served from a 
shareholder value perspective by investing their 
cash resources in vertically integrated channels. 
Alternatively, these firms can invest their cash 
resources 1n additional advertising because as their 
advertising stocks increase, an increased emphasis 
on market-based channels increases their intangi- 
ble value. 

Segment 2. The large firms with large advertising stocks in 
Segment 2 can increase their intangible values by 
emphasizing market-based channels because dual 
distribution has both positive main and interaction 
effects with the scope of their vertical integration 
and advertising stocks. The negative effect of the 
interaction between dual distribution and financial 
leverage suggests that these firms will be well 
served from a shareholder value perspective by 
reducing their financial leverage. 

Segment 3. In general, the shareholder values of the small 
firms with a large portfolio of brands in Segment 3 
appear to be impervious to their dual distribution 
strategies. One exception 1s the negative effect of 
the interaction between their dual distribution and 
financial leverage on intangible values. These 
firms can increase their intangible values by 
decreasing their financial leverage. 

Segment 4. Surprisingly, for the very small, well-performing 
firms, there are no positive effects of dual distribu- 
tion, either independently or jointly with other 
charactenstics. However, dual distribution has a 
negative interaction effect with age, advertising, 
and financial liquidity, suggesting that over time 
and as their advertising stocks and liquidity 
increase, the stock market rewards them for 
investing in vertically integrated channels. 


Finally, a study of performance in franchise firms is 
worthy of empirical investigation in its own right because 
franchising is a vital force in the U.S. economy (Dant and 
Kaufmann 2003). In 2003, business format franchises 
accounted for more than 600,000 establishments, 8 million 
jobs, and $460 billion of economic output (Pricewater- 
houseCoopers 2004). Restaurant chains dominate the indus- 
try, and 15 of the 20 largest franchises are restaurants with 
sales representing 54% of revenues of all U.S. franchises. 
Yet there are few insights into the performance of franchise 
firms. As Bradach (1997, p. 301) notes, “future work should 
investigate how the functioning and performance of the 
plural forms compares in chains with different mixes of 
company and franchised units.” The current study’s findings 
on the factors affecting a chain’s performance could be use- 
ful to managers of franchise firms in general and to man- 
agers of restaurant chains in particular to increase their firm 
value. 


Limitations and Opportunities for Further 
Research 


In this first study on the performance implications of dual 
distribution, I focused on franchising in general and restau- 
rant chains in particular. The choice of one industry allows 
for the control of cross-industry variation, enabling a clean 
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test of the hypotheses. However, as a result, I do not con- 
sider environmental characteristics (e.g., uncertainty, 
competitive conditions), which have been shown to be per- 
tinent to channel decisions. In addition, the use of sec- 
ondary data precluded consideration of organizational fac- 
tors (e.g., culture, trust, organizational structure) that affect 
channel management. Further research on the performance 
implications of dual distribution that uses other methods 
(e.g., in-depth interviews, surveys) across diverse industry 
contexts incorporating other environmental and organiza- 
tional characteristics would be useful. 

This article’s focus on dual distribution raises questions 
about the generalizability of the study’s findings to other 
plural governance settings. The study’s sample (N = 55), 
though representative of the population of restaurant chains, 
is small. Further research on performance in other plural 
governance settings (e.g., foreign market entry, outsourc- 
ing) with larger sample sizes would represent useful 
extensions. 

The dependent variable in this study is intangible firm 
value, as measured by Tobin’s q. Although Tobin’s q is a 
reliable measure of firm value, it represents the stock mar- 
ket’s evaluation of the level, speed, and risk of the firm’s 
future cash flows. Research extensions using alternative 
performance metrics, including survival, risk, and volatility 
in earnings, would both complement and extend the study’s 
findings. 

In summary, this study represents a useful first step in 
exploring the performance implications of dual distribution 
strategies. I hope that this article stimulates further work 
that relates plural governance in general and dual distribu- 
tion in particular to firm performance. 


Appendix 


Outline of Latent Class Regression Models 


Let f = 1, ..., F index the restaurant chains; t = 1, ..., Tr 
index the year of restaurant chain f; s = 1, ..., S index the 
number of latent segments; Yg be the Tobin’s q of restaurant 
chain f in year t; and Xp, be the value of the pth explanatory 
variable for restaurant chain f in year t. Assume that the 
vector of observations for Yg arises from a population that 
is a mixture of S segments in proportions T, Ta, 13, TG, 
where a priori membership of the observations in the differ- 
ent segments is unknown. Then, each firm belongs to only 
one segment, and the point mass for each segment is 
denoted by 7,, such that 


S 
(Al) Yn =1, and 05x, <1, 


s=1 


where 1, is the point probability of a firm belonging to 
segment s. 

The objective is to predict the means of the dependent 
variable for each observation in each segment using a set of 
explanatory variables. To that end, I specify a linear predic- 
tor Nfa Which is a function of the P explanatory variables, 
X1, X5, ..., Xp (Xp = Xfip; p=1,..., P), and parameter vec- 
tors D = Bsp to be estimated in segment s: 


P 
(A2) tins » Xu 
pel 


In turn, the linear predictor is related to the mean of the 
distribution, Jg, through a link function g(-) such that in 
segment s, 


(A3) Nts = EH). 


In the current case, given the panel structure of the data, 
the Yg values are not independent across the t time periods. 
The distribution of Yp, given that it comes from segment s, 
f. (Y8l0,) is normal within each of the s segments, so that 
the identity link function is pertinent such that Na, = Hyt- 
Thus, by combining Equations À2 and A3, I obtain the stan- 
dard linear regression model within the segments, given the 
assumption that the firm can belong to only one segment. I 
model T, with a logit formulation. Specifically, 


(A4) T, = 





Ys 


s=] 


where 6, is the segment-specific vector of profiling coeffi- 
cients. Then, Equation A4 is standardized as 


eð; 


EE em 
1+ V es 


s-l 


(A5) T, 


> 


and 6, is parameterized as 


P 
(A6) 5, = Bos + >, zB, 


p-l 


where zy is the matrix of profiling variables; Bo, is a 
segment-specific constant term; is a vector of coeffi- 
cients for the profiling variables V s = 1, ..., S — 1; and P is 
the number of profiling variables. 


Estimation and Model Selection 
If the distributions conditional on the segments are formu- 
lated, the unconditional distribution of Ya is obtained as 
follows: 

S 


(A7) ECY) = 9 n f, Y,6,), 


£=] 


where ọ = (n, 0) and 0 = (By, A,). The purpose of the latent 
class regression estimation is to estimate the parameter vec- 
tor 9 = (x, 8) using the maximum likelihood method, so 
that the likelihood for q is 


F T 
(A8) Li; Y) =] [| [svo 


f-1tzl 


For a prespecified value of the number of segments s, an 
estimate of p (po) can be obtained by maximizing the like- 
lihood Equation A8 with respect to «, subject to the restric- 
tions in Equation A1, so that the parameter vector q could 
have produced the observed vector Y. To obtain the param- 
eter estimates, I use the expectation-maximization algo- 
rithm to maximize the natural logarithm of the likelihood 
function specified in Equation A8. I use 50 randomly 
selected starting values for each of the models to ensure 
convergence (Wedel and Kamakura 2000). To obtain the 
standard errors for the parameters, the Hessian (or the infor- 
mation matrix) for the final parameter estimates is inverted. 
After an estimate of @ has been obtained, estimates of the 
posterior probability p, a probabilistic allocation of the 
firm to the latent segments, are obtained using Bayes' theo- 
rem as follows: 


(A9) p BAO) . 
>) nou) 


$-1 


To determine the number of latent segments, the BIC 
(Schwarz 1978) is used to compare the model with s seg- 
ments with a model with s + 1 segments V s = 1,2, ..., until 
the model fit stops improving. The BIC is calculated as 
BIC = -2 x LL + P x In(F), where LL, P, and F stand for 
log-likelihood value, number of parameters, and sample 
size, respectively. I also report an entropy measure of sepa- 
ration (ES), or the extent of separation of the segments, 
which is calculated as 


F S 
Y, ` P& In(pg) 


(A10) ES = 1- 
F In(S) 


, 


where pg is the probability of firm f belonging to segment s, 
which is calculated using the Bayes rule specified in Equa- 
tion A9. The ES is bounded in the range 0-1, such that a 
value closer to 1 indicates good separation of the latent 
segments. 
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How Couniry Characteristics Affect 
the Perceived Value of Web Sites 


The authors examine how country characteristics systematically moderate the effects of individual-level drivers of 
the perceived value that consumers derive from visiting a brand manufacturer's Web site. They test hypotheses on 
data collected from 8886 consumers from 23 countries on three continents, involving 30 Web sites of the world’s 
largest consumer packaged goods companies. They find that the effect of privacy/security protection on perceived 
value is stronger for people from countries with a weak rule of law, whereas people from countries that are high on 
national identity give more weight to whether there is cultural congruity between the site and themselves. People 
who live in more individualistic countries give more weight to pleasure, to privacy/security protection, and to 
customization in thelr perceived value judgments than people from collectivistic countries. The authors discuss 


implications for Web site design strategies. 


is easily one of the most important developments in 

societies around the globe. Increasingly, companies 
are creating Web sites in an attempt to add value for their 
consumers. Both academics and practitioners have empha- 
sized the key role of consumer value delivery in company 
success (Day 1990; Gale 1994; Naumann 1995; Woodruff 
1997; Zeithaml 1988), including the success of Internet- 
based activities (Zeithaml, Parasuraman, and Malhotra 
2002). 

Given the global nature and reach of the Internet, con- 
sumers across the world can access Web sites. Although 
academics and managers agree about which factors (e.g., 
privacy/security, customization) affect consumer value, 
there is a lack of research into how the effectiveness of vari- 
ous factors in creating consumer value may vary across 
countries. The primary purpose of this study is to test 
whether relationships between consumer value and its dri- 
vers are systematically and predictably moderated by the 
institutional context in which consumers live. 

We investigate this issue in the context of Web sites 
operated by large consumer packaged goods (CPG) compa- 
nies. Whereas Web sites may be designed for various com- 
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mercial uses, including being a communication channel and 
offering an (additional) online sales channel, the focus of 
this article is on CPG sites that are primarily designed as a 
communication tool to forge cognitive and emotional bonds 
between a company's customers and its brands. Since con- 
sumer uptake of the Internet has rocketed, the Internet has 
become a major avenue for CPG companies to reach their 
consumers. In July 2003, the top ten sites of food and bev- 
erage makers attracted 9.5 million unique visitors, more 
than 75% more than a year before and only slightly fewer, 
for comparison's sake, than the number of visitors to the top 
ten university sites (Tedeschi 2003). Yet, in bonding with 
their consumers, “more [CPG] companies have failed than 
succeeded" (Curry 1999, p. 20), and 1596 of the top CPG 
brands still lack a Web site because of a lack of understand- 
ing of what their target consumers value in using the Inter- 
net (New Media Age 2003). 

The contribution of this study is twofold. First, we 
develop a conceptual model that pertains to the moderating 
role of the institutional context in which consumers live on 
the effects of individual and Web site characteristics. Sec- 
ond, we test the hypotheses using a large-scale data set, 
involving 30 Web sites of 16 of the world's largest CPG 
companies from 8 different European countries and more 
than 8500 visitors from 23 countries on three continents 
(Australasia, Europe, and North America). 

In the following sections, we introduce the main effect 
hypotheses. Given previous support in the literature for 
these relationships, this introduction is brief and primarily 
serves as a basis to develop the focal country-level moderat- 
ing hypotheses. These sections are followed by a discussion 
of the methods we used to test the hypotheses. Finally, we 
provide the results and discuss the implications of our 
findings. 


Drivers of Perceived Value 


The focal construct in our study is the "perceived value" 
that consumers derive from visiting a brand manufacturer's 
Web site. Drawing on Holbrook's definition and theory of 
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perceived value (Holbrook 1994, 1999; Holbrook and Corf- 
man 1985), we define the perceived value of a Web site as 
an interactive, relativistic, preference experience that results 
from visiting the Web site. It is “interactive” in that a value 
judgment is neither wholly subjective nor wholly objective 
but rather entails a subject-object interaction, meaning that 
it is affected by aspects of the person (e.g., emotions, 
sociodemographics) visiting the Web site and the character- 
istics of the Web site (e.g., privacy/security). It is "relativis- 
tic" in that value judgments differ among Web site visitors, 
just as consumer characteristics and perceptions of Web site 
characteristics may differ among people. Value involves 
"preference," which Holbrook and Corfman (1985, p. 40) 
interpret as "favorable disposition"; this encompasses 
evaluations (e.g., the Web site is useful) and intentions to 
act on these evaluations (e.g., intention to visit the Web site 
again; see also Chen, Clifford, and Wells 2002; Wells and 
Prensky 1996). Finally, value is an "experience" in that it 
resides in the consumption of the object, that is, in surfing 
the Web site. 

Figure 1 presents our conceptual framework, delineat- 
ing drivers of perceived value of Web sites. We.discuss the 
hypotheses underlying this framework subsequently. 


Consumptlon Experience 


The consumption experience has both a cognitive (utili- 
tarian and an emotional component (Holbrook and 


Hirschman 1982). This also applies to the Internet context; 
thus, it becomes crucial to evaluate “Web sites in terms of 
the extent to which they deliver these two types of experi- 
ence" (Novak, Hoffman, and Yung 2000, p. 39). By visiting 
a Web site, consumers may acquire useful information to 
help them in their decision process. That is, the Web site 
may be instrumental in the attainment of utilitarian con- 
sumption goals, which is valued by consumers (Shankar, 
Smith, and Rangaswamy 2003). 

Consumers may also derive value from the emotions 
generated during the Web site visit per se. In many cases, 
the consumer can even achieve flow while visiting the site 
(Novak, Hoffman, and Yung 2000). Various emotion typolo- 
gies have been proposed in the literature, of which the 
pleasure-arousal typology is among the most prominent 
(e.g., Eliashberg and Sawhney 1994; Olney, Holbrook, and 
Batra 1991; Reisenzein 1994). "Pleasure" refers to the 
affective feelings generated by the Web site, and "arousal" 
pertains to the level of activation or excitement generated 
by the Web site. The higher the feelings of pleasure evoked 
by a site, the more positive its evaluation will be. On the 
basis of the Wundt curve (Berlyne 1971), we hypothesize 
that the relationship between the arousal evoked in a person 
by visiting the Web site and that person's evaluation of the 
Web site takes the form of an inverted U shape; that is, 
moderate degrees of arousal are preferred over low and high 
levels of arousal. Formally, 


FIGURE 1 
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H,: The perceived value derived from a Web site visit (a) is 
positively related to the utilitarian experience associated 
with visiting the site, (b) is positively related to the plea- 
sure associated with visiting the site, and (c) exhibits an 
inverted U-shaped relationship with the arousal induced 
by the site. 


Web Site Characteristics 


Two Web site characteristics that are consistently men- 
tioned in the literature as being among the most important 
characteristics affecting consumer evaluations of sites are 
privacy/security and customization (Sultan et al. 2002; Zeit- 
haml, Parasuraman, and Malhotra 2002). Given the interna- 
tional scope of our study, a third characteristic appears per- 
tinent: the cultural congruity between the site and its visitor 
(Luna, Peracchio, and De Juan 2002). 

Perceived privacy and security are two related key dri- 
vers of consumer value (Hoffman, Novak, and Peralta 1999; 
Zeithaml, Parasuraman, and Malhotra 2002). Both involve 
protecting Web site users' personal information. The higher 
a site scores on privacy and security protection in the minds 
of consumers, the more the risk that consumers perceive to 
exist when supplying private information online will be 
mitigated (Lwin and Williams 2004), and thus perceived 
value will increase. 

The Internet also provides unique opportunities to cus- 
tomize information. Customization refers to the extent to 
which the Web site's content is adapted to an individual's 
unique needs versus being presented as a standardized 
package. People value personalized experiences (see, e.g., 
Feldman and Lynch 1988). 

The Internet allows global access to a Web site. This 
offers the opportunity for cross-border information flow, but 
it also means that there may be cultural incongruity 
between the site and the visitor. Cultural congruity refers to 
whether the Web site is translated into the visitor's local 
country-specific language; offers local content; and exhibits 
localization in the form of country-specific dates, zip codes, 
currency formats, and so forth (Singh, Furrer, and Ostinelli 
2004). Exposure to a culturally congruous site facilitates 
processing of the site's content (Luna, Peracchio, and De 
Juan 2002) and conveys the marketer's respect for the visi- 
tor's culture (Koslow, Shamdasani, and Touchstone 1994), 
both of which consumers value. Thus: 


Hp: The perceived value derived from a Web site visit (a) is 
positively related to the perceived privacy/security protec- 
tion of the site, (b) is positively related to the site's degree 
of customization, and (c) is higher when there is cultural 
congruity between the site and the consumer. 


The Moderating Role of a Country's 
Institutlonal Context 


Web site visitors are members of larger institutional sys- 
tems and thus are subject to the conflicts and compatibili- 
ties between their own evaluative structure and the priorities 
of the institutional system. Institutions are "the most endur- 
ing features of social life ... giving ‘solidity’ [to social sys- 
tems] across time and space" (Giddens 1984, p. 24). A soci- 
ety's institutional priorities reflect the basic issues and 
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problems that societies must confront to regulate human 
activity. The shared institutional priorities in a society help 
shape the social and economic reward contingencies to 
which people must adapt to function smoothly and effec- 
tively (Schwartz 1994). 

A society's institutional priorities stimulate the activa- 
tion of beliefs that are in line with and conducive to main- 
taining these priorities, and beliefs that run counter to these 
priorities are discouraged (Scott 2001; Triandis 1989). 
Thus, as standards, these priorities influence how individual 
beliefs and emotions are evaluated and what kinds of 
expressions are encouraged or discouraged. These positive 
or negative institutional reinforcement mechanisms, which 
operate between the two types of constructs, give rise to 
interactions between society-level and individual-level 
variables (Schwartz 1994). 

Scott (2001) distinguishes three 1nterrelated but distinct 
"pillars of institutions": the regulative, moral, and cultural 
systems (see also Grewal and Dharwadkar 2002; Suchman 
1995). Each of these systems has been identified by various 
social theorists as a vital component of a country's institu- 
tional context (Scott 2001). These three systems form a 
continuum that moves "from the conscious to the uncon- 
scious, from the legally enforced to the taken for granted" 
(Hoffman 2001, p. 36). We summarize some of the key 
characteristics of the three pillars of institutions in Table 1. 

The "regulative" system involves the capacity to estab- 
lish formal rules, inspect society members' conformity to 
them, and, if necessary, impose sanctions (Scott 2001). It 
includes the presence and efficacy of regulatory institutions 
and the associated legal system that exists to ensure stabil- 
ity, order, and continuity of societies (Arndt 1979). The 
"moral" system involves societal norms of right and wrong 
and "introduces a prescriptive, evaluative, and obligatory 
dimension in social life" (Scott 2001, p. 54). It defines goals 
or objectives (e.g., making a profit, loving one's country) 
but also specifies appropriate ways to pursue them (e.g., 
conceptions of fair business rules, purchase of domestic 
products). The "cultural" system represents the culturally 
supported beliefs, attitudes, habits, and behaviors. It main- 
tains that external cultural frameworks shape internal inter- 
pretive processes and shared understanding (Triandis 1989). 
Certain routines (e.g., shaking hands when meeting some- 
one) are followed because they are taken for granted “as the 
way we do these things" (Scott 2001, p. 57). In this study, 
we include one key component of each of these pillars of 
institutions, namely, rule of law (regulative; Oxley and 
Yeung 2001), national identity (moral; Giddens 1981), and 
national-cultural individualism (cultural; Oyserman, Coon, 
and Kemmelmeier 2002). 


Regulative System: Rule of Law 


In general, it is recognized that the rule of law is an essen- 
tial component of the regulative subsystem (Licht, Gold- 
schmidt, and Schwartz 2003; North 1990; Oxley and Yeung 
2001; Scott 2001). It is a central tenet in the “good gover- 
nance" agenda of the major international institutions, such 
as the International Monetary Fund and the World Bank 
(Kaufmann, Kraay, and Zoido-Lobatón 1999). These bodies 
define governance as the traditions and institutions by 


TABLE 1 
Three Pillars of Institutions 





Regulatlve 
Basis of compllance Expedience 
Basls of order Regulative rules 
Mechanisms Coercive 
Basis of legitimacy Legally sanctioned 
Primary scientific domain Economlcs 


Source: Adapted from Scott (2001). 


which authority in a country is exercised for the common 
good. The rule of law addresses the degree to which the 
behavior of individuals and organizations (including gov- 
ernment authorities) is guided by formal, transparent, legal 
rules that apply to everyone, regardless of rank and status 
(Licht, Goldschmidt, and Schwartz 2003). A country with a 
strong rule of law has sound political institutions and a 
strong, independent court system. Its citizens are entitled to 
equal protection under the law and are willing to accept the 
established institutions to make and implement laws and 
resolve disputes (Oxley and Yeung 2001). 

A strong rule of law generates greater transparency and 
stability regarding the boundaries of acceptable behavior. It 
reduces Web site visitors' uncertainty about what legal pro- 
tection they can expect and enhances their ability to suc- 
cessfully litigate at least the most serious cases of fraudu- 
lent online dealings. In countries in which tbe rule of law is 
weak, this ability is undermined and online visitors bave 
greater need for other safeguards against fraud and misuse 
of their trust. A particularly important type of online safe- 
guard is the privacy/security protection offered by the Web 
site (Zeithaml, Parasuraman, and Malhotra 2002), which 
therefore is expected to be (even) more important in coun- 
tries that are characterized by a weak rule of law. 

H«: The effect of perceived privacy/security protection on per- 

ceived value is stronger in countries with a weak rule of 
law. 


Moral System: National Identity 


Previous research in marketing has noted the important role 
of one particular moral construct in explaining 'consumer 
attitudes and behavior: consumer ethnocentrism (Balabanis 
et al. 2001; Shimp and Sharma 1987; Steenkamp, Batra, 
and Alden 2003), defined as "the beliefs held by [Ameri- 
can] consumers about the appropriateness, indeed morality 
of buying foreign-made products" (Shimp and Sharma 
1987, p. 280, emphasis added). There are systematic differ- 
ences in consumer ethnocentrism among countries (e.g., 
Balabanis et al. 2001). | 

The construct of national identity is conceptually 
closely related to consumer ethnocentrism (cf. Baughn and 
Yaprak 1996) in that both constructs provide prescriptions 


Moral Cultural 


Social obligation Taken-for-grantedness 


Shared understanding 


Binding expectations Constitutive schema 
Normative Mimetic 
Morally governed Comprehensible 

Hecognizable 
Culturally supported 
Sociology Anthropology 
Psychology Sociology 


for morally motivated behavior. However, whereas con- 
sumer ethnocentrism is a consumer-domain-specific con- 
struct, national identity is a more abstract, "central" con- 
struct (cf. Lastovicka 1982) and, as such, is more central to 
a society's moral system (Scott 2001). The theoretical basis 
of national identity is social identity theory, which distin- 
guishes between in-groups and out-groups and posits the 
incorporation of the in-group as part of the self of its mem- 
bers (Mackie and Smith 1998; Tajfel 1981). Various theo- 
rists have argued that nations can be viewed as communi- 
ties, whose members form a social group. For example, 
Giddens (1981) stated that a nation's inhabitants have "an 
overall awareness ... of belonging to an inclusive commu- 
nity with a certain identity.” In a similar vein, Druckman 
(1994, p. 63) argues that nations "achieve personal rele- 
vance for individuals when they ... gain a sense of identity 
and self-esteem through their national identification." 
Countries that are high on national identity take pride in 
and display a positive bias toward their country's symbols, 
language, and culture (Billig 1995). These daily reminders 
of national identity, as well as its omnipresence in education 
and mass media, provide a continuous reinforcement of the 
feeling of belonging to a national group (Verlegh 2001). 
The level of national identification differs widely among 
countries (International Social Survey Program [ISSP] 
1998), and differences in national identity are related to dif- 
ferential tendencies to favor culturally adapted products and 
other culturally congruous symbols (cf. Kowalski and 
Wolfe 1994; Steenkamp, Batra, and Alden 2003). Consis- 
tent with this body of research, we expect tbat people from 
countries that are high on national identity attach more 
importance to the cultural congruity of the Web site in shap- 
ing their value perceptions. 
H4: The effect of cultural congruity on perceived value is 
stronger in countries characterized by high national 
identity. ` 


Cultural System: Natlonal-Cultural Individualism 

National-cultural individualism pertains to the degree to 
which people in a country prefer to act as individuals. Con- 
versely, collectivism pertains to the extent to which people 
in a country prefer to act as members of a group. People in 
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individualistic societies place their personal goals, motiva- 
tions, and desires ahead of those of others, whereas collec- 
tivistic cultures are conformity oriented and show a higher 
degree of group behavior and concern to promote their con- 
tinued existence (Kagitcibasi 1997). Individualism’s 
“pivotal role” (Allik and Realo 2004, p. 32) in the cultural 
system has been consistently emphasized by social science 
researchers (Oyserman, Coon, and Kemmelmeier 2002). It 
has been studied as such in a wide range of disciplines, 
including philosophy, history, anthropology, sociology, psy- 
chology, and business (Triandis 1995). Kagitcibasi (1997, p. 
3) notes that "about one-third of recently published studies 
cited this construct [individualism/collectivism] as at least a 
partial explanation of observed cross-cultural differences." 

Psychologists have found that compared with collec- 
tivistic societies, in individualistic societies, emotions fea- 
ture more prominently in people's judgments because emo- 
tions provide direct feedback about the fit between personal 
needs and goals and the stimulus experienced (Schimmack 
and Diener 1997; Schimmack et al. 2002). In collectivistic 
societies, fulfillment of personal goals is also important, but 
relatively more weight is given to following cultural norms. 
In these cultures, there is relatively more emphasis on 
acquiring skills and gaining expertise than in individualistic 
societies (Hofstede 2001; Schimmack et al. 2002). Conse- 
quently, we expect that emotional experience receives more 
weight in the value derived from a Web site visit in indi- 
vidualistic societies than in collectivistic societies, whereas 
the utilitarian experience is given more emphasis in collec- 
tivistic societies than in individualistic societies. 

In individualistic societies, the social fabric and group 
norms are much looser. These societies emphasize the pri- 
vate self, whereas the social self receives relatively more 
weight in collectivistic societies (Trafimow, Triandis, and 
Goto 1991). Individualistic societies emphasize that "every- 
one has a right to a private life,’ whereas in collectivistic 
cultures, "private life is invaded by institutions and organi- 
zations to which one belongs" (Hofstede 2001, p. 227). Per- 
sonal security is also given more weight in individualistic 
cultures because the mechanism operant in collectivistic 
societies—namely, that security is provided by the organi- 
zation or society—is relied on less in individualistic soci- 
eties (Hofstede 2001, pp. 226—27, 267). In summary, it can 
be expected that personal privacy and security are given 
more weight in individualistic societies. 

In their seminal article, Oyserman, Coon, and Kem- 
melmeier (2002) conclude that desire for uniqueness and 
independence are core elements of cultural individualism. 
In individualistic societies, a person's attitudes and behavior 
are regulated largely by individual preferences, whereas in 
collectivistic societies, attitudes and behavior are heavily 
influenced by society's preferences (Triandis 1989). In indi- 
vidualistic countries, children are socialized to think of 
themselves as “I.” That is, an individual's identity is clearly 
distinct from other people's *T's" (Hofstede 2001, p. 227). 
People in individualistic societies value consumption expe- 
riences that are customized to their own unique needs more 
than people in collectivistic societies. In summary, we pro- 
pose the following moderating effects involving national- 
cultural individualism: 
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H5: The effect of utilitarian experience on perceived value is 
stronger when national-cultural individualism is lower. 

Hg: The effect of emotional experience on perceived value is 
stronger when national-cultural individualism is higher. 

H}: The effect of perceived privacy/security protection on per- 
ceived value is stronger when national-cultural individual- 
ism is higher. : 

Hg: The effect of perceived customization on perceived value 
is stronger when national-cultural individualism is higher. 


Method 


Data Collection 


Data collection was carried out in close collaboration with 
16 U.S. and European CPG companies. They allowed us to 
post a questionnaire on their Web sites in one or more of the 
following countries: Belgium, France, Germany, Italy, the 
Netherlands, Spain, Sweden, and the United Kingdom. We 
obtained information on the individual-level constructs 
using questionnaires. We used secondary sources for the 
country characteristics. Furthermore, we conducted in- 
depth interviews with ten consumers to augment the quanti- 
tative results with qualitative insights into why the Web 
sites provided high or low perceived value. These con- 
sumers were not aware of the results of the large-scale 
quantitative study. 


Questionnaires. We posted a survey on 30 Web sites and 
requested that online visitors fill out the survey. Electronic 
data collection was conducted in collaboration with the 
global market research agencies Taylor Nelson Sofres and 
GfK and the Internet market research agency MetrixLab. . 
Online questionnaires were completed by 9546 consumers 
in 107 countries. We excluded 370 respondents because 
they had an excessive number of missing responses. In 
addition, for this study, we used only the countries from 
which at least 15 responses were returned. This rule 
resulted in the exclusion of an additional 290 completed 
surveys from further analysis. Our final data set consisted of 
8886 consumers from 23 countries (Australia, Austria, Bel- 
gium, Canada, Croatia, Denmark, France, Germany, Ire- 
land, Italy, Macedonia, the Netherlands, New Zealand, 
Poland, Romania, Russia, Serbia and Montenegro, Slove- 
nia, Spain, Sweden, Switzerland, the United Kingdom, and 
the United States) reporting on 30 Web sites of 16 CPG 
companies. The list of companies and Web sites examined 
appears in Table 2. 


Secondary data. We obtained country data from exter- 
nal sources (Hofstede 2001; ISSP 1998; Kaufmann, Kraay, 
and Zoido-Lobatón 1999; Suh et al. 1998). 


Measures 


We developed the questionnaire in English in close collabo- 
ration with an advisory committee of Internet experts (e.g., 
vice president of Internet marketing) of five of the partici- 
pating CPG companies. We subsequently discussed the 
questionnaire in field interviews with 14 senior managers 
from ten of the participating CPG companies and pretested 
it on a Web site. After we made the necessary modifica- 
tions, the questionnaire was finalized, back-translated into 


Firm 


TABLE 2 


Firms, Brands, Product Categories, and Web Sites Examined 


Web Site Brand Product Category 
Banlla www.barilla.com Barilla Pasta and pasta sauces 
www.barilla.It ` Barilla Pasta and pasta sauces 
Beiersdorf www.nlvea.de Nivea Personal care 
Cadbury www.schweppeseuro.com Schweppes Beverages 
Schweppes 
Colgate-Palmolive www.colgate-palmolive.de Multiple brands Personal care, household care, 


(e.g., Colgate, Ajax) 


pet food 


Danone www .danoneconsells.com Danone, Lu Dalry products, biscuits, 
beverages 
Heineken www.helneken.com Helneken Beer 
Heinz www.heinz.co.uk Heinz Ketchup, beans, spaghetti 
Henkel www.DonnabD.it Dixan, Pril, Vernel, Testanera Household care, personal care 
Kraft Foods www.milka.de . Milka Chocolate confectionery 
www.milkaski.de Milka Chocolate confectionery 
www.jacobs.de ` Jacobs Coffee 
www.estrella.se Estrella Salty snacks 
Lindt & Sprüngli www.lindt.com ` Linat Chocolate confectionery 
Mars www.mypetstop.com/fr Pedigree, Whiskas Pet food 
www.mypetstop.com/ge Pedigree, Whiskas Pet food 
www.celebrations.co.uk Celebrations Confectionery 
Procter & Gamble www.olay.com Olay Personal care 


www.uk.pantene.com 


Pantene Pro V 


Hair care 


Reckitt Benckiser www.finish.co.uk : Finish Dishwashing products 
Sara Lee www.de.nl Douwe Egberts Coffee 
www.senseocrema.nl Senseocrema Coffee pads 
www.geschenkenwinkel.nl Douwe Egberts, Pickwick, Coffee, tea, tableware, textiles, 
Senseo coffee makers 
www.zwitsal. nl Zwitsal Baby care 
SCA Hygiene www.bodyform.co.uk Bodyform Sanitary napkins 
Unilever WWw.knorr-gastro.ch. Knorr Food products 
www.conimex.nl Conlmex Foreign food products 
www.lifeandcooking.nl Multiple brands (e.g., Becel, Food products 
Bertolli) ' 
www.viadirekt.nu ' Via Direkt Laundry detergents 
www.axe.be Axe 


Notes The content of these sites has changed since we carned out our consumer survey. 


Deodorizer 


all languages involved (Dutch, French, German, Italian, 
Spanish, Swedish), and linked to the respective country 
Web sites. To satisfy Web site managers’ concerns, the num- 
ber of items per construct was limited. This is not uncom- 
mon in Internet research conducted in collaboration with 
companies (cf. Shankar, Smith, and Rangaswamy 2003). 
The Appendix provides detailed information on the 
measures. 


Individual-level constructs. We measured perceived 
value with five items, which collectively capture the percep- 
tions and behavioral intentions that have been associated 
with perceived value in the literature (Parasuraman, Zeit- 
haml, and Malhotra 2005; Sirdeshmukh, Singh, and Sabol 
2002; see also Holbrook 1999; Chen and Wells 1999). 
Seven facets of utilitarian experience are distinguished: (1) 
information provision, (2) easier decision making, (3) con- 
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sumer education, (4) decision confidence, (5) convenience, 
(6) cross-channel integration, and (7) timeliness. These 
seven facets constitute a formative scale because there is no 
a priori reason to assume that they are correlated (Diaman- 
topoulos and Winklhofer 2001). The higher a site scores on 
any of these facets in the mind of a visitor, the higher is his 
or her utilitarian experience. We measured both pleasure 
and arousal with two items taken from the work of Mehra- 
bian and Russell (1974). We measured perceived privacy/ 
security protection with two items, which we adapted from 
the work of Wolfinbarger and Gilly (2003) and Zeithaml, 
Parasuraman, and Malhotra (2002). We measured cus- 
tomization with three items based on the work of Srini- 
vasan, Anderson, and Ponnavolu (2002) and Wolfinbarger 
and Gilly (2003). 

Finally, we operationalized the congruity between the 
consumer and the site as a dummy variable (1 — site is 
adapted to the respondent's culture; 0 = otherwise). Adapted 
sites have three characteristics: (1) They are displayed in 
local country-specific language (e.g., American versus 
British English); (2) they offer local content (e.g., the site 
lists local activities versus the site is a mere translation); 
and (3) they exhibit some level of localization in the form of 
country-specific dates, zip codes, currency, and/or number 
formats (cf. Singh, Furrer, and Ostinelli 2004). Two coders 
independently coded whether each of the 30 sites was 
adapted to each of the 23 countries; there was a 9696 inter- 
rater agreement, and disagreements were resolved by 
discussion. 


Country-level constructs. Rule of law measures the 
respect of citizens and the state for the institutions that gov- 
ern their interactions. It includes several indicators that 
measure the extent to which people have confidence in and 
abide by the rules of society. Together, these indicators 
measure the success of a society in developing an environ- 
ment in which fair, predictable, impartial, and enforceable 
rules form the basis for economic and social interactions 
(Kaufmann, Kraay, and Zoido-Lobatón 1999). The rule-of- 
law measure reflects the statistical compilation of informa- 
tion obtained from surveys, nongovernmental organizations, 
commercial risk-rating agencies, and think tanks. 

We based the national identity measure on the ISSP 
(1998) on national identity. Respondents in various coun- 
tries were asked to rate their opinion on six topics pertain- 
ing to national identity, such as how close they felt to their 
respective countries, national pride, preference to their own 
country's films and programs, and willingness to move out- 
side their country. The ISSP reports the country-averaged 
score for each item. We constructed a national identity scale 
by averaging the countries' scores on the six national iden- 
tity items.! 

We obtained the national-cultural individualism rating 
for each country by averaging the ratings of two leading 
experts in the field: Hofstede and Triandis, after converting 


lFor five countries, national identity ratings were unavailable 
from ISSP (1998). Instead, we used the national identity scores 
from a neighboring country. We conducted a robustness check by 
estimating the model after deleting those countries. The results 
remained substantively the same. 
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Hofstede's ratings to a ten-point scale used by Triandis 
(e.g., Suh et al. 1998). Hofstede's scores have the strength 
of being based on a massive empirical data set, and Trian- 
dis's ratings include more recent information and include 
qualitative insights as well. 


Covariates. We included two sociodemographic 
variables (gender, age), another site characteristic variable 
(accessibility), and 29 site dummy variables to control for 
unobserved site-specific effects (Greene 2000). We mea- 
sured accessibility with two items pertaining to being able 
to access the site quickly and being able to maneuver 
through the site (Novak, Hoffman, and Yung 2000). These 
covariates are not the focus of our study, but controlling for 
their effects provides a stronger test of our hypotheses 
(Greene 2000). 


Measure Validation 


To allow for a stringent test of convergent and discriminant 
validity, we included all reflective individual-level con- 
structs in a single confirmatory factor model. Although the 
chi-square was highly significant (not unexpected, given the 
large sample size), other indicators suggested a good fit 
(Byrne 1998): ¥2(67) = 2070.77, p < .001; comparative fit 
index = .95; Tucker—Lewis index = .93; and root mean 
Square error of approximation = .06. All the indicators 
loaded significantly (p < .001) and substantively (all factor 
loadings exceeded .50) on their hypothesized factors, pro- 
viding evidence of convergent validity of the measures 
(Steenkamp and Van Trijp 1991). To test for discriminant 
validity, we ran a series of nested confirmatory factor analy- 
ses in which we constrained the correlation between each 
pair of reflective constructs to one. For all pairs, when we 
compared the constrained model with a free model, the dif- 
ference was significant, which supports discriminant 
validity (Anderson and Gerbing 1988). Composite reliabili- 
ties were .65 or greater, exceeding the benchmarks that are 
suggested as acceptable (Bagozzi and Yi 1988). 

We further tested the validity of the measures of the 
Web site characteristics perceived privacy/security protec- 
tion and perceived customization by relating them to objec- 
tive site characteristics. Drawing on the definition of 
privacy/security protection, along with consumer pretesting 
and discussions with the advisory committee of Internet 
experts of the participating CPG companies, we specified 
the following four objective privacy/security indicators: (1) 
whether the site tells how it will use the visitor’s personal 
information, (2) whether the site explicitly indicates that it 
will not share personal data with third parties, (3) whether 
the site explicitly indicates that it will not send follow-up e- 
mails or other communications unless the visitor requests 
this, and (4) whether the visitor can delete the information 
the site has about him or her in its database. 

In a similar vein, we developed five objective indicators 
for customization: (1) whether the visitor can personalize 
the content of the site, (2) whether the visitor can personal- 
ize the “look and feel" of the site, (3) whether the visitor 
can personalize fun activities and games on the site, (4) 
whether the visitor can consult an expert through the site, 
and (5) whether the visitor is addressed by name when vis- 


iting the site (after having registered). Two coders scored 
each Web site on each of these objective characteristics (1 = 
yes; 0 = no); there was a 97% interrater agreement, and dis- 
agreements were resolved by discussion. The correlation 
between the summated score on the objective privacy/ 
security indicators and the mean perceived privacy/security 
score per site was .53 (p < .001), whereas this correlation 
was .50 for customization (p < .001). These: correlations 
support the validity of the perceived Web site characteristics 
measures. 


Model Specification 


Our conceptual model of the antecedents of perceived value 
involves variables at two hierarchical levels of aggregation: 
the individual and the country level. The Level 1 
(individual-level) and Level 2 (country-level) models are as 
follows:2 

Level 1: 


(1) PV, = Bo, + BjjUTIL; + D; PLEAS, + B3,AR,, 
+ By, AR; + BsPRIVSEC; 
+ Bg CUSTOM, + P7; CONGRUITY, 
+ Bs,GENDER,, + By AGE, + BjgACCESS;, 


39 
M B SITE, Ti 
11 
Level 2: 
(2) Bo, = Yoo + Yoi LAW, + Yo2NATID) + Yo3INDIV, + ug; 
(3) By = Yio + YilNDIV, + uj; 
(4) Bay = 20 + Yai INDIV; + uj; 
(5) Bs, = 5o + Yi INDIV; + uz; 
(6) Bs, = Yso + Ys1LAWj + YsQINDIV, + usj; 
(7) Be = Yeo + Yer INDIV; + ug; 
(8) By = Yro + Tii NATID, + u7; 
(9) Ba = Yo + Up forr=4, 8-10; and 


(10) Bs = Yso fors = 11, ..., 39; 


where i and j represent individuals and countries, respec- 
tively; PV denotes perceived value; UTIL is utilitarian 
experience; PLEAS represents pleasure; AR is arousal; 
PRIVSEC measures perceived privacy/security protection; 
CUSTOM is perceived customization; CONGRUITY is 
cultural congruity; LAW is the rule of law; NATID is 
national identity; and INDIV is national-cultural individual- 
ism. The covariates GENDER and AGE are self- 
explanatory, ACCESS is the Web site’s perceived accessi- 
bility, and SITE is the dummy variables that denote the 


ZWe estimated our model using SAS PROC MIXED (Singer 
1998). 


specific Web site that was evaluated to control for unob- 
served site effects. 

We assume the individual-level error term rj; to be nor- 
mally distributed with zero mean and variance 02. The ran- 
dom effects Uaj (q = 0, ..., 10) are multivariate normally dis- 
tributed over countries, each with an expected value of zero, 
var(ug) = Tag and COV(Ugj, Ugy) = Tag (q, q' = 0, ..., 10). 
Finally, ug, is the unique effect of country j on the intercept 
(Bo) or slope (Bij ..., Pio)» while controlling for the 
country-level predictor variables (when applicable). A pre- 
liminary analysis indicated that our specification of random 
effects was consistent with the data. In all cases, the vari- 
ance of the error term was significantly different from zero 
(ps « .05). 

We do not hypothesize main effects for the country- 
level variables, because it is not obvious why the perceived 
value of a Web site is systematically influenced by, for 
example, the degree of national-cultural individualism of 
the country the visitor is from. However, we include these 
main effects as covariates for correct estimation of the inter- 
action effects involving the country-level variables (Ped- 
hazur 1982). 


Model Estimation 


In our model, respondents are nested within countries. 
Applying ordinary least squares regression to multilevel 
data is inappropriate because the resulting estimates will be 
biased and the estimated standard errors of the effects will 
be too small (Aitkin, Anderson, and Hinde 1981; 
Raudenbush 1988). Hierarchical linear modeling (HLM; 
Raudenbush and Bryk 2002) has been specifically devel- 
oped to deal with multilevel data. It enables the simultane- 
ous estimation of relationships of variables at two (or more) 
levels, using iterative maximum likelihood estimation. Hier- 
archical linear modeling borrows strength from all the data 
in each of the countries and makes it possible to estimate 
cross-level effects, thus enabling us to test H4—Hg, specify- 
ing how country-level constructs affect relationships occur- 
rng at the individual level. We used HLM to test our 
research hypotheses. Following Raudenbush and Bryk's 
(2002) recommendations, we centered the Level 1 predic- 
tors within countries, and we grand-mean-centered the 
Level 2 predictors. 


Results 


Table 3 presents descriptive statistics and correlations 
between the variables. The results for the HLM model 
appear in Table 4.3 Note that we report unstandardized coef- 
ficients. In HLM, standardized coefficients are problematic 


3We use one-tailed tests because we have directional hypothe- 
ses. A 5% significance level might be considered too lenient, 
given the large sample size, but we believe that this is not the case. 
It is well known that moderator effects are extremely difficult to 
detect in nonexperimental research (McClelland and Judd 1993). 
In simulation studies, McClelland and Judd (1993, p. 379) show 
that "9196 of the simulated field studies made Type II errors by 
failing to reject a false null hypothesis." They argue (p. 387) that 
"enormous samples are required to have the statistical power of 
optimally designed experiments for detecting interactions." 
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TABLE 3 
Descriptive Statistics 





Correlation Matrix 








1 2 3 4 5 6 7 8 9 10 11 12 13 
1 Perceived value 1.000 
2 Utilittarian experience .563 1.000 
3 Pleasure .446 .338 1.000 
4 Arousal 460  .264  .659 1.000 
5 Privacy/security 281 .809 2271  .250 1.000 
6 Customization 540 8.530  .471 432 .402 1.000 
7 Cultural congrulty 011 .016 —030 —019 .015 -—012 1.000 
8 Gender 155  .156 .067 .053 .096  .134 4.003 1.000 
9 Age —.009 —.032 —.042 .047 -.031 -.007 -.015 -.197 1.000 
10 Accessibility 459  .472 4.222 196 .148 .311 014  .078 .001 1.000 
11 Rule of law —.187 -.076 .011 —190 —111 -—108 .114 —144 -.058 -.061 1.000 
12 National Identity —.048  .035 -.056 —.230 .056 022 —.103 .072 -—.141 -.058 -.033 1.000 
13 Individualism —.137 —015 .007 —236 —.083 -.098 .045 —.0832 -.099 -059 .703 -.016 1.000 
M 3.961 3.864 3.783 3.625 3.647 3.742 .951 .746 32.163 38582 1.168 3.521 7.220 
, SD 628  .438 .693 .761 #.642 #2639 .216  .435 12.096 .690 .858 .280 1.448 


Notes: N = 8886 for evaluating pairwise correlations between individual-level variables or between individual- and country-level vanables and 


for means and standard deviations of Individual4tevel variables, N = 23 for evaluating pairwise correlations between country-level 
vanables and for means and standard deviations of country-level variables. 


because the variance is partitioned across different levels. 
We use t-values as a measure of relative effect size (Nijs et 
al. 2001). 

The more positive the utilitarian experience (y; = .421, 
p < .001) and the more pleasurable the experience (Yo = 
.106, p « .001), the higher is the perceived value. We also 
found an inverse U-shaped relationship between arousal and 
perceived value (yo = .119, p < .001; y49 = —.028, p < .01). 
Thus, H; is supported. Contrary to our expectations (Ha, 
and H»,), we do not find support for the notion that per- 
ceived privacy/security protection (Yso = .016, p = .11) and 
cultural congruity (% = .017, p = .30) enhance perceived 
value. However, as we expected (H>,), customization has a 
significant effect on perceived value (Ysg = .166, p < .001). 

The core of our conceptual model pertains to the cross- 
level interactions between individual-difference and 
country-level variables. The positive effect of perceived 
privacy/security protection on perceived value is stronger in 
countries characterized by a weak rule of law than in coun- 
tries characterized by a strong rule of law (ys, = —.094, p < 
01), in support of H4. The effect of perceived privacy/ 
security protection is also larger in individualistic cultures 
than in collectivistic cultures (ys) = .040, p < .01). This is 
consistent with H;. 

H; proposes that the effect of cultural congruity on per- 
ceived value is reinforced when national identity is higher. 
This hypothesis is supported (y; = .198, p < .05). Hs pro- 
poses that the effect of utilitarian experience is smaller in 
individualistic cultures than in collectivistic cultures. This 
hypothesis is not supported (yj; = —012, p = .32). Hg, 
which proposes that the effect of emotional experience is 
larger in individualistic cultures than in collectivistic cul- 
tures, is supported for pleasure (5; = .024, p < .05) but not 
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for arousal (73; = —.001, p = .49). Finally, as we expected, 
the effect of customization is Jarger when national-cultural 
individualism is higher (ye, = .035, p < .05), in support of 
Hg.4 

The main effects of gender and age were not significant. 
However, given the practical importance of these easily 
identifiable consumer characteristics, we reestimated our 
model including all two-way interactions with consumption 
experience and Web site characteristics. Three interactions 
were significant. We found an interaction between gender 
and (arousal)? (y = —.033, p < .05), indicating that males 
have a higher optimum level of arousal than females. This is 
consistent with the literature (Zuckerman 1994). Further- 
more, we found that older people attach less importance to 
pleasure (y = —003, p < .001) and more importance to cus- 
tomization (y = .003, p < .001). 


Discussion 


In this article, we examined whether the antecedents of the 
perceived value that consumers derive from visiting a brand 


4As a robustness check, we reestimated the model for an alter- 
native specification of perceived value, after eliminating Per- 
ceived Value Items 4 and 5, while adding Utilitarian Experience 
Item 3. The results remained substantively the same. The direction 
of all significant coefficients remained the same, but p values were 
often somewhat weaker. This is also because the alternative per- 
ceived value instrument had substantially lower reliability (.65 
versus .75 for our operationalization). Nevertheless, nine of the ten 
hypothesized effects that were significant remained significant in 
the alternative model (the other coefficient attained a p value of 
.13). 


TABLE 4 





Results 
Hypothesized Unstandardized 
Predictor Effect Coefficient t-Value p 
Intercept (Yoo) 4.033 80.2 <.001 
Consumption Experience 
Utilitarian (40) + 421 14.8 <.001 
Pleasure (yoo) + .106 8.3 <.001 
Arousal (yao) + .119 4.2 <.001 
(Arousal) (y49) — —.028 —2.6 «.01 
Web Site Characteristics 
Privacy/secunty (Yso) 4 .016 1.3 n.s. 
Customization (Yao) + .166 7.4 «.001 
Cultural congruity (Y70) + .017 5 n.s. 
Moderating Effect of Country Characteristics 
Rule of law x privacy/security (51) — —.094 —2.7 «.01 
National identity x cultural congruity (yz) + .198 2.0 <.05 
Individualism x utilitarian experience (y) ~ —.012 —5 n.s. 
Individualism x pleasure (ye) + .024 1.8 <.05 
Individualism x arousal (ys4) + —.001 —.1 n.s. 
Individualism x privacy/security (y52) + .040 2.6 <.01 
Individualism x customization (ye) + .035 1.8 «.05 
Covariates 
Gender (0 = male; 1 = female) (yao) .025 1.1 n.s. 
Age (go) —.001 —.4 n.s. 
Accessiblity (400) 177 9.2 <.001 
Rule of law (Y94) —.076 —1.9 n.s. 
National identity (yoo) —.062 —.7 n.s. 
Individualism (Yo3) .016 8 n.s. 


29 site dummies (¥119){¥390) 


Notes: Except for the covanates, all p values are for one-sided tests; n.s = not significant (p > .05). 


manufacturer’s Web site are systematically moderated by 
the country’s institutional context. We developed specific 
hypotheses and tested them across 30 CPG Web sites in 
eight European countries, using large samples of consumers 
from three continents. We find that perceived privacy/ 
security 1s more important in countries characterized by a 
weak rule of law. Perceived privacy/security and customiza- 
tion have a greater effect in individualistic countries than in 
collectivistic countries. Cultural congruity weighs more 
heavily in countries that are high on national identity. We 
find evidence that the emotional experience is more impor- 
tant in shaping perceived value judgments in individualistic 
countries, but this effect holds for pleasure only. Notably, 
existing research on the difference in the impact of emo- 
tions in people’s judgments between individualistic and col- 
lectivistic societies largely concerns pleasure (e.g., Hof- 
stede 2001; Schimmack et al. 2002; Triandis, McCusker, 
and Hui 1990). It appears that, in general, it is not so much 
emotions but rather the pursuit of pleasure that is more 
important in judgment formation in individualistic coun- 
tries. Finally, there is directional evidence that utilitarian 


experience is more important in collectivistic societies, but 
this effect does not reach statistical significance. 

In summary, our findings provide broad support for our 
conceptual model and the relevance of including country- 
level institutional constructs for understanding the per- 
ceived value that consumers derive from visiting a brand 
manufacturer’s Web site. Our results show that the relation- 
ships between perceived Web site value and its drivers are 
systematically and predictably moderated by the institu- 
tional context in which consumers live. The findings of this 
study are important from a theoretical perspective because 
they offer insights into why certain variables do not signifi- 
cantly affect perceived value in some countries; namely, the 
national environment may not be conducive to such rela- 
tionships. It is also important from an applied perspective 
because more and more companies use Web sites to create 
consumer value. 


Managerial Implications 


At the micro level, the most important driver of perceived 
value is the utilitarian experience associated with the site. A 
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company can increase utilitarian experience by providing 
clear, truthful, and new information about the company’s 
products in a way that eases decision making and increases 
decision confidence. Notably, some Web sites of these large 
CPG companies were not able to achieve this, at least not in 
the minds of their visitors. For example, in the follow-up 
in-depth interviews, consumers indicated two Web sites that 
received low scores on utilitarian experience (the con- 
sumers in the qualitative interviews were not informed 
about the results of the quantitative study), both of which 
were operated by major European CPG companies: “This 
site is just an advertisement instead of providing useful con- 
tent,” and “this site packs so many bells and whistles that 
the useful information gets lost.” 

On average, pleasure is the second most important dri- 
ver of perceived value. Additional time and money spent on 
creating sites that generate stronger emotions of pleasure 
are particularly beneficial for sites that are primarily visited 
by people from individualistic countries. Visual material 
appears to be a major component of pleasurable experi- 
ences. A Web site operated by Sara Lee was able to create a 
pleasurable experience by using soft colors and cute anima- 
tions, such as a small flying butterfly. In the follow-up 
qualitative interviews, consumers mentioned that “this site 
is really pleasant; it’s like taking a walk in a fairy-tale 
wood.” Visitors of Danone’s (called Dannon in the United 
States) site explained that they considered the site visit a 
pleasurable experience mainly because of the nice colors 
and the high-quality pictures: “This is a pleasurable site to 
visit because of the high-quality pictures that are being 
used. I consider that to be very important”; and “this site 
looks pretty sunny because of the light colors that are being 
used; it makes one feel sunny too.” 

The third most important driver is customization. Espe- 
cially people living in more individualistic countries gave 
more weight to customization in their perceived value judg- 
ments. Customization can be achieved by allowing the visi- 
tor to adapt the content, or the look and feel, of the site to 
his or her own preferences (e.g., by changing the color 
scheme at Henkel’s site), by allowing the visitor to consult 
an expert (e.g., through a live consult at Procter & Gamble’s 
Olay), or by personally addressing the visitor. After regis- 
tration, visitors of Mars’s Mypetstop can create a personal- 
ized page for each pet they have. They can use this person- 
alized page to keep track of medical records and vet 
appointments and to set up an online scrapbook to post pho- 
tos of their pet so other pet lovers and friends can view 
them. In these personalized pages, the visitor’s pet is always 
referred to by its name rather than by the impersonal “your 
pet.” In-depth interviews with consumers revealed that they 
particularly like the customization offered by the site: “This 
is a very personal site; it feels as if this site is directly talk- 
ing to me and my cat”; and “my cat is not just an animal on 
this site; it is given a personality, and it is addressed by its 
name.” In addition, visitors can use the FoodFinder to find 
out what pet food product would be advisable for their pet 
after they have typed in the pet’s type, breed, age, special 
needs, and whether it is overweight. 


146 / Journal of Marketing, July 2006 


The inverted U-shaped effect of arousal generalized 
across countries. Our qualitative interviews indicated that a 
major way to generate arousal is to engage the Web site 
visitor actively. Two examples are Barilla and Unilever’s 
Life & Cooking. The qualitative interviews indicated the 
following: “The strength of this site is the way its recipe 
section is set up; it is definitely fun to play around with” 
(Barilla); and “I feel excited because the site invites you to 
actively use it by allowing you to collect your own favorite 
recipes into a personal cookery book, create menus for spe- 
cial dinners, e-mail these to yours friends/guests, and print 
out shopping lists” (Life & Cooking). 

The effect of perceived privacy/security protection was 
completely moderated by the country context. Especially 
Web sites oriented to countries in which individualism is 
high and the rule of law is weak should emphasize privacy/ 
security. Our measurement validation check suggests that 
this can be achieved by informing visitors about how the 
site intends to use the visitors' personal information, by 
explicitly indicating that personal data will not be shared 
with third parties and that follow-up e-mails will not be 
sent, and by making it clear to visitors that they can always 
delete the data the site has about them in its marketing data- 
base. Our qualitative interviews further revealed that in 
addition to legally formulated privacy/security statements, a 
well-known brand name can also act as a privacy/security 
signal. Illustrative of this are the following comments, refer- 
ring to Kraft Foods's brand Milka: “I trust an old brand like 
Milka, so I would not hesitate when asked to hand over my 
credit card number"; and "they're a well-known brand, and 
they clearly indicate a telephone number where you can call 
them when you have questions or something goes wrong, so 
why should I worry?" Consumers’ reliance on a reputable 
brand for privacy/security protection is consistent with 
research showing that relying on a well-known brand name 
is a frequently employed risk reduction strategy (Roselius 
1971; see also Sultan et al. 2002) and with economic theory 
suggesting that consumers can plausibly believe in the pro- 
tection offered by a high-reputation brand name (Choi 
1998; Shapiro 1983; Wernerfelt 1988). 

The effect of cultural congruity was also completely 
moderated by the country context. We find that Web sites 
oriented to countries in which national identity is high 
should be adapted to the local context by using local lan- 
guage, symbols, and so forth. The global marketing litera- 
ture posits that completely standardized products often have 
limited market appeal, whereas complete adaptation to the 
local context creates cost inefficiencies and may result in 
products that are not better than completely standardized 
products (e.g., because the wheel is reinvented every time 
and research-and-development expenditures are spread too 
thinly; Kotabe and Helsen 2004). The Internet is a powerful 
instrument to both globalize and localize. Companies can 
use global portals to direct Web site visitors to nationally 
tailored sites. This is consistent with received wisdom in 
global marketing to develop a common “core,” while allow- 
ing adaptation to local circumstances. A case in point is 
Beiersdorf's site for Nivea. Nivea's portal gives its users 
access to more than 40 regional and country sites; they 


share the same look and feel, including Nivea’s signature 
blue colors, and typically feature Innovations, Magazine, 
Advice, and Fun as the main buttons. However, the sites are 
locally adapted with respect to language, the model used 
(e.g., an Asian woman for sites in Japan, a Western woman 
for Western sites), the innovations featured on the country 
portal, and so forth. The Internet offers unrivaled potential 
to adapt Web sites and company communications locally 
around a globally standardized platform, thus reaping the 


efficiencies of a standardized product and the responsive- 
ness of local adaptation (Yip 2003). 


Limitations and Further Research 


Our research has several limitations that offer avenues for 
further research. First, although our study would have been 
difficult to carry out without the close collaboration of 
major CPG companies, this also imposed constraints on the 
design of the study. These companies were interested in 


MEASUREMENT APPENDIX 
eee 


Individua-Level Constructs 


Perceived Value (a = .75) 

1. Taken everything together, | consider this Web site 
useful. 

2. This Web site fell short of my expectations. (R) 

3. | do not intend to bookmark this Web site (add to 
favorites). (R) 

4. | intend to visit this Web site again. 

5. Visiting this Web site was a poor experience. (R) 


Utllitarian Experience (Formative Scale) 
This site ... 
1. Provides clear and truthful Information about products. 
2. Makes It more difficult to decide what to choose. (R) 
3. Allows me to learn useful things | did not know before. 
4. Lowers my confidence to make the right cholces. (R) 
5. Makes it more difficult to fulfill my shopping needs. (R) 
6. Indicates stores where | can get additional information 

or buy the product. 

7. Provides information that is up-to-date. 


Pleasure (a = .70) 
When surfing this site ... 
1. 1 feel pleased. 
2. | feel happy. 


Arousal (o = .70) 
When surfing this site ... 
1. 1 feel stimulated. 
2. | feel excited. 


Perceived Privacy/Security Protection (a = .66) 
1. This site has a rigorous privacy policy. 
2. This site has good protection against fraud.’ 


Perceived Customization (a = .65) 
This site ... 
1. Has Interactive features that flt my needs. 
2. Helps me to evaluate offerings that fit my needs. 
3. Creates the feeling of receiving personalized attention. 


Cultural Congrulty 
= 1 If the site Is adapted to the respondent's culture. 
= 0 If otherwise. 


Accessibility (Formative Scale; Covarlate) 
1. Downloading information from this site Is slow. (R) 
2. It is difficult to find the information | need on this site. 


(R) | 


Country-Level Constructs 





Natlonal Cultural Individualism 
We employed a combination of country scores from the 
work of Hofstede (2001) and Suh and colleagues (1998) 
and converted them into a ten-point scale. 


National Identity (a = .89) 

1. If you could improve your work or living conditlons, 
how willing or unwilling would you be to move outside 
your country? (1 z "very willing"; 5 — *very unwilling") 

2. | would rather be a citizen of my country than of any 
other country in the world. (1 = “agree strongly’; 5 = 
“disagree strongly”) (R) 

3. The world would be a better place If people from other 
countries were more like the people In my country. 

(1 = "agree strongly"; 5 = “disagree strongly") (R) 

4. Generally my country is a better country than most 
other countries. (1 = “agree strongly"; 5 = “disagree 
strongly’) (R) 

5. My country should limit the Import of foreign products 
in order to protect its national economy. (1 = “agree 
strongly”; 5 = “disagree strongly’) (R) 

6. My country’s television should give preference to my 
country’s films and programs. (1 = “agree strongly’; 

5 = “disagree strongly") (R) 


Rule of Law 
We employed the country ratings provided in the work of 
Kaufmann, Kraay, and Zoido-Lobatón (1999). 


— M 
Notes: R = reverse-scored item. With the exception of cultural congruity, all individual-level tems were scored using a five-point scale with the 
following categories: “strongly disagree,” "disagree," “neither agree nor disagree,” "agree," and "strongly agree." 


Country Characteristics and the Perceived Value of Web Sites / 147 


using Web sites as a communication tool for creating per- 
ceived value for consumers. Further research could investi- 
gate other communication objectives (e.g., creating brand 
value) and examine Web sites that offer an (additional) 
online sales channel. 

Second, consistent with the perspective of the partici- 
pating CPG companies, our study takes the perspective of 
the consumer by focusing on the consumption experience 
associated with the Web site visit and perceived Web site 
characteristics. Although the follow-up in-depth interviews 
with consumers and the measurement validation high- 
lighted some of the ways that managers can create the 
desired consumer experiences and perceptions, further 


research could systematically examine what causes Web 
sites to deliver these experiences and perceptions. A 
promising possibility is to content-analyze Web sites more 
extensively and relate objective site characteristics to the 
variables in our conceptual model. 

Third, we assessed the measure of cultural congruity on 
a dichotomous scale. Further research could measure this 
on a continuous scale. Finally, the consumers in our sample 
visited the Web sites on their own accord. This approach 
seemed appropriate because our study is about explaining 
variance in perceived value. Further research might contrast 
perceptions of visitors versus nonvisitors. 


————— —————sssssásÀ——————————————————————————————————————————sg 
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Outstanding Reviewers 

The Journal of Marketing was again the top-rated marketing journal for 
2005, based on the SSCI impact factors, and was ranked second among ail 
business journals. Maintaining this level of quality is only possible with fast, 
constructive, insightful reviews by the Editorial Review Board and ad hoc 
reviewers. The performance of the journal's reviewers for the year 2005/2006 
has been truly outstanding. The average decision turnaround for new submis- 
sions for the most recent quarter was only 52 days, despite submissions to JM 
being at record levels (371 new submissions in 05/06). Among all the out- 
standing Editorial Review Board members, five people stood out as having 
more reviews than average, completing them faster than average, and having 
the highest level of quality. They are the winners of the Best Reviewer Award 
for 2005/2006: 


Tim Ambler (London Business School) 
John Hauser (MIT) 
Donald Lehmann (Columbia) 
Werner Reinartz (INSEAD) 
Barton Weitz (Florida) 


Congratulations to these outstanding reviewers! 





Guest Editors 


The Journal of Marketing thanks the following people for serving as guest 
editors during 2005/2006: 


Ruth Bolton 
Thomas Kinnear 
Robert Lusch 
David Stewart 
P. Rajan Varadarajan 


The journal is also pleased to announce 20 new appointments to the Edito- 
rial Review Board. The new appointees are: 


Joseph Alba (Florida) Koen Pauwels (Dartmouth) 
Kwaku Atuahene-Gima (CEIBS) Robert Peterson (Texas) 
William Bearden (South Carolina) Vithala Rao (Cornell) 
Stephen P. Brown (Houston) Aric Rindfleisch (Wisconsin) 
Joseph Cannon (Colorado State) Fred Selnes (Norwegian School of 
Yuxin Chen (NYU) Management) 
Inge Geyskens (Tilburg) Rebecca Slotegraaf (Indiana) 
Wendy Moe (Maryland) Jacqueline Thomas (Northwestern) 
John Mowen (Oklahoma State) Craig Thompson (Wisconsin) 
Eitan Muller (NYU & Tel Aviv) Kenneth Wathne (Wisconsin) 


Richard Netemeyer (Virginia) 


Xueming Luo & C.B. Bhattacharya 


Corporate Social Responsibility, 
Customer Satisfaction, and Market 
Value 


Although prior research has addressed the influence of corporate social responsibility (CSR) on perceived 
customer responses, it is not clear whether CSR affects market value of the firm. This study develops and tests a 
conceptual framework, which predicts that (1) customer satisfaction partially mediates the relationship between 
CSR and firm market value (i.e., Tobin's q and stock return), (2) corporate abilities (innovativeness capability and 
product quality) moderate the financial returns to CSR, and (3) these moderated relationships are mediated by 
customer satisfaction. Based on a large-scale secondary data set, the results show support for this framework. 
Notably, the authors find that in firms with low innovativeness capability, CSR actually reduces customer satisfaction 
levels and, through the lowered satisfaction, harms market value. The uncovered mediated and asymmetrically 


moderated results offer important implications for marketing theory and practice. 


social responsibility (CSR) represents a high-profile 

notion that has strategic importance to many companies. 
As many as 9096 of the Fortune 500 companies now have 
explicit CSR initiatives (Kotler and Lee 2004; Lichtenstein, 
Drumwright, and Bridgette 2004). According to a recent 
special report in BusinessWeek (Berner 2005, p. 72), large 
companies disclosed substantial investments in CSR initia- 
tives (i.e., Target's donation of $107.8 million in CSR repre- 
sents 3.6% of its pretax profits, General Motors's donation 
of $51.2 million represents 2.796 of its pretax profits, Gen- 
eral Mills's donation of $60.3 million represents 3.296 of its 
pretax profits, Merck's donation of $921 million represents 
11.3% of its pretax profits, and Hospital Corporation of 
America's donation of $926 million represents 43.396 of its 
pretax profits). By dedicating ever-increasing ‘amounts to 
cash donations, in-kind contributions, cause marketing, and 
employee volunteerism programs, companies are acting on 
the premise that CSR is not merely the “right thing to do” 
but also “the smart thing to do” (Smith 2003, p..52). 

Importantly, along with increasing media coverage of 
CSR issues, companies themselves are also taking direct 
and visible steps to communicate their CSR initiatives to 


i: today’s competitive market environment, corporate 


Xueming Luo is Assistant Professor of Marketing, Department of Market- 
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tacharya is Associate Professor of Marketing, Department of Marketing, 
School of Management, Boston University (e-mail: cb@bu.edu). The 
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Xiaochu Yu for their assistance with data collection and analysis. They 
also thank the anonymous JM reviewers, Guido Berens, Donald Lichten- 
stein, Femado Jaramllo, and seminar participants at the University of 
Texas at Arlington for their constructive and insightful comments on previ- 
ous versions of this article. 
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various stakeholders, including consumers. A decade ago, 
Drumwright (1996) observed that advertising with a social 
dimension was on the rise. The trend seems to continue. 
Many companies, including the likes of Target and Wal- 
Mart, have funded large national ad campaigns promoting 
their good works. The October 2005 issue of InStyle maga- 
zine alone carried more than 25 “cause” advertisements. 
Indeed, consumers seem to be taking notice; whereas in 
1993, only 26% of people surveyed by Cone Communica- 
tions could name a company as a strong corporate citizen, 
by 2004, the percentage surged to as high as 80% (Berner 
2005). 

Motivated, in part, by this mounting importance of CSR 
in practice, several marketing studies have found that social 
responsibility programs have a significant influence on sev- 
eral customer-related outcomes (Bhattacharya and Sen 
2004). More specifically, on the basis of lab experiments, 
CSR is reported to affect, either directly or indirectly, con- 
sumer product responses (Brown 1998; Brown and Dacin 
1997), customer-company identification (Sen and Bhat- 
tacharya 2001), customer donations to nonprofit organiza- 
tions (Lichtenstein, Drumwright, and Bridgette 2004), and, 
more recently, customers' product attitude (Berens, Van 
Riel, and Van Bruggen 2005). 

Although this stream of research has contributed a great 
deal of insight, there is still a limited understanding of 
whether and how CSR affects financial outcomes of the 
firm, such as its market value. Yet it is important to evaluate 
CSR's impact on market value (i.e., stock-based firm per- 
formance) because a firm's financial health is the ultimate 
test for the success or failure of any strategic initiative. 
Moreover, prior laboratory studies and anecdotal examples 
are yet to be complemented with a large-scale analysis 
using secondary data. Indeed, Brown and Dacin (1997, p. 
80) urgently call for research on “how societally oriented 
activities might bring about positive outcomes for the firm.” 
Echoing this, Berens, Van Riel, and Van Bruggen (2005) 
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energetically call for research efforts that directly link CSR 
to stock market performance. 

Our research responds to this call by investigating the 
linkage between CSR and firm market value with a longitu- 
dinal, archival data set. In keeping with contingent linkages 
between CSR and consumer responses that prior 
researchers articulated (see, e.g., Bhattacharya and Sen 
2004), we do not predict a simple, unconditional relation- 
ship between CSR and market value. This is because firms 
are not the same in executing, supporting, and exploiting 
CSR initiatives in the marketplace (Brown 1998; Sen and 
Bhattacharya 2001). Specifically, companies may generate 
different (i.e., positive, nonsignificant, and negative) market 
returns from CSR under different conditions. For example, 
Starbucks's superior brand equity and its successful CSR 
initiatives with the charity agency CARE are due, at least in 
part, to its superior product quality, innovative skills, and 
ability to obtain and sustain customer satisfaction over time. 
In contrast, many companies find that CSR results in nega- 
tive financial returns because of the added costs of making 
extensive charitable contributions and the diverted attention 
from improving product quality that would have allowed 
them to better satisfy customer needs and wants (McGuire, 
Sundgren, and Schneeweis 1988; Sen and Bhattacharya 
2001). Thus, the research questions in this study are as fol- 
lows: (1) Under what conditions do CSR initiatives result in 
positive financial performance? and (2) Does customer sat- 
isfaction matter in the relationship between CSR and firm 
performance? 

To address these questions, we develop and test a con- 
ceptual model that proposes that CSR initiatives enable 
firms to build a base of satisfied customers, which in turn 
contributes positively to market value. Specifically, we pre- 
dict that customer satisfaction partially mediates the rela- 
tionship between CSR and market value. Although extant 
marketing literature has addressed the direct impact of cus- 
tomer satisfaction on firm shareholder value (e.g., Ander- 
son, Fornell, and Mazvancheryl 2004; Fornell et al. 2006), 
the mediating role of customer satisfaction in the financial 
contribution of CSR has been ignored. In this study, we 
explicitly theorize this role and argue that building cus- 
tomer satisfaction represents part of the underlying mecha- 
nism through which the financial promises of CSR are 
capitalized. 

Furthermore, we explore the boundary conditions under 
which firms may derive positive or negative market value 
from CSR. Drawing on various theoretical bases, we argue 
that firms that have better inside-out corporate abilities (.e., 
product quality and innovativeness) to begin with tend to 
generate more market value from outside-in strategic initia- 
tives (i.e., CSR programs). Conversely, firms that exhibit 
poorer corporate abilities may find that CSR actually harms 
customer satisfaction and, because of the lowered satisfac- 
tion, decreases stock performance. 

Based on multiple secondary data sets that comprise rat- 
ings of large companies, the results show support for the 
CSR — customer satisfaction — firm market value causal 
linkage. In addition, we find that a proper combination of 
external CSR initiatives and internal corporate abilities can 
lead to synergistic returns. However, the data also reveal a 
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previously neglected "dark side" of CSR. That is, CSR 
actually reduces customer satisfaction levels in firms with 
low innovativeness capability and, through this negative 
impact, harms firm market value. The uncovered mediated 
and asymmetrically moderated results suggest a more 
nuanced understanding of the financial returns to CSR for 
both practitioners and marketing researchers. 


Conceptual Framework and 
Hypotheses 


CSR and Market Value 


Broadly defined, CSR is a company's activities and status 
related to its perceived societal or stakeholder obligations 
(Brown and Dacin 1997; Sen and Bhattacharya 2001; 
Varadarajan and Menon 1988). Although studies in strategy 
and finance have explored the relationship between CSR 
actions and firm performance, empirical evidence to date 
has been rather conflicting (for a review, see Orlitzky, 
Schmidt, and Rynes 2003; Pava and Krausz 1996). For 
example, the returns to CSR are found to be positive in 
some studies (e.g., Fombrun and Shanley 1990; Soloman 
and Hansen 1985) but negative in others (e.g., Aupperle, 
Carroll, and Hatfield 1985; McGuire, Sundgren, and 
Schneeweis 1988). Thus, Margolis and Walsh (2003, p. 
277) conclude that the relationships between CSR and 
financial performance are decisively "mixed." 

There are at least two explanations for these conflicting 
findings. First, existing studies have largely related CSR to 
backward-looking firm profitability (1.e., accounting-based 
return on investment) but not to forward-looking firm mar- 
ket value (i.e., stock-based Tobin's q). Theoretically, how- 
ever, market value is different from (and perhaps more 
important than) return on investment because "accounting 
measures are retrospective and examine historical perfor- 
mance. In contrast, the market value of firms hinges on 
growth prospects and sustainability of profits, or the 
expected performance in the future" (Rust, Lemon, and 
Zeithaml 2004, p. 79). Second, the equivocal link between 
CSR and firm performance may be due, in part, to extant 
strategy and finance literature having largely omitted the 
underlying processes or contingency conditions that may 
explain the range of observed relationships (Sen and Bhat- 
tacharya 2001). 

We precisely examine these research issues in this 
study. In particular, as we show in Figure 1, our framework 
proposes that the relationship between CSR and firm mar- 
ket value is better understood by the mediating link of cus- 
tomer satisfaction. In recent times, scholars (e.g., Anderson, 
Fornell, and Mazvancheryl 2004; Fornell et al. 2006) have 
demonstrated the positive relationship between customer 
satisfaction and market value. We build on this literature 
and institutional theory to propose that CSR is a driver of 
customer satisfaction and that the CSR-firm market value 
linkage exists (at least partially) because of the underlying 
process through customer satisfaction. In addition, drawing 
on work in the area of corporate identity and associations 
(e.g., Brown and Dacin 1997), we posit that a firm's corpo- 
rate abilities (i.e., product quality and innovativeness capa- 
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bility) moderate the relationship between CSR, and market 
value. Finally, we expect that customer satisfaction medi- 
ates, at least partially, these moderated relationships. 


CSR and Customer Satisfaction 


Customer satisfaction is defined as an overall evaluation 
based on the customer's total purchase and consumption 
experience with a good or service over time: (Anderson, 
Fornell, and Mazvancheryl 2004; Fornell 1992). In the mar- 
keting literature, customer satisfaction has been recognized 
as an important part of corporate strategy (Fornell et al. 
2006) and a key driver of firm long-term profitability and 
market value (Gruca and Rego 2005). 

Why should a firm's CSR initiatives lead to greater cus- 
tomer satisfaction? At least three research streams point to 
such a link: First, both institutional theory (Scott 1987) and 
stakeholder theory (Maignan, Ferrell, and Ferrel] 2005) 
suggest that a company's actions appeal to the multidimen- 
sionality of the consumer as not only an economic being but 
also a member of a family, community, and country (Han- 
delman and Arnold 1999). Building on this, Daub and 
Ergenzinger (2005) propose the term "generalized cus- 
tomer" to denote people who are not only customers who 
care about the consumption experience but also actual or 


potential members of various stakeholder groups that com- 
panies need to consider. Viewed in this way, such general- 
ized customers are likely to be more satisfied by products 
and services that socially responsible firms (versus socially 
irresponsible counterparts) offer. 

Second, a strong record of CSR creates a favorable con- 
text that positively boosts consumers' evaluations of and 
attitude toward the firm (Brown and Dacin 1997; Gürhan- 
Canli and Batra 2004; Sen and Bhattacharya 2001). Specifi- 
cally, recent works on customer-company identification 
(Bhattacharya and Sen 2003, 2004) suggest that CSR initia- 
tives constitute a key element of corporate identity that can 
induce customers to identify (i.e., develop a sense of con- 
nection) with the company. Indeed, Lichtenstein, 
Drumwright, and Bridgette (2004, p. 17) note that “a way 
that CSR initiatives create benefits for companies appears to 
be by increasing consumers’ identification with the corpora- 
tion ... [and] support for the company.” Not surprisingly, 
identified customers are more likely to be satisfied with a 
firm's offerings (e.g., Bhattacharya, Rao, and Glynn 1995, 
Bhattacharya and Sen 2003). 

The third literature stream that enables us to relate CSR 
to customer satisfaction examines the antecedents of cus- 
tomer satisfaction. For example, perceived value is a key 
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antecedent that has been empirically shown to promote cus- 
tomer satisfaction (Fornell et al. 1996; Mithas, Krishnan, 
and Fornell 2005b). In our context, all else being equal, cus- 
tomers likely derive better perceived value and, conse- 
quently, higher satisfaction from a product that is made by a 
socially responsible company (i.e., added value through 
good social causes). Furthermore, engaging in CSR may 
allow firms to understand their generalized customers better 
and thus improve their customer-specific knowledge (Sen 
and Bhattacharya 2001). Because improving customer 
knowledge represents another antecedent that has been 
found to enhance customer satisfaction (Jayachandran et al. 
2005; Mithas, Krishnan, and Fornell 20052), we believe that 
CSR initiatives may help promote customer satisfaction. 


H,: All else being equal, firms that are viewed more favorably 
for their CSR initiatives enjoy greater customer 
satisfaction. 


The Mediating Role of Customer Satisfaction 


The existing marketing literature shows accumulating evi- 
dence for the influence of customer satisfaction on firm mar- 
ket value. For example, firms with satisfied customers tend 
to enjoy greater customer loyalty (e.g., Bolton and Drew 
1991: Oliver 1980), positive word of mouth (Szymanski and 
Henard 2001), and customer's willingness to pay premium 
prices (Homburg, Koschate, and Hoyer 2005), all of which 
can increase a firm's market value. Indeed, several studies 
find that firms with higher levels of customer satisfaction 
are able to achieve higher levels of cash flows (e.g., Gruca 
and Rego 2005; Fornell 1992; Mittal et al. 2005) and less 
volatility of future cash flows, thus leading to superior mar- 
ket value (e.g., Anderson, Fornell, and Mazvancheryl 2004; 
Fornell et al. 2006; Srivastava, Shervani, and Fahey 1998). 

In linking this evidence for the influence of customer 
satisfaction on firm market value with our first hypothesis 
on the influence of CSR on satisfaction, a mediating role of 
customer satisfaction in the CSR-performance linkage 
might logically be expected. That is, CSR affects customer 
satisfaction, which in turn affects market value. In other 
words, customer satisfaction represents the mediational 
pathway through which CSR actions affect firm market 
value. 

However, there may be “noncustomer routes” by which 
CSR affects market value. For example, both textbooks 
(e.g., Kotler and Lee 2004; Pava and Krausz 1996) and aca- 
demic articles (e.g., Godfrey 2005; Margolis and Walsh 
2003) have pointed to the impact of CSR on multiple stake- 
holders, such as employees and investors as well as con- 
sumers. In particular, positive “moral capital” as a result of 
CSR (Godfrey 2005, p. 777) could directly affect market 
value by improving employee morale and productivity. In 
addition, CSR creates public goodwill (Houston and John- 
son 2000; McGuire, Sundgren, and Schneeweis 1988), 
which provides an “insurance-like” protection to share- 
holder wealth. As a consequence, putting the pieces 
together, we predict a partially mediating role of customer 
satisfaction on the impact of CSR on market value. 


H»: All else being equal, firms that are viewed more favorably 
for their CSR initiatives enjoy higher market value, and a 
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firm's customer satisfaction level at least partially medi- 
ates this influence of CSR on market value. 


The Moderating Role of Corporate Abilities 


In this section, we argue that the relationship between CSR 
and firm market value may not be universally positive but 
rather contingent on several boundary conditions. That 1s, a 
positive or negative relationship may be observed, depend- 
ing on the levels of corporate abilities. In general, corporate 
abilities refer to various elements of a firm's expertise and 
competency, such as the ability to improve the quality of 
existing products/services and the ability to generate new 
products/services innovatively (Gatignon and Xuereb 1997; 
Rust, Moorman, and Dickson 2002, Zeithaml 2000). 
According to Brown and Dacin (1997), a company's CSR 
and corporate abilities both influence customers' percep- 
tions of the company's products. 

We expect that firms with low levels of corporate abili- 
ties (i.e., low levels of innovativeness capabilities and prod- 
uct quality) generate negative market value from CSR for 
several reasons. On the basis of institutional theory, Handel- 
man and Arnold (1999) contend that companies should 
engage in CSR with good causes (for the social aspect of 
legitimation) and, at the same time, provide a good product 
(for the pragmatic aspect of legitimation). Thus, it is likely 
that CSR initiatives fail to generate a favorable impact if the 
firm is perceived as less innovative and as offering poor- 
quality products (i.e., due to a lack of pragmatic legitima- 
tion; see DiMaggio and Powell 1983). Indeed, Sen and 
Bhattacharya (2001) show that CSR initiatives may even 
backfire with reduced purchase intent and negative percep- 
tions if consumers believe that CSR investments are at the 
expense of developing corporate abilities, such as product 
quality and innovativeness (i.e., investments represent “mis- 
guided priorities” on the part of the firm with low levels of 
corporate abilities). More important, consumers may make 
negative and detrimental attributions regarding a firm’s 
motives if a low-innovativeness or low-product-quality firm 
engages in social responsibility. This would ultimately 
result in an unattractive corporate identity and, thus, nega- 
tive market returns by virtue of negative word of mouth and 
detrimental customer complaints (Brown 1998; Varadarajan 
and Menon 1988).! 

Conversely, we predict that firms with high levels of 
corporate abilities generate positive market value from 
CSR. Such firms tend to posses better corporate image and 
more attractive identities with which consumers want to 





1Our focus here 1s on the moderating role of corporate ability in 
the CSR-performance link rather than on the direct relationship 
between CSR and corporate ability. That is, we do not investigate 
whether CSR directly affects or is related to innovativeness and 
product quality (i.e., corporate ability-related constructs) given the 
conflicting literature. On the one hand, Brown and Dacin (1997, p. 
68, emphasis added) contend that "CSR associations are often 
unrelated to the company's abilities in producing goods and ser- 
vices" On the other hand, it 1s possible that a firm's innovation is 
CSR onented (e.g., environmentally responsible packaging), and 
CSR initiatives may affect product-quality perceptions. 


identify (Bhattacharya and Sen 2003). When coupled with 
high corporate abilities, a firm’s CSR actions are more 
likely to generate favorable attributions and consumer iden- 
tification. This would ultimately promote performance- 
enhancing behaviors, such as customer loyalty (Bhat- 
tacharya and Sen 2004). Indeed, if a firm can accommodate 
customers and other stakeholders and meet different sets of 
norms (e.g., pragmatic and social norms) by not merely 
executing CSR initiatives but also developing strong corpo- 
rate abilities to support and exploit these CSR actions, it is 
in a better position to win the social contract, institutional 
allegiance, moral legitimacy, and consumers’ support for 
the organization (cf. Handelman and Arnold 1999, p. 34; 
Scott 1987). Taken together, these beneficial effects suggest 
a positive market return to CSR for firms with high levels of 
corporate abilities. Therefore, we propose an asymmetric 
moderating effect of corporate abilities on the association 
between CSR and firm market value. 

H3: Corporate abilities G.e., product quality and innovative- 
ness capability) moderate the relationship between CSR 
and market value. The relationship will be negative for 
firms with low corporate abilities but will be positive for 
firms with high corporate abilities. 


The Mediating Role of Customer Satisfaction In 
the Moderated Relationships 


Finally, as we have argued, part of the mechanism by which 
CSR actions influence a firm’s market value is customer 
satisfaction. Thus, it is conceivable that the positive impact 
of CSR on firms with high leveis of corporate abilities 
enhances the level of customer satisfaction, which then 
leads to enhanced market value (Anderson, Fornell, and 
Mazvancheryl 2004; Brown and Dacin 1997; Sen and Bhat- 
tacharya 2001). 

On the contrary, for firms that are low in corporate abil- 
ity (i.e., they are neither innovative nor competent in prod- 
uct quality), CSR actions may not be able to generate much 
institutional legitimacy, customer-company identification, 
or customer satisfaction (Scott 1987). As a result, CSR ini- 
tiatives may relate little to financial results and market value 
(e.g., Margolis and Walsh 2003; Mithas, Krishnan, and For- 
nell 2005b) in firms with low levels of corporate abilities. 
Thus: 


Hy: A firm's customer satisfaction at least partially mediates 
the moderated relationship among CSR, corporate abili- 
ties (1e., product quality and innovativeness capability), 
and market value. 


Data and Variable Construction 


In this section, we describe the secondary data that we col- 
lected to test the hypotheses. We also present the construc- 
tion of the variables, such as CSR, corporate abilities, cus- 
tomer satisfaction, and market value. In Table 1, we report 
the variables, their definitions, and data sources. We col- 
lected data for the publicly traded Fortune 500 companies 
from multiple archival sources: COMPUSTAT, Fortune 
America’s Most Admired Corporations (FAMA), the 
American Customer Satisfaction Index (ACSI), Competi- 


tive Media Reporting (CMR), and Center for Research in 
Security Prices (CRSP). 


Measuring CSR 


One approach to measuring market perceptions of firms’ 
CSR initiatives is to rely on the amount of CSR investments 
disclosed in firms’ annual reports to shareholders. However, 
there are many important doubts about the validity of the 
announced CSR investments, despite the seeming attrac- 
tiveness of this approach. For example, there is a lack of 
consensus on what should be included (or excluded) in CSR 
investments (Margolis and Walsh 2003; Orlitzky, Schmidt, 
and Rynes 2003; Tsoutsoura 2004). Few companies have 
their announced CSR investments audited or validated 
externally by third parties. Thus, some firms may overreport 
CSR investments for impression management (1.e., exagger- 
ating their giving). Other firms may underreport CSR 
investments because they may regard CSR investments only 
as donated cash or in-kind products and services (excluding 
investments that benefit the environment and their employ- 
ees). Furthermore, although some external sources (e.g., 
100 best corporate citizens by Business Ethics, 
csrwire.com, Social Responsibility Initiative reports) may 
track companies’ CSR investments objectively, the nature 
and amount of CSR investments for the same firm can 
change dramatically from one source to another (Berner 
2005; Fombrum and Shanley 1990; Margolis and Walsh 
2003). 

Therefore, we turn to subjective measures of CSR. 
Although some studies use small-scale survey data with a 
limited set of firms (e.g., Christmann 2000), prior research 
suggests the use of a more comprehensive, large-scale sur- 
vey data set available from FAMA to measure CSR 
(McGuire, Sundgren, and Schneeweis 1988). More specifi- 
cally, in ranking the United States’ most admired corpora- 
tions each year, FAMA polls more than 10,000 financial 
analysts, senior executives, and Wall Street investors from 
more than 580 large companies (see Fortune 2005, p. 68). 

For each firm-year observation, FAMA collects ratings 
of CSR that have been made on an interval scale ranging 
from 0 to 10, with 10 as the highest; the ratings represent a 
comparison among major competing companies in a given 
industry. Studies in both marketing and strategy (e.g., Fom- 
brum and Shanley 1990; Houston and Johnson 2000; 
McGuire, Sundgren, and Schneeweis 1988) have reported 
evidence of reliability and validity of this data source. In 
particular, McGuire, Schneeweis, and Branch (1990, p. 
170) note, “Fortune reputation is one of the most compre- 
hensive and widely circulated surveys of attributes avail- 
able. Both the quality and number of respondents are com- 
parable or superior to the ‘expert panels’ usually gathered 
for such purposes.” Houston and Johnson (2000, p. 12) also 
acknowledge it as the "best secondary" data source. 

Prior research has shown that there is a reverse-causality 
concern between CSR and financial performance (e.g., 
McGuire, Sundgren, and Schneeweis 1988). That is, a 
firm's CSR affects its future performance, and a firm's his- 
tory of financial performance contributes to its current CSR 
involvement. We accommodate this concern by using the 
approach that Roberts and Dowling (2002) recommend. In 
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TABLE 


Variables and Data Sources 


ee 


Variables Definitions; Measures 


CSR Broadly defined as a company's activities 
and status related to Its perceived societal 
or stakeholder obligations; /atent variable 
Indicated by CSR scores in 2001 
(published in 2002), 2002 (published In 
2003), and 2003 (published in 2004). 


Customer Defined as an overall evaluation of the 
satisfaction postconsumption experience of products or 
services In the minds of customers; /atent 
variable indicated by customer satisfaction 
scores in 2002, 2008, and 2004. 


Product quality Defined as the minimum condition or the 
threshold of product attributes that a firm 
must meet when offering its products or 
service in competitive markets; /atent 
varlable indicated by quality of 
products/services scores in 2001, 2002, 
and 2003. 


Innovativeness Defined as a firm’s ability to apply its Internal 
capability knowledge stock to produce new 
technology, new products/services, and 
other new fronts; /atent variable indicated 
by quality of products/services scores In 
2001, 2002, and 2003. 


Tobin's q Stock price-based measure of firm market 
. Value; observed varlable based on the 
average of Tobin's q In 2002, 2003, and 
2004. 


Stock return Stock price-based measure of firm market 
value; observed variable based on the 
average of stock return in 2002, 2003, and 


1 
Secondary Data Sources Data Types 

FAMA Interval from O to 10 
ACSI Interval from O to 100 
FAMA Interval from O to 10 
FAMA Interval from O to 10 
CRSP Ratio 

COMPUSTAT 
CRSP Ratio 

COMPUSTAT 


2004 


particular, we regress CSR scores against firm financial per- 
formance (return on assets [ROA]) in the prior four years 
and save the residual of this regression as the final measure 
of CSR. Because this residual is independent from financial 
performance, the reverse-causality bias is no longer a 
concern. 

Following the work of Cho and Pucik (2005), we used 
the ratings of CSR for each firm in 2001, 2002, and 2003 
(but published in 2002, 2003, and 2004, due to a one-year 
lag in print) as three separate indicators of the latent con- 
struct of CSR.2 This approach of using measurement items 
with different time frames is also widely applied in the 
strategy (e.g., Li and Atuahene-Gima 2001) and personal- 





2Cho and Pucik (2005) find strong support (construct and 
criterion-related validity) for using multiyear ratings from Fortune 
magazine as indicators of the underlying latent variable McGuire, 
Sundgren, and Schneeweis (1988) also employ single-year CSR 
ratings from Fortune as the measure of CSR. 
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selling and psychology literature streams (Bluedorn 1982; 
Boles, Johnston, and Hair 1997; Johnston et al. 1990; Nete- 
meyer, Maxham, and Pullig 2005). 


Measuring Corporate Abilitles 


We do not view corporate abilities simply as a unidimen- 
sional construct. Instead, we consider two specific corpo- 
rate abilities: product quality and innovativeness capability 
(Gatignon and Xuereb 1997; Rust, Moorman, and Dickson 
2002; Zeithaml 2000). In our view, both innovativeness and 
product quality can represent the dimensions of corporate 
ability that Brown and Dacin (1997) propose. Although 
product quality refers to a firm's ability to "exploit" the 
capabilities of products already in the marketplace (Cho and 
Pucik 2005; March 1991), innovativeness represents a 
firm's ability to *explore" new market possibilities in terms 
of developing new products (Kim and Mauborgne 1997; 
Kleinschmidt and Cooper 1991). In addition, commitment 
to the quality of existing products is essential for keeping a 
firm's current customers happy, whereas innovation is 


essential for reaching new customer bases and catering to 
ever-changing customer needs. 

Formally, product quality can be defined as the mini- 
mum condition or the threshold of product attributes that a 
firm must meet when offering its products/services in 
competitive markets (Rust, Moorman, and Dickson 2002; 
Vargo and Lusch 2004; Zeithaml, Parasuraman, and Berry 
1990). Prior studies have established that a firm’s ability to 
provide a superior product/service quality is critical for its 
long-term survival and success (e.g. Buzzell, Gale, and 
Sultan 1975; Mittal et al. 2005; Rust, Moorman, and Dick- 
son 2002). 

In a similar fashion to CSR, we measure product quality 
by FAMA ratings in 2001, 2002, and 2003 (published in 
2002, 2003, and 2004) as the underlying indicators. Again, 
because of the reverse causality between finaricial perfor- 
mance and FAMA ratings, we control for this bias and 
obtain clean measures for product quality and innovative- 
ness capability by employing the same residual approach as 
in the case of CSR (e.g., Roberts and Dowling 2002). 

Innovativeness capability is a firm's ability to apply its 
internal knowledge stock to produce new technology, new 
products/services, and other new fronts (Drucker 1993; 
Griffin and Hauser 1996). According to exploration learn- 
ing theory (March 1991), innovation is also critical for the 
survival and success of organizations because dynamic mar- 
kets constantly shake out the players that lack capabilities 
to explore new market opportunities (Gatignon and Xuereb 
1997; Schumpeter 1934). Similar to product quality, we 
measure the latent variable of a firm's innovativeness capa- 
bility by using its Fortune ratings in 2001, 2002, and 2003 
from FAMA (published in 2002, 2003, and 2004) as three 
separate indicators underlying this construct. Prior research 
has employed this data source to measure companies' inno- 
vativeness capability (Cho and Pucik 2005). 


Measuring Customer Satisfaction 


We used the ACSI database to measure customer satisfac- 
tion. In the marketing literature, the ACSI has been shown 
to be a reliable source of measuring customer satisfaction. 
Several studies employ this database to assess overall cus- 
tomer satisfaction of total purchase and consumption 
experience at the firm level (e.g., Anderson, Fornell, and 
Mazvancheryl 2004; Fornell et al. 2006; Gruca and Rego 
2005; Mithas, Krishnan, and Fornell 2005b; Mittal et al. 
2005). The National Quality Research Center at the Univer- 
sity of Michigan developed and maintains the ACSI data 
set. It has data for nearly 200 Fortune large companies that 
span all major economic sectors and constitute approxi- 
mately 4396 of the U.S. economy. To obtain 'ACSI data, 
more than 50,000 household consumers (actual product 
users) of these large firms are polled on a quarterly basis. 
Each valid respondent has passed screening questions 
related to predefined purchase and consumption periods. 
The ACSI uses an interval scale ranging from 0 to 100, with 
100 as the highest level of customer satisfaction. 

Based on multi-item, multiconstruct criteria, the ACSI 
is a reliable data source because it employs the same survey 
questionnaire, random sampling, and estimation modeling 


across firms and years (Fornell et al. 1996; Fornell et al. 
2006; Mithas, Krishnan, and Fornell 2005b). A comprehen- 
sive test of the validity and reliability of this satisfaction 
measure can be found in the work of Fornell and colleagues 
(1996). Parallel to CSR, innovativeness capability, and 
product quality, we treat customer satisfaction as a latent 
variable and measure it using its ACSI ratings in 2002, 
2003, and 2004 as three separate indicators. 


Measuring Market Value 


We have two separate measures of market value at the firm 
level across years: Tobin's q and stock return. We follow 
prior marketing studies (Lee and Grewal 2004; Rao, Agar- 
wal, and Dahlhoff 2004) to calculate Tobin's q for each 
firm-year observation.? In addition, following Jacobson and 
colleagues (i.e., Aaker and Jacobson 1994, 2001; Mizik and 
Jacobson 2003), we derive the measure of stock return 
using the COMPUSTAT and CRSP databases.4 Rather than 
using a simple year-end stock price, we use a more con- 
servative measure of stock price—that is, the average of the 
end of the four quarters of stock prices—when calculating 
Tobin's q and stock return (Lee and Grewal 2004). We then 
use the derived three-year average (2002, 2003, and 2004) 
of Tobin's q and stock return as observed measures for mar- 
ket value. Compared with market value, the predicting 
variables of CSR, innovativeness capability, and product 
quality were all lagged by one year to be more precise on 
the specific direction of causality and to reduce the possibil- 
ity of endogeneity bias (Murthi, Srinivasan, and Kalya- 
naram 1996; Rust, Moorman, an Dickson! 2002). 


Measuring Control Varlables 


We obtained the data for control variables such as research- 
and-development (R&D) intensity, firm size, competition 
intensity, and ROA from COMPUSTAT, and we obtained 
the data for advertising intensity from CMR. More specifi- 
cally, R&D intensity is the ratio of R&D spending to total 
assets. We control for the influence of R&D expenditures 
on performance because a firm's R&D intensity enhances 
innovation activities and investors' evaluations of the firm 
(Chauvin and Hirschey 1993; Gruca and Rego 2005; 
McGuire, Sundgren, and Schneeweis 1988). 

Advertising intensity is the ratio of reported advertising 
spending to total assets. Because COMPUSTAT has many 
missing data points for firm advertising expenditure, we use 


3Rao, Agarwal, and Dahlhoff (2004, p. 130) provide a detailed 
function on how to derive Tobin's q. That is, q = (share price x 
number of common stock outstanding -- liquidating value of the 
firm's preferred stock + short-term liabilities — short-term assets + 
book value of long-term debt)/book value of total assets. 

4In particular, Aaker and Jacobson (2001, p. 489) and Mizik and 
Jacobson (2003, p. 71) suggest a detailed function on how to cal- 
culate stock return. That is, stock return = (current year's share 
price x number of common stock outstanding + dividends — previ- 
ous year's share price x number of common stock outstanding)/ 
(previous year’s share price x number of common stock 
outstanding). 
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the CMR database for advertising-spending data (Rao, 
Agarwal, and Dahlhoff 2004). We control for the influence 
of advertising expenditures on performance because intense 
advertising promotes customer awareness, brand equity, and 
sales revenues (e.g., Joseph and Richardson 2002; Morgan 
and Rego 2006). 

Firm size is the log of number of employees. We control 
for the influence of firm size because large firms may have 
more resources and thus enjoy economies of scale, but 
small firms may have higher strategic flexibility when seek- 
ing entrepreneurial rents (Dutta, Narasimhan, and Rajiv 
1999; Rao, Agarwal, and Dahlhoff 2004). 

Strategic focus is the number of business segments in 
which the firm operates (Rao, Agarwal, and Dahlhoff 
2004). This variable is available directly from the menu 
choice at the Compact Disclosure (CD-ROM), which 
defines it as “the number of unique business segments of an 
individual company.” We control for this influence because 
of possible diversification effects. That is, more diversified 
firms may have a faster asset turnover rate and exhibit 
economies of scope. However, highly diversified firms may 
lack focus in the highly segmented, competitive market- 
place and thus experience negative returns (Fombrum and 
Shanley 1990; Gruca and Rego 2005). 

We measure competition intensity by using the Herfind- 
ahl concentration index, derived from COMPUSTAT. Fol- 
lowing prior work (Gruca and Rego 2005; Mithas, Krish- 
nan, and Fornell 200523), we calculate this concentration 
index at the primary four-digit industry level of Standard 
Industrial Classification codes (which has been replaced by 
the North American Industry Classification System) for 
each firm-year observation. We use this covariate to control 
for impact of industry competition level (Rao, Agarwal, and 
Dahlhoff 2004). 

Finally, we control for the influence of ROA in predict- 
ing stock return and Tobin's q (Chauvin and Hirschey 
1993). In particular, we measure ROA as the ratio of net 
income after extraordinary items to book value of total 
assets, derived from COMPUSTAT. We used the average of 
the 2002, 2003, and 2004 data points as the measure of 
ROA. We include ROA as a covariate variable because of 
the impact of financial information on the stock market 
(Chauvin and Hirschey 1993; Erickson and Jacobson 1992). 
Table 2 presents the summary statistics for all variables in 
this study. 

Despite having these stringent controls, in light of our 
moderation hypotheses, a lingering issue is whether there 
are systematic industry differences between firms that are 
rated high on product quality (and/or innovativeness) and 
those that are rated low. A close examination of the top and 
bottom firms on the dimensions of product quality and 
innovativeness allays this concern. We find that both the top 
and the bottom firms in terms of their innovativeness and 
product quality ratings cover a variety of industries, such as 
retail, services, and manufacturing. More specifically, top 
innovativeness firms include Apple, Google, Procter & 
Gamble, FedEx, Nike, and Target, among others; bottom 
innovativeness firms include United Airlines, Dillard's, 
Kmart, and Qwest Communications, among others, accord- 
ing to Fortune's large-scale survey data in 2005. In other 
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words, neither the low-innovativeness nor the high- 
innovativeness firms are dominated by particular industry 


types. 


Merged Final Data Set 


We merged data from these different archival sources and 
obtained unbalanced panel time-series, cross-sectional data 
consisting of 452 firm-year observations across 113 firms 
for the 2001—2004 periods. However, one year's data are 
Jost because we employed the lagging process (2001—2003 
for CSR, product quality, and innovativeness; 2002—2004 
for customer satisfaction, Tobin's q, and stock return) to 
reduce the endogeneity bias and reverse-causality concerns 
described previously. Thus, we were able to use 339 data 
points for hypotheses testing. This merged data set includes 
individual firms in various industries, ranging from 
durables (e.g., automobiles, household appliances, personal 
computers), to nondurables (e.g., cigarettes; athletic shoes; 
services, such as airlines, hotels, and utilities), to retail (e.g., 
department stores, discount stores, supermarkets), among 
others. Although FAMA has ratings of CSR, innovativeness 
capability, and product quality for approximately 580 firms 
(Cho and Pucik 2005; Fortune 2005) and ACSI has data on 
approximately 190 firms/brands (Fornell et al. 1996; For- 
nell et al. 2006; Gruca and Rego 2005; Morgan and Rego 
2006), we were not able to obtain a larger sample of firms 
in the merged final data set. This is because many firms 
included in Fortune's source are not represented in the 
ACSI source and because the same firm may have several 
brands in the ACSI (Anderson, Fornell, and Mazvancheryl 
2004). We also tried to search other relevant secondary 
sources (Standard & Poor's industry reports, company 
annual reports, Compact Disclosure, and Moody's report) to 
cross-validate our final data set spanning the period from 
2001 to 2004. 

Note that COMPUSTAT does not have complete data 
points for all variables. For example, because COMPUS- 
TAT does not require companies to report their R&D invest- 
ments (volunteered responses only; see Joseph and Richard- 
son 2002), we found that more than 4096 of observations for 
the control variable of R&D are missing across the years. 
Before testing the hypotheses, we controlled for the covari- 
ates using the same approach applied in prior studies (e.g., 
Ahearne, Bhattacharya, and Gruen 2005; Pan, Ratchford, 
and Shankar 2002). In particular, we ran a linear regression 
with all control variables (firm-level and industry-level) as 
independent variables and Tobin’s q as the dependent 
variable. We saved the unstandardized residuals from this 
regression and then used them as the surrogate for Tobin’s q 
in all structural equation models (SEMs). We also applied 
this approach to obtain the surrogates for stock return. 


Analyses and Results 


Measurement Model Results 


Following the work of Anderson and Gerbing (1988), we 
employ confirmatory factor analysis (CFA) to test the 
validity of the measures. Overall model statistics show that 
the chi-square for the model is 90.73 (d.f. = 48, p > .05), 
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and the comparative fit index (CFI), goodness-of-fit index 
(GFI), and root mean square error of approximation 
(RMSEA) are satisfactory (.94, .92, and .06, respectively). 

As we report in Table 3, the CFA results lend some sup- 
port for the convergent validity for all the measures because 
all estimated loadings of indicators for the underlying con- 
structs are significant (i.e., smallest t-value = 6.53, p < .05). 
Cronbach’s alpha of the constructs exceeded the .7 thresh- 
old (Nunnally 1978). The minimum reliability of these 
measures is .85, as we reported. In addition, the average 
variance extracted (AVE) across the constructs exceeds the 
5 benchmark (see Fornell and Larcker 1981). As Table 3 
shows, the smallest AVE of the constructs is .72. The data 
also supported discriminant validity of the measures. We 
examined pairs of measures using the constrained model 
and unconstrained model in a series of chi-square difference 
tests (Anderson and Gerbing 1988). The test results consis- 
tently indicated that for each pair of constructs, the uncon- 
strained models fit the data better than their constrained 
counterparts, suggesting discriminant validity. In addition, 
we compared the estimated AVE of each measure with the 
squared correlation between-measure pairs (Fornell and 
Larcker 1981). In all cases, we found that the AVEs 
exceeded the squared correlations, further confirming the 
discriminant validity of the constructs. 


Results for the Mediating Role of Customer 
Satisfaction 


In testing the mediating role of customer satisfaction, we 
used SEM to consider explicitly the possible bias of mea- 
surement error on path estimates. Consistent with the proce- 
dures in psychology (e.g., Holmbeck 1997) and marketing 
(Andrews et al. 2004; Handelman and Arnold 1999; Selnes 
and Sallis 2003), our SEMs not only account for measure- 
ment error but also allow for a comprehensive test of the 
hypotheses related to mediation, moderation, and mediated 
moderation. | 

Table 4 reports the results of the SEMs. Hy predicted 
that CSR would positively affect customer satisfaction. 
Model 1 examines this prediction, and the result is statisti- 


cally significant, in support of Hj. We assess the signifi- 
cance of the reported SEM path estimates through a boot- 
strapping approach with 1000 resamples. As the CFI, GFI, 
and RMSEA indicate, Model 1 fits the data well. 

H, predicted that CSR would positively influence a 
firm's market value and that customer satisfaction would 
mediate this influence. To establish the existence of this 
mediation effect, four conditions should hold (Andrews et 
al. 2004): (1) The predictor variable (CSR) should signifi- 
cantly influence the mediator variable (customer satisfac- 
tion); (2) the mediator should significantly influence the 
dependent variable (market value); (3) the predictor (CSR) 
variable should significantly influence the dependent 
variable (market value); and (4) after we control for the 
mediator variable (customer satisfaction), the impact of the 
predictor (CSR) on the dependent variable (market value) 
should no longer be significant (for full mediation) or 
should be reduced in strength (for partial mediation) (Baron 
and Kenny 1986, p. 1177). 

As Table 4 shows, Model 1 meets the first two 
conditions. That is, CSR affects customer satisfaction. 
Furthermore, satisfaction significantly affects both Tobin's q 
and stock return, which is consistent with existing studies 
(Anderson, Fornell, and Mazvancheryl 2004; Bolton and 
Drew 1991; Fornell et al. 2006). Model 2 qualifies the third 
condition; the predictor variable of CSR affects market 
value in terms of Tobin's q and stock return. As Table 4 
shows, Model 2 does not include the mediator of customer 
satisfaction and appears to fit the data reasonably well. The 
fourth condition holds if the effects of CSR on market value 
become insignificant or less significant after the mediator of 
customer satisfaction is included. Model 3 results (no- 
mediation model in Table 4) show that the inclusion of 
customer satisfaction diminishes the strength of the effect of 
CSR on firm market value. The main effects of CSR on both 
Tobin's q and stock return are no longer significant.’ Thus, 





5We also employed ordinary least squares to test the mediation 
hypotheses. The results are consistent and suggest strong support 
for the mediation results of CSR. However, because SEM offers at 


TABLE 3 
Results of the CFA 
S a e 
Construct Items Factor Loading t-Value AVE CR 
CSR 75 .90 
CSR — CSR01 .69 13.43 
CSR — CSR02 71 13.55 
CSR 9 CSRO03 75 13.54 
Innovativeness capabllity (IN) .74 .87 
IN  INO1 .67 11.10 
IN > INO2 .68 11.16 
IN > INOS .62 9.98 
Product quality (PQ) .76 .91 
PQ — PQO1 .78 13.39 
PQ — PQ02 .83 13.92 
PQ — PQ03 .80 13.51 
Customer satisfaction (CS) 72 .85 
CS > CS02 .50 7.21 
CS 5 CS03 48 6.92 
CS > CS04 46 6.53 


tr Cn M — 
Notes: All t-values are significant (p < .05); x? = 90.73 (d f. = 48, p > .05), CFI = 94, GFI = .92, and RMSEA = .06. CR = construct reliability. 
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TABLE 4 


SEM Results for Mediation Effects 
E i Ss es 


SEM Estimates 
ee 


Model Specifications x? d.f. 
Model 1 362.10 101 
Model 2 112.82 59 
Model 3 391.58 96 
Model 4 345.05 97 
Full 
Mediation: 
Mode! 1 

CSR > TQ 
IN > TQ 
PQ — TQ 
CSR x IN TQ 
CSR x PQ TQ 
CSR 9 CS .23** 
IN => CS .20** 
PQ CS .28** 
CSR x IN CS .12* 
CSR x PQ 5 CS .18** 
CSR — SR 
IN — SR 
PQ SH 
CSR x IN > SR 
CSR x PQ SH 
CS > TQ .2b** 
CS > SR Peo" 
R2 

CS .34 

TQ .46 

SR 38 


X*ait (Af. ater) CFI GFI RMSEA 
Compared base .94 92 .05 
.92 91 .07 
29.48** (5)a .91 .89 .07 
17.05** (4)b .96 .94 .04 
Partial 
PV — DV: Nonmediation: Mediation: 
Model 2 Model 3 Model 4 
.14* .08 
.10 .10 Pa Ea 
AZ .14* 12" 
.14* .09 
.20** .15* 19* 
20" 
.19** 
.26** 
.14* 
.18'* 
.13* .08 
.08 .07 
AT .09 
.10 .07 
.18"* .12* og a 
23*" .22"* 
2] .19** 
2 
41 45 .48 
.34 37 39 


Ő 


*p < .05, one-tailed test. i 

**p < .01, one-talled test. 

aThe results of the difference between Model 1 and Model 3. 
bThe results of the diffarence between Model 1 and Model 4 


Notes: CS = customer satisfaction, IN = innovativeness capability, PQ = product quality, TQ = Tobin's q, and SR = stock return. Model 2 (PV > 
DV) does not include the mediator of customer satisfaction. Model 3 (nonmediation effects) Includes the mediator of customer 


satisfaction. 


customer satisfaction seems to mediate fully the direct 
impact of CSR on firm market value (though, it does not 
mediate fully the interaction effects between CSR and 
corporate abilities on market value, as we detail next). As 
such, the data provide strong support for H5, which 
predicted that CSR would increase a firm's long-term 
financial performance through the mediator of customer 
satisfaction.6 





least a weak test of causal pathways and easily compares different 
rival models, but ordinary least squares does not account for mea- 
surement error, we report the results based on the SEMs for all 
hypotheses in this study. 

6We also tested the hypotheses with single-year items for the 
predicting and dependent variables (rather than the reported 
multiple-years-based separate items). The results are similar in 
pattern and further support the hypotheses. 


Results for the Moderating Role of Corporate 
Abilities 


H3; predicted that corporate abilities, such as innovativeness 
capability and product quality, would moderate the impact 
of CSR on market value. Table 5 reports the hierarchical 
SEM results related to moderation effects. Following the 
work of Aiken and West (1991), we mean-centered the 
CSR, innovativeness capability, and product-quality 
variables before generating the interaction terms, and then 
we added the interaction terms hierarchically from Model 2 
to Model 3.7 The results in Table 5 show that the interaction 





7Multicollinearity bias was not a severe problem. The highest 
variance inflation factor was 3.06, and the largest condition index 
was 3.51. Note that in a mean-centered interaction-effects model, 
the estimated coefficient of one independent variable is obtained 
under the assumption of the mean value of other variables. More- 
over, the entry of the interactions terms for CSR, innovativeness 
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TABLE 5 
Hierarchical SEM Results 











SEM Estimates 
Rival Models y? d.f. X2ai (d.i. au CFI GFI RMSEA 
Model 1 15.83 5 96.99** (54)8 .90 .86 .08 
Model 2 177.09 40 64.27** (19)b .91 .89 .07 
Model 3 112.82 59 Compared base .92 .91 .07 
Direct Effects: Direct Effects: Moderated Effects: 
Model 1 Model 2 Model 3 

CSR > TQ .14* 2* .14* 
IN TQ .11* .10 
PQ > TQ Ag 47** 
CSR x IN TQ .14* 
CSR x PQ > TQ .20** 
CSR SH .12* .13* aa" 
IN — SH .08 08 
PQ SR .10 T 
CSR x IN => SR .10 
CSR x PQ > SR .18** 
R2 

TQ 30 35 41 

SR .28 29 34 





*p « 05, one-talled test. 

“po < .01, one-talled test. 

aThe results of the difference between Model 1 and Model 3. 
bThe results of the difference between Model 2 and Model 3 


Notes: CS = customer satisfachon, IN = innovativeness capability, PQ = product quality, TQ = Tobin's q, and SR = stock return 


term of CSR x product quality significantly affects both 
Tobin's q and stock return, though the interaction term of 
CSR x innovativeness capability affects only Tobin's q. 

To facilitate the interpretation of the moderating effects, 
Figure 2, Panel A, illustrates the relationship between CSR 
and Tobin's q for firms with low or high innovativeness 
capability (see Aiken and West 1991, pp. 12-14). Figure 2, 
Panel A, suggests that firms with low innovativeness capa- 
bilities generate negative market value from CSR, whereas 
firms with high innovativeness generate positive market 
. value from CSR. However, Figure 2, Panel B, shows that 
though firms with high product quality generate positive 
market value from CSR (the upward-sloping line), firms 
with low product quality seem not to be penalized in terms 
of generating market value from CSR (the rather flat line). 
As such, overall, we find support for H4 when we use inno- 
vativeness capability as the measure of corporate abilities, 
but we find only partial support for H4 when we use product 
quality as the measure of corporate abilities. 


Results for the Mediating Role of Customer 
Satisfaction in the Moderated Relationships 


Hy predicted that customer satisfaction would mediate the 
moderated relationships in H4. Although a test of this com- 
bination of mediation and moderation is somewhat compli- 
cated, Baron and Kenny (1986, p. 1179) recommend a prac- 


capability, and product quality explained significantly more vari- 
ance of market value beyond the main effects, adding 696 more 
variance for Tobin's q and 5% more variance for stock return. 
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tical approach. Essentially, it is similar to the four condi- 
tions of mediation we described previously, but it requires 
entering the interactions items (CSR x innovativeness capa- 
bility and CSR x product quality) rather than the main 
effect of CSR. More specifically, to establish mediated 
moderation, four specific conditions must be met: (1) The 
interaction variables (CSR x innovativeness capability and 
CSR x product quality) should significantly influence the 
mediator (customer satisfaction); (2) the mediator should 
significantly influence the dependent variable (market 
value); (3) the interaction variables (CSR x innovativeness 
capability and CSR x product quality) should significantly 
influence the dependent variable (market value); and (4) 
after we control for the mediator variable (customer satis- 
faction), the impact of the interaction variables (CSR X 
innovativeness capability and CSR x product quality) on the 
dependent variable (market value) should be no longer sig- 
nificant (for full mediation) or reduced in strength (for par- 
tial mediation) (Baron and Kenny 1986, p. 1179). Follow- 
ing this advice, prior studies in both strategy (Shin and 
Zhou 2003) and marketing (Andrews et al. 2004; Handel- 
man and Arnold 1999) have tested hypotheses combining 
mediation and moderation. 

Because the second and third conditions are met, when 
testing H(—H5, we need to check only for the first and 
fourth conditions. The significant paths from these interac- 
tion terms to satisfaction in Model 1 (Table 4) suggest that 
the first condition is also met. In addition, entering the 
mediator of customer satisfaction indeed decreases the 
impact of these interaction terms from Model 2 to Model 3 


FIGURE 2 | 
The Moderated Effect of CSR on Market Value 
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in Table 4. In particular, the impact of CSR x innovative- 
ness capability on Tobin's q is no longer significant, sug- 
gesting full mediation (this is not the case for stock return, 
in which the coefficients in both Model 2 and Model 3 are 
insignificant). In addition, the impact of CSR x product 
quality on both Tobin's q and stock return is diminished 
(but still significant), indicating partial mediation. Thus, 
these results suggest that the moderation relationships in H4 
are only partially mediated by customer satisfaction, in sup- 
port of H,.8 


8Note that Baron and Kenny (1986, p. 1179) label the 
relationships we tested as “mediated moderation,” which means 





Rival Models and Alternative Explanations 


We conducted additional analyses and ruled out several 
competing explanations. Because the aforementioned 
results suggest partial mediation, we fit several additional 
SEMs with different partial mediation effects (step-by-step 
adding/removing of individual paths from the predictive 
variables of CSR, innovativeness, and product quality to the 
predicted variables of Tobin’s q and stock return). The path 
estimates of the best-fit partial mediation model appear in 
Figure 3 and in the last column (Model 4) in Table 4. An 
examination of Figure 3 suggests three insights. First, 
although the main effect of CSR is fully mediated by cus- 
tomer satisfaction, CSR has an interaction effect with prod- 
uct quality that is not fully, but rather partially, mediated by 
customer satisfaction (in other words, this interaction effect 
between CSR and product quality directly influences 
Tobin’s q and stock return). Second, a firm’s product quality 
and innovativeness both have direct and indirect (through 
customer satisfaction) influence on Tobin’s q performance, 
which is consistent with prior literature (Dutta, Narasimhan, 
and Rajiv 1999; Fornell et al. 1996; Rao, Agarwal, and 
Dahlhoff 2004). Third, we can reject several alternative 
explanations, including the conjectures that the impact of 
product quality on market value is fully mediated by satis- 
faction and that innovativeness capability influences only 
firm performance but not intermediate outcomes, such as 
customer satisfaction. 

Furthermore, we ruled out several rival models. For 
example, as we report in Table 4, our SEM results suggest 
that the partial-mediation SEM (Model 4) fits the data better 
than the full-mediation SEM (Model 1; ¥24:¢ = 29.48, 
df. gier = 5, p < .05) and that the full-mediation SEM fits the 
data better than the nonmediation SEM (Model 3; ¥2,¢ = 
17.05, d.f.ag = 4, p < .05). Another criterion for SEM com- 
parison is the number of significant parameters (Morgan 
and Hunt 1994; Selnes and Sallis 2003). We find that the 
rival models with full mediation and nonmediation gener- 
ated fewer significant path estimates. Thus, our hypothe- 
sized partial-mediation model fits the data better than com- 
peting models in terms of both the relative predictive power 
of the overall model and the relative number of significant 
path estimates. 


Discussion 


How is CSR related to firm market value, and why do CSR 
initiatives result in financial gains for some firms but losses 
for others? Our study suggests that the answer to these 
questions is twofold: (1) CSR affects market value partially 
through the mediator of customer satisfaction, and (2) 
returns to CSR can be both positive and negative depending 





that controlling the mediator makes the influences of CSR x 
innovativeness capability and CSR x product quality no longer 
significant or less significant. This is different from “moderated 
mediation,” in which the moderators should also moderate the 
mediator-performance linkage (Baron and Kenny 1986, p. 1179). 
A pictorial illustration of the differences between mediated 
moderation and moderated mediation can be found in the work of 
Handelman and Arnold (1999, p. 38). 
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FIGURE 3 
SEM Results of Best-Fit Partlal Mediation Model 
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*p « .05, one-talled test. 
**p < .01, one-tailed test. 


Notes: Bolded paths are hypothesized relationships. Dashed paths are supported partlal mediation results. We assessed the significance of all 
SEM path coefficients through a bootstrapping approach wrth 1000 resamples. 


on the levels of a firm's corporate abilities. Based on a com- 
prehensive secondary data set, our results show that cus- 
tomer satisfaction plays a significant role in the relationship 
between CSR and firm market value and that a proper com- 
bination of both CSR initiatives and product-related abili- 
ties is important. These results have implications for both 
marketing theory and practice. 

Before presenting the implications, we note that 
FAMA's survey-based measure of CSR is an important lim- 
itation of this article. As we detailed in the "Data and 
Variables" section, the FAMA ratings are one possible 
source of CSR information and thus restrict our analysis 
and conclusion. To inspire greater confidence in our find- 
ings, further research should also attempt to replicate and 
extend our analysis with alternative measures of CSR. For 
example, measuring direct spending on CSR initiatives with 
a large-scale record of CSR monetary expenses across 
many firms (if obtained reliably from third-party agencies 
or firms’ own reporting; see Margolis and Walsh 2003; Orl- 
itzky, Schmidt, and Rynes 2003; Tsoutsoura 2004) would 
put CSR on par with measures such as advertising and 
R&D investments. A clear advantage of this direct approach 
is that marketing researchers would be able to compare and 
contrast the financial returns to these different types of 
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spending in an ideal way (i.e., by uncovering the relative, 
incremental, and synergistic impact of CSR, advertising, 
and R&D on a firm's market value). 


Implications for Marketing Theory 


Although CSR has been linked to customer responses (e.g., 
Bhattacharya and Sen 2004; Brown 1998; Brown and Dacin 
1997), this was the first marketing study to explore the rela- 
tionship between CSR and market value. Our work extends 
the research stream on the outcomes of CSR from perceived 
customer responses based on hypothetical lab experiments 
toward eventual financial returns based on large-scale sec- 
ondary data. It provides a direct answer to the calls for 
efforts that link CSR to a firm's stock performance (Berens, 
Van Riel, and Van Bruggen 2005; Luo and Donthu 2006; 
Rust, Lemon, and Zeithaml 2004). Indeed, Brown and 
Dacin (1997, p. 68) note that “we do all good things,... but 
we don't know if we get anything out of it.” The findings 
pertaining to the significant influence of CSR on a firm's 
Tobin's q and stock return attest to the financial value of 
CSR programs as strategic initiatives. Thus, future market- 
ing research should examine a wider spectrum of the bene- 
fits of CSR, ranging from perception-based outcomes to 
archive-based financial returns. 


A more important contribution of this research is that 
we identified a route through which CSR is related to a 
firm's market value. Our results of the significant CSR > 
customer satisfaction — market value causal chain suggest 
that a firm’s CSR helps build a satisfied customer base and 
that customer satisfaction partially mediates the financial 
returns to CSR. This mediating role of customer satisfaction 
is important for two reasons. First, it extends the CSR lit- 
erature by uncovering a previously ignored outcome (i.e., 
customer satisfaction) of CSR. Although prior work has 
noted that CSR should affect various kinds of consumer 
Tesponses, Customer satisfaction has not yet been explicitly 
examined as one such outcome. Second, it also extends the 
research stream on customer satisfaction (Anderson, For- 
nell, and Mazvancheryl 2004; Fornell 1992) by uncovering 
the antecedents (ie., CSR) of customer satisfaction. 
Although an emerging research strand has examined the 
outcomes of customer satisfaction (Anderson, Fornell, and 
Mazvancheryl 2004; Anderson Fornell, and Rust 1997; For- 
nell et al. 2006; Mittal et al. 2005), efforts have rarely been 
undertaken to examine factors that increase or decrease cus- 
tomer satisfaction. Overall, this chained relationship from 
CSR to customer satisfaction to a firm’s market value sug- 
gests that achieving customer satisfaction represents one of 
the underlying pathways through which the financial poten- 
tial of CSR is realized and capitalized. The notion that the 
extent to which CSR is beneficial to the firm is determined 
by how much CSR builds a satisfied customer base points 
further research in a more precise direction when evaluating 
the ultimate financial impact of CSR. 

Furthermore, our findings suggest that the financial 
returns to CSR are not the same, but rather are different, 
across firms with different internal situations. In particular, 
our finding that the positive financial returns to CSR are 
amplified in firms with higher product quality indicates that 
a proper mix or combination of external CSR initiatives and 
internal corporate abilities likely generates and sustains 
financial value for the firm. In this sense, we provide 
empirical evidence for the resource-based view. That is, in 
support of the resource-based view (Barney 1986; Penrose 
1959; Wernerfelt 1984) and marketing capability (Day 
1994; Vorhies and Morgan 2005) literature, we find that a 
firm’s sustainable competitive advantages indeed result 
from a complementary “bundle” of valuable internal (cor- 
porate abilities) and external (CSR initiatives) assets. Thus, 
further research is encouraged to go beyond the simple, uni- 
versal effects of CSR and explore contextual situations that 
moderate the associations between CSR and market value. 

Finally, existing marketing research has been enthusias- 
tic about the positive benefits of CSR, but unfortunately, it 
has potential negative outcomes (for an exception, see Sen 
and Bhattacharya 2001). Our research indicates that CSR 
can harbor a dark side. That is, in firms that are less innova- 
tive in nature, CSR may decrease customer satisfaction lev- 
els and ultimately reduce the firm’s financial returns. This 
finding of the negative returns to CSR in the low- 
innovativeness condition can be understood from the per- 
spective of competitive signaling theory (Caves and Porter 
1977; Stigler 1961). 


In particular, this theory holds that low (high) innova- 
tiveness competency in firms may serve as a cue of inferior 
(superior) competitiveness to corporate stakeholders, thus 
signaling weaker (stronger) future performance to financial 
investors in the marketplace. In the light of signaling theory, 
we conjecture that though CSR may help firms obtain insti- 
tutional legitimacy (i.e., by being socially responsive and 
supportive), firms that are less innovative in meeting cus- 
tomer needs may send a negative signal of incorrect strate- 
gic choice and misguided firm priorities in the market that 
contaminates and degrades this legitimacy (DiMaggio and 
Powell 1983; Scott 1987). The resulting costs of signaled 
noncompetitiveness in the market may outweigh the bene- 
fits of CSR and thus lead to negative market value. Con- 
ceivably, consumers may view CSR activities in firms with 
low asset specificity as opportunistic (i.e., manipulative and 
misleading with disguised selling purposes), which causes 
CSR to backfire and leads to consumer boycotts (Sen and 
Bhattacharya 2001; Smith 2003). It is also possible that 
firms that are low in corporate abilities likely invest in less 
influential and pure cost-adding CSR activities, such as 
cash donations. In contrast, firms that are high in corporate 
abilities implement “smarter” CSR strategies that are rela- 
tively idiosyncratic and thus generate more long-term finan- 
cial benefits. We call for further investigation of possible 
explanations of the observed asymmetric returns to CSR. 


Implications for Marketing Practice 


Marketers have pondered whether companies should take a 
more strategic tack on CSR and how “doing good” can con- 
tribute to their bottom line (Brown and Dacin 1997; Sen and 
Bhattacharya 2001). These are important issues that have 
strong managerial implications because prudent practition- 
ers face tough choices in allocating their limited resources 
and in prioritizing different strategic initiatives. 
Even evangelists such as Nardelli [chief executive officer 
of Home Depot] stop short of saying that companies 
should divert money from other strategic priorities to sup- 
port [CSR]. But at corporations like Home Depot and 
[General Electric], good works are being bred into Big 
Business. ‘It’s just the right thing to do,’ says Nardelli. 
Good PR? Sure. Money well spent? The goodwill refund 
could be in the mail. (Grow, Hamm, and Lee 2005, p. 78) 


Our finding that CSR contributes positively to market 
value suggests that managers can obtain competitive advan- 
tages and reap more financial benefits by investing in CSR. 
To be more specific, we calculated that for a typical com- 
pany in our sample with an average market value of approx- 
imately $48 billion, one unit increase of CSR ratings would 
result in approximately $17 million more profits on average 
in subsequent years, a substantial increase of financial 
returns. 

Indeed, companies should realize that CSR initiatives 
can represent a robust public relations strategy, particularly 
in the current market environment in which stakeholders, 
such as customers (and employees), may have strong social 
concerns. Creative executives at Home Depot, IBM, Wal- 
Mart, General Electric, and Cisco are engaging in “smarter 
corporate giving" than merely writing checks (Berner 2005, 
p. 68). For example, Home Depot donated 2 million hours to 
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various types of community services, and IBM gave away 
more than 100 specialized business applications (i.e., trans- 
lation servers changing English e-mails into Spanish mes- 
sages) in heavily Latino-populated schools and community 
groups. Closer examination of the CSR portfolios of some of 
the top- and bottom-rated firms in terms of CSR sheds addi- 
tional light on how managers may derive positive market 
returns from CSR and/or avoid negative returns. That is, 
many of the firms at the top of the CSR heap (e.g., United 
Parcel Service, Alcoa, Verizon Communications) seem to 
have integrated CSR tightly with their business strategies. 
For example, these firms invest in a host of employee-related 
initiatives, such as education and safety, that engender iden- 
tification and instill pride among employees, all of which 
influence customer satisfaction and market value. Moreover, 
these firms have employee volunteerism programs in which 
employees are visible contributors to the local communities. 
This helps capture customers' favorable attention. 

In contrast, firms at the bottom of the CSR heap, such as 
Toys ‘R’ Us and Mitsubishi Motors, seem to be perceived as 
“irresponsible” by dint of mistreating workers and/or con- 
cealing product defect information. Such negative actions 
tend to receive media coverage 1n today's scrutiny-intensive 
world. Viewed in conjunction with our results, these exam- 
ples suggest that managers should not only "get their house 
in order" to avoid negative returns to CSR but also adopt an 
integrated, strategic perspective and allocate resources to 
CSR programs for superior market performance. After all, 
*it is no longer an option [for companies] to sit on the side- 
lines" (Grow, Hamm, and Lee 2005, p. 77; Smith 2003). 

Our findings that CSR increases customer satisfaction, 
which in turn leads to positive financial returns, may 
improve managers' understanding of why CSR matters. In 
particular, marketers may have already known that CSR 
helps promote external social benefits, such as public good- 
will outside the firm (Houston and Johnson 2000; McGuire, 
Sundgren, and Schneeweis 1988), which can polish a firm's 


reputation in the presence of corporate scandals or regula- 
tory scrutiny. In addition, CSR can boost internal employee 
morale and commitment within the firm (Godfrey 2005; 
McGuire, Schneeweis, and Branch 1990) and attract more 
capable, young talents who are trying to "marry their work 
and nonwork lives" (Grow, Hamm, and Lee 2005; for 
detailed benefits of CSR and cause-related marketing, see 
Varadarajan and Menon 1988, p. 60). Importantly, we sug- 
gest an additional insight to managers: CSR initiatives also 
influence customers' satisfaction levels, which ultimately 
Jead to higher market returns. To managers, this means that 
building satisfaction is an important intermediate step in 
converting CSR into financial gains. 

However, our findings of the boundary conditions of the 
returns to CSR suggest that managers should not ignore the 
inherent traps and pitfalls of CSR. For example, we show 
that firms are not always able to benefit from CSR actions. 
When companies are not innovative, our findings indicate 
that CSR actually decreases their market return. Thus, CSR 
seems to be a double-edged sword; without proper support 
of corporate abilities, such as innovativeness, CSR can be 
harmful to firm performance. Indeed, when “doing better at 
doing good" (Bhattacharya and Sen 2004, p. 9), it is impor- 
tant for managers to consider CSR initiatives in the light of 
the firm's corporate abilities. In particular, less innovative 
companies may be better off financially by avoiding CSR 
actions. Managers should understand that a misalignment of 
CSR with internal factors can be detrimental and lead to 
decreased market value. As a consequence, marketers need 
to examine carefully the organizational context in totality 
before implementing CSR initiatives. 

In conclusion, this research suggests a more nuanced 
understanding of the market returns to CSR initiatives. Our 
findings seem to indicate that "doing good" has compli- 
cated implications and that customer satisfaction plays an 
important mediating role in the relationship between CSR 
and firm market value. 


————sÓ—À— P ——————P— Á—  — — ——— ——————————— 
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Martin Holzwarth, Chris Janiszewski, & Marcus M. Neumann 


The Influence of Avatars on Online 
Consumer Shopping Behavior 


An impediment to Web-based retail sales is the impersonal nature of Web-based shopping. A solution to this 
problem is to use an avatar to deliver product information. An avatar is a graphic representation that can be 
animated by means of computer technology. Study 1 shows that using an avatar sales agent leads to more 
Satisfaction with the retailer, a more positive attitude toward the product, and a greater purchase intention. Study 2 
shows that an attractive avatar is a more effective sales agent at moderate levels of product Involvement, but an 
expert avatar is a more effective sales agent at high levels of product involvement. 


medium. It is estimated that between 700 million and 

50 million people used the Internet in 2004 and that 

close to 500 million of these people were frequent users 

(i.e., more than once a week). Approximately 200 million 

people engaged in retail commerce on the Internet, with 

worldwide retail sales approaching $70 billion in 2004 

(e.g., United States: $44 billion; Europe: $10 billion; Asia: 

$10 billion). Yearly Internet retail sales growth averaged 

30% per year from 2001 to 2005 and is expected to grow at 
a similar rate in the near future. 

Despite the strong growth of Internet sales, this distribu- 
tion channel has not been as successful as was' once pro- 
jected. In 2000, it was anticipated that as much as 7.8% of 
U.S. retail sales would be transacted on the Internet in 2005 
(Dykema 2000). However, only 2.0% of U.S. retail sales 
were transacted on the Internet in the fourth quarter of 2004 
(U.S. Census Bureau 2005), and an even smaller percentage 
was transacted in Europe and Asia. The greatest impedi- 
ment to Internet channel growth has been the low shopper 
conversion rate, that is, the percentage of visitors to a retail 
site who actually make a purchase. The conversion rate of 
Internet shoppers averages only 4.9% among the top 100 
Internet retailers (Nielsen//NetRatings 2005), a rate signifi- 
cantly lower than experienced by comparable firms using 
traditional retailing channels. Moreover, research shows 
that between 65% and 75% of consumers who initiate an 
online transaction fail to complete the transaction (Mum- 
mert & Partner 2001). 


T= Internet is a major communication and transaction 
9 


Martin Holzwarth Is a consultant at Marketing Partner, MP Management 
Consultants GmbH, Wiesbaden, Germany (e-mail: mh & martinholzwarth. 
com). Chris Janiszewski Is Jack Faricy Professor of Marketing, Warrington 
College of Business Administration, University of Florida (e-mail: chris. 
Janiszewski cba.ufl.edu). Marcus M. Neumann is a doctoral student, 
Department of Business Administration and Marketing Il, University of 
Mannheim (e-mail: marcus.neumann @bwi.un-mannheim.de). All three 
authors contributed equally to this work. The authors thank Alan Sawyer 
for his helpful comments. 


To read or contribute to reader and author dialogue on this article, visit 
http://www. marketingpower.com/mbiog. 


© 2006, American Marketing Association 
ISSN: 0022-2429 (print), 1547-7185 (electronic) 


19 


Survey research suggests that the most significant 
inhibitors of online shopping are the absence of pleasurable 
experiences, social interaction, and personal consultation by 
a company representative (Barlow, Siddiqui, and Mannion 
2004; G&J Electronic Media Services 2001). Consumers 
report that online companies are impersonal, they feel help- 
less when shopping virtually in unfamiliar or complex prod- 
uct categories, and they want the customer assistance often 
found in a conventional shopping environment combined 
with the convenience of Internet shopping. Thus, improving 
the Internet shopping experience should improve the con- 
version rate of potential buyers (Childers et al. 2001). 

One approach to increasing the entertainment value, 
information value, and customer satisfaction of Web-based 
shopping experiences is to use "avatars" (Barlow, Siddiqui, 
and Mannion 2004; Redmond 2002). Avatars are virtual 
characters that can be used as company representatives. 
Avatars can serve as identification figures, as personal shop- 
ping assistants, as Web site guides, or as conversation part- 
ners. In these roles, avatars have the potential to fulfill the 
consumer's desire for a more interpersonal shopping experi- 
ence. Thus, an electronic shopping agent may be able to 
ease a consumer's navigation of a Web site or provide per- 
sonalized information, but an avatar can anthropomorphize 
the interaction and make the shopping experience more 
interpersonal. Consequently, the information provided on 
the Web site should be perceived as more credible, the 
shopping experience should become more enjoyable, and 
the likelihood of a purchase should increase. 

The goal of this research is to investigate the benefits of 
using avatars as company representatives on commercial 
Web sites. Two studies show that avatars positively affect an 
online shopping experience. In Study 1, an avatar communi- 
cator creates a more positive perception of the entertain- 
ment value and informativeness of a Web site. Conse- 
quently, shoppers are more satisfied with the retailer, more 
positive about the product, and more likely to purchase the 
product. These avatar advantages persist even when infor- 
mation content is held constant between an avatar and a no- 
avatar format. Study 2 finds that moderately involved shop- 
pers are more persuaded by attractive-looking avatars and 
that highly involved shoppers are more persuaded by expert 
avatars. Attractive avatars are persuasive because of their 
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likeability, whereas expert avatars are persuasive because of 
their credibility. 


Avatars 


The word “avatar” has its derivation in the ancient Indian 
language Sanskrit and refers to the embodiment of a deity 
on earth. Consistent with this original definition, present- 
day definitions of an avatar refer to a representation of an 
entity. For example, an avatar is described as “a pictorial 
representation of a human in a chat environment" 
(Bahorsky, Graber, and Mason 1998, p. 8) or as "a repre- 
sentation of the user as an animated character in virtual 
worlds" (Loos 2003, p. 17). There are also technology- 
oriented approaches to defining avatars. For example, the 
computer trade press describes avatars as "graphic personi- 
fications of computers or processes that run on computers" 
(Halfhill 1996, p. 69). For our purposes, avatars are defined 
as general graphic representations that are personified by 
means of computer technology. 


Computer-Medlated Interaction 


According to the theory of social response, people tend to 
react to computer technology as though it is a social entity 
(Moon 2000, 2003; Reeves and Nass 1996). Whenever 
computer technology exhibits humanlike behaviors, such as 
language production, taking turns in conversation, and reci- 
procal responding, the user is more apt to personify the 
technology (Moon 2000; Nass et al. 1995a). This tendency 
to treat a computer as a social entity occurs whether the rep- 
resentation of the computer is the screen, a voice, or an 
agent (Moon 2000). Thus, when people are confronted with 
a computer or software program, they have a tendency to 
engage in a “social response to (these) communication tech- 
nologies,” or what Morkes, Kernal, and Nass (1999) call the 
SRCT approach. 

An early example of a social response to computer tech- 
nology is provided by Weizenbaum (1976), who exposed 
experimental participants to a computer-based speech 
analysis program called ELIZA. This program reformulated 
inputs from the participants into psychoanalytical ques- 
tions. The interaction caused participants to develop an 
emotional relationship to and attribute human characteris- 
tics to the computer, even though the participants were fully 
aware that the computer did not possess a personality 
(Morkes, Kernal, and Nass 1999). More recent evidence 
shows that users respond to computers as they do to people 
in terms of psychosocial phenomena, such as personality, 
politeness, and flattery (Moon 2003; Nass et al. 1995a; 
Nass et al. 1995b; Nass and Steur 1993). 


Avatar-Medlated Interaction 


Research on computer-mediated interactions persuasively 
demonstrates that computer technology can be personified. 
Yet the research is silent with respect to the benefits of 
using an avatar to represent the technology. However, the 
research on mediated communication may provide insight 
into this issue. This stream of research suggests that medi- 
ated communications (e.g., radio, television) are less per- 
suasive than interpersonal communication because medi- 
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ated communications lack reciprocity (Horton and Wohl 
1956; Rubin, Perse, and Powell 1985). There is also evi- 
dence that increasing the reciprocity of a technology-based 
interaction through verbal disclosures increases compliance 
with requests made by the technology (Moon 2000). We 
posit that simply having an avatar pictured during the 
human-computer interaction will make the interaction feel 
more conversational and reciprocal. Thus, adding an avatar 
to a decision support system on a retail Web site will 
increase the effectiveness of the Web site. This prediction is 
consistent with recent evidence that relationships are more 
likely to develop if the computer technology is represented 
using human forms (Trogemann 2003). 


Avatars as Persuasion Agents 


If avatars enhance the personification of a technology, they 
should influence the purchase process in a manner similar 
to human sales agents. Human sales agents have been 
shown to increase satisfaction with a retailer, enhance atti- 
tudes toward products sold by the retailer, and increase the 
consumer's intention to buy (Katz and Lazarsfeld 1955; 
Webster 1968). Thus, avatars should have a similar impact 
on the shopping experience (Redmond 2002). 


H,: Avatar-mediated communication has a positive effect on 
(a) satisfaction with the retailer, (b) attitude toward the 
product, and (c) purchase intention. 


It is also important to understand how avatar-mediated 
communications can influence purchase behavior. First, 
there is a long history of evidence that personal 
communication is more effective than mass-media 
communication (Williams 1977). Second, there is a long 
history of evidence that personal communication is more 
complex, adaptive, and satisfying than other forms of 
communication (Allen et al. 2002). Both research streams 
help explain the effectiveness of face-to-face interactions 
with sales agents. In general, face-to-face interactions with 
sales agents are effective not only because sales agents can 
provide information, be empathic, and build rapport 
(Barlow, Siddiqui, and Mannion 2004) but also because the 
agent's information is perceived as more accurate and is 
more likely to be believed (Soldow and Thomas 1984). In 
effect, sales agents enhance the value of the information 
provided and increase the pleasure of the shopping 
experience, especially in a retailing environment (Reynolds 
and Beatty 1999). Given that people visit Web sites for 
entertainment needs and information needs (Eighmey and 
McCord 1998; Korgaonkar and Wolin 1999), avatar- 
mediated communications may have an impact similar to 
human-sales-agent-mediated communications. 


H5: Avatar-mediated communication has a positive effect on 
(a) the perceived entertainment value of a Web site and (b) 
the perceived information value of a Web site. 


Finally, we expect that the perceived entertainment and 
information value of a Web site will mediate the consumer's 
satisfaction with the retailer, attitude toward the product, 
and purchase intention. Our expectation of the influence of 
these mediating factors is related to our previous discussion 
of deterrents to Internet shopping. Recall that an important 
deterrent to consumers' use of the Internet for product pur- 


chases is the limited range of personalized services avail- 
able within this medium. The most notable omission is the 
lack of face-to-face communication (Phillips et al. 1997). 
Swaminathan and colleagues (1999) observe that many 
people have a strong desire for social interaction in their 
shopping experience. If a marketer can positively affect a 
customer's mood by providing entertaining content, this 
should have a positive effect on the perception of the com- 
pany and its products (Brown and Stayman 1992; MacKen- 
zie and Lutz 1989; MacKenzie, Lutz, and Belch 1986). 
Similarly, if a marketer can make information 'seem more 
relevant and important, this should encourage a more posi- 
tive response toward the retailer and its offerings (Weitz, 
Sujan, and Sujan 1986). 


Hs: (a) The perceived entertainment value of a Web site and 
(b) the perceived information value of a Web site mediate 
the positive influence of an avatar-mediated communica- 
tion on satisfaction with the retailer, attitude toward the 
product, and purchase intention. 


We summarize these relationships in Figure 1 under the 
heading “Influence of Avatars.” 


Designing Effective Avatars 


Kelman (1961) explains how three processes of social influ- 
ence affect the message recipient’s acceptance of a mes- 
sage. The first process focuses on the recipient’s identifica- 
tion with the communicator. In this case, the recipient 
adopts the attitude of the communicator through imitation 
or internalization of the message. The interpersonal attrac- 
tion of the communicator is a prerequisite for motivating 


the recipient to adopt the communicator’s position. The sec- 
ond process concerns the recipient’s perceptions of the 
communicator’s credibility as a consequence of the exper- 
tise of the communicator and the trust that develops 
between the communicator and the recipient. In this case, 
the credibility of the communicator plays a key role in per- 
suasion. The third process pertains to the compliance of the 
recipient. ‘The recipient is more compliant if he or she feels 
controlled by the communicator and/or the communicator 
can provide rewards and punishments. We focus on the first 
two processés. 

Moderator. We anticipate that the attractiveness and the 
expertise of an avatar will have an influence on its persua- 
siveness and that the consumer's involvement with the pur- 
chase will moderate the influence of these characteristics. 
The expectation that the persuasiveness of different types of 
avatars will vary with involvement is consistent with a long 
history of research on persuasion by human communicators 
(see Cooper and Croyle 1984; Wood 2000). For example, 
the attractiveness of a communicator has been shown to 
have a greater impact on persuasion at lower levels of 
involvement (Petty, Cacioppo, and Goldmann 1981), 
whereas the expertise of a communicator’s message has 
been shown to have a greater impact on persuasion at higher 
levels of involvement (Petty, Cacioppo, and Heesacker 
1981). The positive characteristics of the communicator can 
generalize to the products being promoted by these people 
(Caballero and Pride 1984). 

The varied influence of communicator attractiveness 
and communicator expertise may generalize to avatar com- 
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munications. At the lowest level of purchase involvement, 
there will be little difference between an attractive and an 
expert avatar because the communication agent and mes- 
sage are irrelevant to the consumer. An avatar’s attractive- 
ness will contribute to an avatar’s persuasiveness as involve- 
ment increases from a low to a moderate level, but it will 
have less influence on persuasiveness as involvement 
increases from a moderate to a high level. An avatar's 
expertise, as exemplified in the message quality, will con- 
tribute to an avatar’s persuasiveness as involvement 
increases from a low to a moderate to a high level. In sum- 
mary, the influence of avatar attractiveness on persuasion is 
a log function of increasing involvement, whereas the influ- 
ence of avatar expertise on persuasion is a linear function of 
increasing involvement (see Figure 2). We expect that these 
different patterns of response to avatar attractiveness and 
expertise will allow an attractive avatar to be more persua- 
sive at moderate levels of involvement and an expert avatar 
to be more persuasive at high levels of involvement. 
Formally, 


H4: The influence of avatar attractiveness on persuasion is a 
log function of increasing involvement, whereas the influ- 
ence of avatar expertise on persuasion is a linear function 
of increasing involvement. 


H,,: An attractive avatar will be more persuasive than an 
expert avatar at moderate levels of involvement. 


Hay: An expert avatar will be more persuasive than an attrac- 
tive avatar at high levels of involvement. 


Mediators. Whereas H4 discusses factors (e.g., enter- 
tainment value, information value) that might mediate the 
positive influence of having an avatar present on a Web site, 
H4 hypothesizes that different types of avatars (e.g., attrac- 
tive, expert) may be effective in different situations. It is 
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also possible to speculate about processes that mediate the 
relative effectiveness of an attractive or expert avatar. For 
example, interactions with physically attractive people are 
pleasant and rewarding (Bull and Rumsey 1988). The pleas- 
ant feeling that results from an interaction with an attractive 
person, in turn, promotes the adoption of the communica- 
tors behaviors and attitudes (Chaiken 1979; Dion, 
Berscheid, and Walster 1972; Joseph 1982; McGuire 1985). 
In addition, physically attractive people are perceived more 
favorably on traits typically associated with selling effec- 
tiveness (Reingen and Kernan 1993). Thus, the attractive- 
ness of an avatar should influence the likeability of an 
avatar, and likeability should mediate the degree of 
persuasion. 


Hs,: The likeability of the avatar mediates the persuasiveness 
of an attractive avatar relative to an unattractive avatar. 


Similarly, the expertise of an avatar should influence 
perceptions of the credibility of the avatar, and credibility 
should mediate the degree of persuasion. For our purposes, 
it is important to note that particularly experienced or 
knowledgeable people are perceived as experts (Friedman 
and Friedman 1979; Stáudel 1987) and that expert commu- 
nicators are perceived as more credible than nonexpert com- 
municators (Brehm, Kassin, and Fein 2005; Hovland, Janis, 
and Kelley 1953; McGuire 1985). In general, research 
investigating source expertise in persuasive communica- 
tions indicates that the source's perceived credibility has a 
positive impact on persuasion (Horai, Naccari, and Fatoul- 
Jah 1974; Maddux and Rogers 1980; Mills and Harvey 
1972). For example, the expertise of a corporate communi- 
cation agent affects the perceived credibility of the commu- 
nication (Freiden 1984; Rubin, Mager, and Friedman 1982) 
and the associated company (Stephens and Faranda 1993), 
as well as the acceptance of the message. To the extent that 
avatars persuade as human communicators do, we hypothe- 
size similar mediation effects for avatars. 


Hs: The perceived credibility of the avatar mediates the per- 
suasiveness of an expert avatar relative to a nonexpert 
avatar. 


We summarize these relationships in Figure 1 under the 
heading “Influence of Type of Avatar.” 


Study 1 


Study 1 explored the influence of avatars on consumer 
responses to Web-based merchandising. The purchase sce- 
nario was an opportunity to purchase a leisure shoe that 
could be customized by means of an online consultation. 
This online consultation was performed using a series of 
diagnostic screens that solicited the consumer to make 
product design decisions. The key manipulation was 
whether the consumer was led to believe that an impersonal 
software program or an avatar was soliciting information 
and design recommendations. 


Design and Stimull 


The experiment was an avatar personification (between- 
subjects manipulation) x intrinsic involvement (measured 
variable) mixed design. The key manipulation was the per- 


sonification of the avatar available to assist with the pur- 
chase (no avatar, attractive avatar, or expert avatar). We 
established the attractiveness and expertise of the avatars in 
three ways: appearance, credentials, and information (prod- 
uct positioning). We purposely aligned these three factors to 
create a consistent personification for the avatar.! 


Altractive avatar condition. We designed attractive 
avatars tO appear younger, thinner, and more athletic (see 
Figure 3). The avatar introduced itself by saying, “Hello, 
my name is Kim (Tom). J have recently been employed as a 
shoe consultant. I have already gained preliminary knowl- 
edge about our popular, customizable shoe products. I can 
certainly help you find an attractive offer.”2 The avatar then 
began the consultation using a fashion-oriented statement 
that was consistent with the avatar’s personification: 


Did you know a person’ s outfit is judged, above all, by the 
shoes that he or she is wearing? And comfortable sports 
shoes are not seen as very stylish. It is too bad that most 
fashionable shoes look trendy but do not offer much com- 
fort. Using our company, you can design the look of your 
shoe and make it exactly as you wish. In addition, we can 
custom-fit your size and design the shoe's features accord- 
ing to your needs. 


The avatar was present on the remaining screens to assist in 
the purchase. When appropriate, we tailored the avatar's 
advice to be consistent with its personification (for a sample 
consultation from the attractive avatar condition, see 


Figure 3). 


Expert avatar condition. We designed expert avatars to 
appear older and nonathletic, and they wore eyeglasses. The 
avatar introduced itself by saying, “Hello, my name is Dr. 
Anne Schneider (Dr. Norbert Oswald). I have been a podia- 
trist for over 10 years and I would like to competently 
inform and advise you about our products. Take advantage 
of my experience and let me show you how our ergonomic 
shoes can be useful to you.” The avatar then began the con- 
sultation using a comfort-oriented statement that was con- 
sistent with the avatar’s personification: 


Every day, your feet are subjected to a great deal of strain, 

which all too often leads to discomfort. Fashionable 
shoes, unfortunately, are usually ergonomically unsatis- 

factory, and comfortable shoes often do not have the right 
look. Our medically tested shoes, on the other hand, can 
be specially made according to your needs regarding fit 
and features. Our shoes offer a healthy form and comfort. 
You can also choose your desired design. 





!In this design, we intentionally confound the avatar’s appear- 
ance and credentials and the content of the consultation to create 
an avatar personification factor. Thus, we cannot assess the inde- 
pendent influence of an avatar’s presence. In a follow-up study, we 
control for the content of the consultation and show the indepen- 
dent influence of the avatar. 

4We included an avatar gender manipulation in the design 
because industry contacts believed that avatar gender was a rele- 
vant factor. Avatar gender did not’exhibit a main effect or interact 
with the manipulated variables for any dependent measure. The 
stimulus materials were in German. The statements are transla- 
tions of German text. 


The avatar was present on the remaining screens, and we 
tailored its advice to be consistent with its personification. 


No-avatar control. The no-avatar control condition did 
not contain an avatar; thus, there was no introductory salu- 
tation. There was an attempt to keep the consultation in the 
no-avatar control equivalent to the consultation in the avatar 
conditions. We accomplished this by emphasizing both 
fashion and comfort at the beginning of the consultation: 


We are a company that offers customized leisure shoes. 
We will introduce our product idea on the following Web 
pages. When buying leisure shoes, customers can usually 
choose between trendy fashion shoes and comfortable 
sport shoes. Our shoes, however, can be individually made 
for each customer with regard to fit, features, and design. 


The information provided during the consultation was a 
blend of fashion and comfort information. 


Procedure 


We conducted the online experiment using a population of 
German shoppers. We recruited the participants using vari- 
ous online and offline procedures. The final sample size 
was 400 consumers, 55% of whom were men and 45% of 
whom were women. The participants were between ages 17 
and 74 years, and the median age was 24 years. Sixty-six 
percent of the participants were college graduates, implying 
a disproportionately high level of education relative to a 
representative sample of online consumers. In addition, the 
participants’ experience with the Internet and online shop- 
ping was above average compared with published statistics 
about the average Internet user. 

We began by recording demographic information. Next, 
we asked participants four questions about their use of sales 
associates when shopping. Then, participants saw an intro- 
duction to the experiment: 


On the following pages, a company that sells leisure shoes 
will introduce its products to you. The unique feature of 
the shoes is that they can be customized with respect to fit, 
features, and design. Please be aware that these are not 
actual Web pages for an online store, but rather pages that 
have been specifically designed for this experiment. You 
can use these pages to learn about design options, to have 
an online consultation, and to calculate a price for your 
personalized shoes. 


After the introduction, participants were greeted by the 
avatar if they were in an avatar condition, received the intro- 
ductory statement consistent with their condition, and were 
taken through the customized shoe-shopping experience. 
During the customization process, the participant could 
select different visual and functional features to create a 
personalized shoe. Participants were asked to select a pro- 
cedure for fitting, the outside shoe material, the inside shoe 
material, and the soles. The avatar consultant discussed how 
to make a specific design decision and provided a recom- 
mendation before each decision. After the consultation, a 
randomly generated price of $92.40, $94.40, or $96.40 was 
provided. Within an avatar condition, the avatar used the 
same text and made the same recommendations for each 
participant. The participants were not required to follow the 
avatar’s recommendations. The participants had the oppor- 
tunity to recustomize the shoe at the end of the process 


Online Consumer Shopping Behavior / 23 


FIGURE 3 
Study 1: Avatars and Sample Consultation 
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B: Consultation Using Appearance-Orlented Product Positioning Text 
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(thus the random assignment of price), though few chose to 
do so. The participants were not asked to make an actual 
purchase. 

Participants then responded to a series of dependent 
measures. Participants reported their involvement with the 
shoe purchase, perceptions of avatar attractiveness and 
expertise, perceptions of the Web site’s entertainment value 
and information value, perceptions of the Web site’s graphic 
design, perceptions of avatar likeability and credibility, per- 
ceptions of retailer credibility, satisfaction with the retailer, 
attitude toward the product, and purchase intention. We 
measured all items with seven-point Likert scales, anchored 
by “completely disagree” and “completely agree.” The indi- 
vidual items used to measure each construct and the Cron- 
bach’s alpha for each scale appear in Appendix A. 


Tests of H4—H; 


H;: Avatar effects on satisfaction, attitude, and pur- 
chase intention. The means of the satisfaction-with-the- 
retailer, attitude-toward-the-product, and purchase-intention 
dependent variables appear in Table 1.3 As H, predicted, 
participants who saw an avatar were more satisfied with the 
retailer (M = 3.91) than those who did not see an avatar 
(M = 3.41; F(1, 397) = 9.19, p < .01), had a more favorable 
attitude toward the product (M = 4.36) than those who did 
not see an avatar (M = 3.86; F(1, 397) = 7.31, p < .01), and 
had a higher purchase intention (M - 3.74) than those who 
did not see an avatar (M = 3.00; F(1, 397) = 9.16, p < .01). 


H»: Avatar effects on perception of the Web site. H» pre- 
dicted that avatars would facilitate Web-based purchasing 
because they make the Web site seem more entertaining and 
informative. Participants who saw an avatar rated the Web 
site as more entertaining (Ma, = 4.22, Mao avatar = 3.55; 
F(1, 397) = 13.45, p < .01) and informative (Mayatar = 4.48, 
Moo avatar = 3-99; F(1, 397) = 9.12, p < .01) than participants 
who did not see an avatar. 


3The smaller cell sizes for the control group are a consequence 
of a communication problem with the Web site programmer. The 
programmer considered gender an experimental rather than a 
counterbalance factor; thus, the male avatar, female avatar, and 
control conditions have similar cell sizes. 


H3: The mediating effects of Web site perception. H 
predicted that the increase in the perceived entertainment 
value and information value of the Web site would mediate 
the influence of the avatar's presence on satisfaction with 
the retailer, attitude toward the product, and purchase inten- 
tion. Evidence for complete mediation requires (1) an effect 
of the independent variable (i.e., avatar presence) on the 
dependent variable (i.e., satisfaction with the retailer, atti- 
tude toward the product, and purchase intention), (2) effects 
of the independent variable (i.e., avatar presence) on the 
mediators (i.e., entertainment, informativeness), and (3) a 
nonsignificant effect of the independent variable when the 
mediators and independent variable are regressed on the 
dependent variable (Baron and Kenny 1986). As we sum- 
marize in Table 2, the presence of an avatar significantly 
influenced satisfaction with the retailer (t(398) = 3.03, p « 
.05), attitude toward the product (t(398) = 2.71, p « .05), 
and purchase intention (t(398) = 3.03, p « .05). The pres- 
ence of the avatar also influenced the entertainment value 
(t(398) = 3.67, p « .05) and the informativeness (t(398) = 
3.02, p « .05) of the Web site. Finally, the inclusion of 
entertainment value and avatar presence in a regression 
made the avatar's presence a nonsignificant predictor of sat- 
isfaction with the retailer (t(397) = 1.16, p > .05), attitude 
toward the product (t(397) = 1.14, p > .05), and purchase 
intention (t(397) = 1.52, p > .05). Likewise, the inclusion of 
informativeness and avatar presence in a regression made 
the avatar's presence a nonsignificant predictor of satisfac- 
tion with the retailer (t(397) = 1.39, p > .05), attitude toward 
the product (t(397) = 1.38, p > .05), and purchase intention 
(t(397) = 1.80, p > .05). These tests provide evidence for 
mediation (Baron and Kenny 1986). 


Tests of H, and H; 


We used the 322 participants in the attractive and expert 
avatar conditions to test H4 and Hs. The hypotheses address 
whether participants with different levels of involvement 
with the product being purchased were more responsive to 
an attractive or expert avatar. Given the continuous nature 
of the intrinsic involvement variable, we used ordinary least 
squares regression to test the hypotheses. We conducted 
follow-up tests of avatar effectiveness at different levels of 
involvement using an analysis of variance (ANOVA). We 


TABLE 1 
Study 1: Means 
Avatar Condition 
None Attractive Expert 
Dependent Measure (n = 78) (n = 159) (n = 163) 
Satisfaction with retailer 3.41 3.93 3.89 
Attitude toward product 3.86 4.38 4.34 
Purchase intention 3.00 3.86 3.60 
Web site entertainment 3.55 4.28 4.16 
Web site Informativeness 3.99 4.44 4.51 
Avatar attractiveness 2.91 2.29 
Avatar expertise 4.15 4.72 
Avatar likeability 4.24 4.09 
Avatar credibility 3.78 4.34 
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TABLE 2 
Study 1: Mediation Analysis for Effects of Avatar Presence - 








Test Variables t p 

AC Avatar satisfaction with retailer 3.03 «.01 
Avatar — attitude toward product 2.71 «.01 
Avatar — purchase intention 3.03 «.01 

AB Avatar — entertainment 3.67 «.01 
Avatar — Informativeness 3.02 l <.01 

Mediation Tests for Entertainment 

A B>C Entertainment, 13.53 <.01 

Avatar — satisfaction with retailer 1.16 .25 
Entertainment, 10.25 «.01 
Avatar attitude toward product 1.14 .26 
Entertalnment, 10.01 «.01 
Avatar — purchase intention 1.52 .13 

Medlatlon Tests for Informativeness 

A,B—»C Informativeness, 17.05 «.01 

Avatar satisfaction with retailer 1.39 .17 
Informativeness, 11.32 <.01 
Avatar attitude toward product 1.38 .17 
Informativeness, 10.48 «.01 
Avatar — purchase Intention 1.80 07 





created the levels of involvement using a tertiary split of the 
self-reported involvement scores. 


Manipulation check. We performed two types of 
manipulation checks. First, we confirmed that the avatar 
manipulations were effective. Participants perceived the 
attractive avatar (M — 2.91) as more attractive than the 
expert avatar (M = 2.29; F(1, 319) = 17.55, p « .05), and 
they perceived the expert avatar (M — 4.72) as more expert 
than the attractive avatar (M = 4.15; F(1, 319) = 15.16, p< 
.05). Second, we confirmed that involvement and type of 
avatar were orthogonal factors and thus could be used in the 
same regression analysis. The avatar manipulation did not 
influence the degree of involvement with the product 
(Mno avatar = 3.82, Mattractive = 3.65, Mexpert = 3.60; 
F(2, 397) = .73, p > .05). In addition, the degree of involve- 
ment with the product did not interact with perceptions of 
the attractiveness of the attractive avatar (F(1, 318) = .28, 
p > .05) or perceptions of the expertise of the expert avatar 
(F(1, 318) 2 .15, p » .05).4 


Review of hypotheses. We offered two general hypothe- 
ses about the effectiveness of different types of avatars. 
First, we predicted that the more attractive avatar would 
appeal to participants who were moderately interested in 


fThe test of the type of avatar x involvement interaction 
involved a comparison of a constrained model (Y = a + B x 
TYPE + B x INVOLVE) with an unconstrained model (Y = œ + 
B x TYPE + B x INVOLVE] + B x INVOLVE2), where TYPE was 
dummy-coded 0/1, INVOLVE was the involvement score, 
INVOLVETI was the involvement scores for the attractive treat- 
ment group (zero otherwise), and INVOLVE2 was the involve- 
ment scores for the expert treatment group (zero otherwise). 
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custom-fit shoes (H4,). The reason for this preference is that 
participants would perceive the avatar as more likeable 
(Hs,). Second, we predicted that the more expert avatar 
would appeal to participants who were highly interested in 
custom-fit shoes (Hap). The reason for this preference is that 
participants would perceive the avatar as more credible 
(Hay). This pattern of results required participants to be 
more sensitive to avatar attractiveness as involvement 
increased to a moderate level and more sensitive to avatar 
expertise as involvement increased to a high level. As we 
stated in H4, the response to the attractive avatar needed to 
show a quadratic form with increasing involvement, 
whereas the response to the expert avatar needed to show a 
linear form with increasing involvement (see Figure 2). 

H4: Satisfaction with the retailer. The first analysis 
investigated the type of avatar x involvement interaction for 
satisfaction with the retailer. Consistent with Hy, there was 
a type of avatar x involvement interaction with a quadratic 
influence of involvement across the attractive and expert 
conditions (F(1, 316) = 10.16, p « .05).5 A review of the 


SThe test of the type of avatar x involvement interaction 
involved a comparison of a constrained model that required a natu- 
ral log for involvement (Y = « + B x TYPE + B x INVOLVE] + 
B x INVOLVE2 + B x In[INVOLVE]) with a less constrained 
model (Y = a + 8 x TYPE + B x INVOLVEI + B x INVOLVE2 + 
B x InINVOLVEI) + B x InINVOLVE2], where TYPE was 
dummy-coded 0/1, INVOLVE was the involvement score, 
INVOLVE] was the involvement scores for the attractive group 
(zero otherwise), and INVOLVE2 was the involvement scores for 
the expert group (zero otherwise). Note that there was no type of 
avatar X involvement interaction with a linear influence of involve- 
ment across the attractive and expert conditions (F(1, 318) = .32, 
p > .05). 


beta coefficients showed a nonsignificant influence of the 
linear involvement term (t(316) = —1.13, p > .05) and a sig- 
nificant influence of the quadratic involvement term 
(t(316) = —.85, p < .05) in the attractive avatar condition. 
There was a significant influence of the linear involvement 
term (t(316) = 3.62, p < .05) and a nonsignificant influence 
of the quadratic involvement term (t(316) = —1.58, p > .05) 
in the expert avatar condition. 

To help illustrate this interaction, the data are shown 
using a tertiary split on the involvement independent 
variable (see Table 3). The data show that moderately 
involved participants were more satisfied with the retailer 
when they shopped using the attractive avatar, but highly 
involved participants were more satisfied when they 
shopped using the expert avatar. Although the regression 
analysis could not test these predictions at various levels of 
involvement, the tertiary grouping of participants by 
involvement level allowed for an ANOVA of simple effect 
tests, albeit with less power to detect a difference between 
the avatar conditions (MacCallum et al. 2002). The attrac- 
tive avatar (M = 3.10) and expert avatar (M = 3.11) resulted 
in equivalent levels of satisfaction at low levels of involve- 
ment (F(1, 316) = .01, p > .05). The attractive avatar (M = 
4.35) led to more satisfaction than the expert avatar (M — 
3.92) for moderately involved participants (F(1, 316) — 
3.45, p < .05).6 The expert avatar (M = 4.85) did not lead to 
more satisfaction than the attractive avatar (M — 4.55) for 
highly involved participants (F(1, 316) = 1.86, p > .05), 
though the means were consistent with predictions. In sum- 
mary, the regression analysis provided support for Hy, and 
the ANOVA provided support for Hy, but only directional 
support for Ha. 

Hy: Attitude toward the product. Consistent with Ha, 
there was a type of avatar x involvement interaction for atti- 
tude toward the product with a quadratic influence of 





6Given the directional hypotheses in the moderate- and high- 
involvement conditions, we used one-tailed tests. 


involvement across the attractive and expert conditions 
(F(1, 316) = 5.35, p < .05).7 The beta coefficients showed a 
nonsignificant influence of the linear involvement term 
(t(316) = —58, p > .05) and a significant influence of the 
quadratic involvement term (t(316) = 2.83, p < .05) in the 
attractive avatar condition. There was a significant influence 
of the linear involvement term (t(316) = 3.62, p « .05) and a 
nonsignificant influence of the quadratic involvement term 
(t(316) = —50, p > .05) in the expert avatar condition. 

Using the tertiary split on the involvement independent 
variable (see Table 3), the attractive avatar (M = 3.37) and 
the expert avatar (M = 3.43) resulted in an equivalent atti- 
tude toward the product at low levels of involvement 
(F(1, 316) = .10, p > .05). The attractive avatar (M = 4.68) 
led to a more positive attitude toward the product than the 
expert avatar (M = 4.22) for moderately involved partici- 
pants (F(1, 316) 2 3.78, p « .05). The expert avatar (M = 
5.64) led to a marginally more positive attitude than the 
attractive avatar (M — 5.28) for highly involved participants 
(F(1, 316) = 2.41, p = .06). In summary, the regression 
analysis provided support for H4, and the ANOVA provided 
support for H4a and marginal support for Hyp. 

H4: Purchase intention. Consistent with Hy, there was a 
type of avatar x involvement interaction for purchase inten- 
tion with a unique quadratic influence of involvement in the 
attractive and expert conditions (F(1, 316) = 5.94, p < .05).° 
The beta coefficients showed a nonsignificant influence of 





7TNote that there was no type of avatar x involvement interaction 
for attitude toward the product with a linear influence of involve- 
ment across the attractive and expert conditions (F(1, 318) = .19, 
p > .05). 

8Note that there was no type of avatar x involvement interaction 
for purchase intention with a linear influence of involvement 
across the attractive and expert conditions (F(1, 318) = 95, p> 


.05). 


TABLE 3 
Study 1: Means by Level of Involvement 


isch thd ——— 


Dependent Varlable and Condition n 
Satisfaction with Retaller 
Low involvement 143 
Moderate involvement 123 
High Involvement 134 
All : 400 
Attitude Toward Product l 
Low involvement 143 
Moderate involvement 123 
High involvement 134 
All 400 
Purchase Intention 
Low involvement 143 
Moderate involvement 123 
High involvement 134 
All 400 


Avatar 

None Attractive Expert 
2.37 3.10 3.11 
3.31 4.35 3.92 
4.16 4.55 4.85 
3.41 3.93 3.89 
2.51 3.37 3.43 
3.95 4.68 4.22 
4.68 5.28 5.84 
3.86 4.38 4.34 
1.44 2.52 2.59 
2.91 4.15 3.48 
4.07 4.61 5.02 
3.00 3.86 3.60 


I ——————————— 
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the linear involvement term (t(316) = —.09, p > .05) and a 
significant influence of the quadratic involvement term 
(t(316) = 2.01, p < .05) in the attractive avatar condition. 
There was a significant influence of the linear involvement 
term (t(316) = 3.76, p « .05) and a nonsignificant influence 
of the quadratic involvement term (t(316) = —1.41, p > .05) 
in the expert avatar condition. 

Using the tertiary split on the involvement independent 
variable (see Table 3), the attractive avatar (M — 2.51) and 
the expert avatar (M = 2.59) resulted in an equivalent pur- 
chase intention at low levels of involvement (F(1, 316) — 
.07, p > .05). The attractive avatar (M = 5.15) led to more 
positive purchase intentions than the expert avatar (M = 
3.41) for moderately involved participants (F(1, 316) — 
5.46, p « .05). The expert avatar (M — 5.02) led to a mar- 
ginally more positive purchase intention than the attractive 
avatar (M = 4.61) for highly involved participants 
(F(1, 316) = 2.21, p = .07). Again, the regression analysis 
provided support for H4, and the ANOVA provided support 
for H44 and marginal support for Hay. 

H5,. Hs, predicted that avatar likeability would mediate 
the persuasiveness of the attractive avatar relative to the 
expert avatar. As the test of H44 shows, the attractive avatar 
was more persuasive than the expert avatar at moderate lev- 
els of involvement. Thus, we used the 97 participants who 
viewed an avatar and were moderately involved in the prod- 
uct category in the mediation analysis. As we summarize in 
Table 4, the presence of an attractive, as opposed to an 
expert, avatar significantly influenced satisfaction with the 
retailer (t(96) = 1.85, p = .07), attitude toward the product 
(t(96) = 2.12, p « .05), and purchase intention (t(96) = 2.16, 
p « .05). The attractive avatar was more liked (t(96) — 1.83, 


p = .07) but was not more credible (t(96) = 1.34, p > .05) 
than the expert avatar. Finally, the inclusion of likeability 
and type of avatar in a regression made the type of avatar a 
nonsignificant predictor of satisfaction with the retailer 
(t(95) = 1.11, p > .05), attitude toward the product (t(95) = 
1.54, p > .05), and purchase intention (t(96) = 1.62, p > 
.05). Sobel tests showed that avatar likeability was only a 
partial mediator of the type of avatar effect. Table 5 also 
reports nonsignificant tests of avatar credibility as a media- 
tor, as we expected. 

Hsp. Hsp predicted that avatar credibility would mediate 
the persuasiveness of the expert avatar relative to the attrac- 
tive avatar. As the test of Hy, shows, the expert avatar was 
marginally more persuasive than the attractive avatar at high 
levels of involvement. Thus, we used the 101 participants 
who viewed an avatar and were highly involved in the prod- 
uct category in the mediation analysis. As we summarize in 
Table 3, the presence of an expert, as opposed to an attrac- 
tive, avatar did not influence satisfaction with the retailer 
(t(100) = 1.56, p = .12), influenced attitude toward the prod- 
uct (t(100) = 1.96, p « .05), and marginally influenced pur- 
chase intention (t(100) = 1.69, p « .05). The expert avatar 
was more credible (t(100) = 3.96, p « .05) but was not more 
liked (t(100) = .79, p > .05) than the attractive avatar. 
Finally, the inclusion of credibility and type of avatar in a 
regression made the type of avatar a nonsignificant predic- 
tor of satisfaction with the retailer (t(99) = .30, p > .05), 
attitude toward the product (t(99) = .67, p > .05), and pur- 
chase intention (t(99) = .97, p > .05). Sobel tests showed 
that avatar credibility was only a partial mediator of the 
type of avatar effect. Table 3 also reports nonsignificant 
tests of avatar likeability as a mediator, as we expected. 


TABLE 4 
Study 1: Medlation Analysis for Effects of Avatar Type 





Moderate Involvement 


High Involvement 


Test Varlables 

AC Type of avatar — satisfaction with retailer 
Type of avatar — attitude toward product 
Type of avatar — purchase Intention 

AB Type of avatar  llkeability 


Type of avatar — credibility 


Mediation Tests for Likeabllity 


ABC 


Ltkeability, 

Type of avatar — satisfaction with retaller 
Likeability, 

Type of avatar > attitude toward product 
Ltkeability, 

Type of avatar > purchase Intention 


Mediation Tests for Credibility 


ABC 


Credibility, 

Type of avatar — satisfaction with retailer 
Credibility, 

Type of avatar — attitude toward product 
Credibility, 

Type of avatar — purchase intentlon 


Notes: n.s. = not significant. 
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Predicted t p Predicted t p 
Significant 1.85 .07 Significant 1.56 .12 
Significant 2.12 .04 Significant 1.96 .05 
Significant 2.16 03 Slgnificant 1.69 .09 
Significant 1.83 .07 n.s. .79 43 
n.s. 1.34 17 Significant 3.96 .00 
Significant 5.09 .00 None 4.39 .00 
n.s. 1.11 .27 Significant 1.35 .18 
Significant 3.60 .01 None 2.90 .01 
n.s. 1.54 .13 Significant 1.79 .08 
Significant 3.27 .00 None 1.72 .09 
n.s. 1.62 11 Significant 1.59 17 
None 5.33 .00 Significant 3.34 .00 
Significant 2.83 .01 n.s. .30 .76 
None 3.02 .00 Significant 3.37 .00 
Significant 2.61 .01 n.s. .67 54 
None 2.97 .00 Significant 169 .10 
Significant 2.64 .01 n.s. .97 .33 


Discussion 


Study 1 provides three compelling results. First, the simple 
inclusion of an avatar on the screens of a Web-based shop- 
ping site increased consumer satisfaction with the retailer, 
made the customer’s attitude toward the product more 
favorable, and increased the customer’s purchase intention 
(Hj). Second, the perceived entertainment value and infor- 
mativeness of the Web site mediated the influence of the 
avatars on the key dependent measures (H, and H3). Third, 
the responsiveness to a particular type of avatar depended 
on the type of avatar and the consumer's involvement with 
the product category (H4). Attractive avatars were more per- 
suasive when consumers were moderately involved in the 
purchase (H,,), whereas expert avatars were marginally 
more persuasive when consumers were highly involved in 
the purchase (Hay). These results are consistent with many 
empirical results showing that differential characteristics of 
persuasion agents are effective at different levels of cus- 
tomer involvement with the purchase. 

There are two unresolved issues in the results of Study 
1. First, our effort to personify the avatars using an exter- 
nally valid procedure allows for an alternative explanation 
of the avatar effects (e.g., H,-H3) observed in Study 1. It 
may be that the consultation in the no-avatar condition was 
of poorer quality than the consultations in the avatar condi- 
tions. Recall that the consultation in the no-avatar condition 
was a combination of fashion and comfort statements, 
whereas the consultation in the attractive (expert) avatar 
condition was fashion oriented (comfort oriented). To 
address this problem, we replicated Study 1 using an avatar 
type (no avatar, attractive avatar, or expert avatar) x consul- 
tation type (fashion, comfort, or fashion and comfort) 
design. We randomly assigned 659 participants to one of the 
nine conditions. The results of this follow-up study repli- 
cated all the avatar-presence results (i.e., H; and H5) and the 
mediation analysis results (H3) of Study 1.9 More impor- 
tant, there was no interaction of the consultation type with 
the avatar type on any of the dependent measures. Thus, it 
appears that the avatar and its credentials, not the content of 
the consultation, are responsible for the effects we observed 
in Study 1. 

The second issue pertains to the source of the involve- 
ment effects (H4 and Hs) observed in Study 1. The data 
show that attractive avatars are more persuasive than expert 
avatars at moderate levels of involvement and that expert 
avatars are more persuasive than attractive avatars at high 
levels of involvement (Hy, and H4). Yet likeability only 
partially mediates the type of avatar effect at moderate lev- 
els of involvement, and credibility only partially mediates 
the type of avatar effect at high levels of involvement (Hs). 
We expect that there are two reasons for partial mediation. 
First, the effects of attractiveness and expertise at their 
respective levels of involvement are modest (i.e., p values 
range from .03 to .12). If a relationship between an inde- 
pendent and a dependent variable is modest, the ability of a 
mediator to explain this relationship is limited. A solution to 
this problem may be to increase the statistical power of the 


9The results are available from the second author. 


test. Power ranged from .32 to .57 in the tests of Hy, and 
Hay, well below the recommended levels of .80. Second, the 
manipulations of attractiveness and expertise were not inde- 
pendent in Study 1. Although this did not create a problem 
for testing H,—H3, in which we treated avatars as replicates, 
it may have caused a problem for testing Hs. For example, 
at moderate levels of involvement, participants perceived an 
attractive avatar as more attractive than an expert avatar but 
also as less credible than an expert avatar. This means.that 
the likeability score for the attractive avatar could be influ- 
enced in opposite directions by attractiveness and expertise 
(.e., the likeability score could be biased downward). Con- 
sequently, the mediation tests involving likeability could 
have been compromised. A similar phenomenon could have 
biased the credibility ratings of the expert avatar, compro- 
mising the mediation tests involving credibility. Study 2 
investigates H4 and Hs using an experimental design that 
controls for these potential biases. 


Study 2 


Study 2 investigated the influence of avatar attractiveness 
and avatar expertise on persuasion at different levels of 
intrinsic involvement with a product. The experiment used a 
level-of-attractiveness (low and high) x level-of-expertise 
(low and high) factorial design along with a measure of the 
consumer's intrinsic involvement with the product. We 
expected that increases in avatar attractiveness would be 
most effective for moderately involved consumers and that 
the degree of persuasion would be mediated by the likeabil- 
ity of the avatar. Similarly, we expected that increases in 
avatar expertise would be most effective for highly involved 
consumers and that the degree of persuasion would be 
mediated by the credibility of the avatar. 


Stimuli and Procedure 


The stimuli and procedure were similar to Study 1. The pri- 
mary changes involved orthogonal manipulations of avatar 
attractiveness and avatar expertise. For example, one possi- 
bility was to attempt to draw avatars that would be per- 
ceived as low on both factors, high on both factors, or a mix 
of levels across the factors. Pretests revealed that it was dif- 
ficult to draw avatars that unambiguously fit these four pro- 
files. Thus, we decided to manipulate avatar attractiveness 
using the avatar's visual features and avatar expertise using 
the avatar's credentials and style of consultation. First, we 
redesigned the female avatars to create a stronger manipula- 
tion of attractiveness.10 Second, we rewrote the consultation 
to be consistent with a low-expertise or high-expertise con- 
sultant (see Appendix B). In summary, the experiment was 
an attractiveness (low and high) x expertise (low and high) 
between-subjects design. In all other respects, the procedure 
was identical to Study 1. 





10We used only female avatars in this study 
lack of a gender effect in Study 1 and (2) the expe 
with redesigning the avatars. 
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Results 


We recruited participants from a population of German 
online shoppers. The final sample size was 596 consumers, 
48.2% of whom were men and 51.8% of whom were 
women. The participants were between 14 and 80 years of 
age, and the median age was 28 years.!! Seventy-four per- 
cent of the participants were college graduates, and the par- 
ticipants’ experience with online shopping was above aver- 
age compared with the average Internet user. 


Manipulation check. We performed two types of 
manipulation checks. First, we confirmed that the manipu- 
lations were effective. Participants perceived the attractive 
avatar (M = 4.07) as more attractive than the unattractive 
avatar (M = 2.06; F(1, 592) = 465.05, p « .01), and they per- 
ceived the expert avatar (M = 4.65) as more expert than the 
nonexpert avatar (M = 3.43; F(1, 592) = 118.23, p < .01). 
Second, we confirmed that involvement was orthogonal to 
the experimental manipulations; thus, both factors could be 
used in the same regression analysis. The avatar attractive- 
ness manipulation influenced the degree of involvement 
with the product (Mattmachve = 3.54, Monattractive = 3-29; FC, 
592) = 4.57, p < .05), but this effect was small compared 
with the variability in involvement (i.e., less than 2%). The 
avatar expertise manipulation did not influence the degree 
of involvement with the product (Mexpen = 3.41, 
Maonexpert = 3-42; F(1, 592) = .04, p > .05). 


Review of hypotheses. We offered two general hypothe- 
ses about the effectiveness of different types of avatars. 
First, we expected that the more attractive avatar would 
appeal to participants who were moderately interested in 
custom-fit shoes (Hy,). The reason for this preference is that 
participants would perceive the avatar as more likeable 
(H;,). Second, we expected that the more expert avatar 
would appeal to participants who were highly interested in 


11The ten shoppers, ages 14~16, had Internet shopping experi- 
ence, so we retained them in the sample. 


custom-fit shoes (H4,). The reason for this preference is that 
participants would perceive the avatar as more credible 


(Hay). 


Initial analyses. Similar to Study 1, the initial analysis 
used regressions to test H4. We anticipated that the impact 
of the attractiveness manipulation would be greater for 
moderately involved consumers than for mildly or strongly 
involved consumers. Although the means were consistent 
with this prediction (see Table 5), tests for a quadratic influ- 
ence of involvement across the two attractiveness condi- 
tions were not significant for satisfaction with the retailer 
(t(591) = .76, p > .05), attitude toward the product (t(591) = 
1.02, p > .05), or purchase intention (t(591) = .56, p > 
.05).12 In general, the attractiveness manipulation was so 
strong that it had an influence at all levels of involvement. 
We anticipated that the impact of the expertise manipulation 
would become greater as involvement increased. Tests for 
an expertise x involvement interaction were significant for 
satisfaction with the retailer (t(591) = 5.29, p < .01), attitude 
toward the product (t(591) = 3.70, p « .01), and purchase 
intention (t(591) = 3.87, p < .01). 


H4a: Moderate involvement. H4, predicted that at mod- 
erate levels of involvement, there would be a main effect for 
the attractiveness manipulation but not for the expertise 
manipulation. The data are shown using a tertiary split on 
the involvement independent variable (see Table 5). People 
who viewed an attractive avatar were more satisfied with 
the retailer (Magus = 3.75 [e.g., Ms = 3.54 and 3.96], 
Monattractive = 3-13; F(1, 590) = 18.58, p < .01), had a more 
positive attitude toward the product (Mattractve = 4.53, 
My nattractvve = 3-90; F(1, 590) = 18.57, p < .01), and had a 
more positive purchase intention (Mattractve = 3-58, 


12Note that the attractiveness manipulation shows a log relation- 
ship to involvement in the attractive-nonexpert condition. This 
condition is the conceptual equivalent of the attractive avatar con- 
dition in Study 1. 





TABLE 5 
Study 2: Means by Level of Involvement 
Dependent Variable Unattractlve Unattractive Attractive Attractive 
and Condition n Nonexpert Expert Nonexpert Expert 
Satisfactlon with Retaller 
Low involvement 201 2.60 2.60 3.03 2.94 
Moderate involvement 201 3.21 3.03 3.54 3.96 
High involvement 194 3.37 4.23 3.82 4.82 
All 596 3.03 3.23 3.51 3.93 
Attitude Toward Product 
Low involvement 201 2.76 2.87 3.01 3.41 
Moderate Involvement 201 3.75 4.08 4.55 4.51 
High Involvement 194 4.72 5.16 4.86 5.42 
All 596 3.67 3.95 4.27 4.47 
Purchase Intention 
Low involvement 201 1.78 1.89 1.67 2.12 
Moderate Involvement 201 3.07 2.82 3.79 3.37 
High involvement 194 4.00 4.22 4.09 4.95 
All 596 2.87 2.89 3.36 3.53 
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Munattractive = 2.95; F(1, 590) = 12.74, p < .01). People who, 


viewed an expert avatar were not more satisfied with the 
retailer (Me. = 3.52, Mponexpert = 3.36; F(1, 590) = 1.07, 
p > .05), did not have a more positive attitude toward the 
product (Mexpert = 4.30, Moonexpert = 4.13; F(1, 590) = 1.35, 
p > .05), and did not have a more positive purchase inten- 
tion (Mexpert = 3.11, Mhonexpert = 3.41; F(1, 590) = 2.79, p > 
.05). The interaction between avatar attractiveness and 
expertise was not significant for retailer satisfaction 
(F(1, 588) = 3.74, p > .05), attitude toward the product 
(F(1, 588) = 1.62, p > .05), and purchase intention 
(F(1, 588) = .27, p > .05). 

H5a' Moderate involvement. Hs, predicted that avatar 
likeability would mediate the persuasiveness of the attrac- 
tive avatar relative to the unattractive avatar. We used the 
201 participants who were moderately involved in the prod- 
uct category in the mediation analysis. As we summarize in 
Table 6, the presence of an attractive, as opposed to an unat- 
tractive, avatar significantly influenced satisfaction with the 
retailer (t(200) = 4.31, p < .01), attitude toward the product 
(t(200) = 4.31, p < .01), and purchase intention (t(200) = 
3.57, p < .01). The second test showed that the attractive 
avatar was more liked (M = 4.73) than the unattractive 
avatar (M = 3.59; t(200) = 6.46, p < .01). Finally, the inclu- 
sion of likeability and avatar attractiveness in a regression 
equation made the type of avatar a nonsignificant predictor 
of satisfaction with the retailer (t(199) = 1.76, p > .05) and 
attitude toward the product (t(199) = 1.82, p > .05), whereas 
purchase intention remained significant (t(199) = 2.38, p « 
.05). Sobel tests showed that avatar likeability was a signifi- 
cant mediator of avatar attractiveness for satisfaction with 


the retailer (t(199) = 4.32, p < .01), attitude toward the 
product (t(199) = 4.32, p < .01), and purchase intention 
(t(199) = 2.05, p « .05). 

H45: High involvement. H4, predicted that at high levels 
of involvement, there would be a main effect for the exper- 
tise manipulation but not for the attractiveness manipula- 
tion. The data are shown using a tertiary split on the 
involvement independent variable (see Table 5). People who 
viewed an expert avatar were more satisfied with the retailer 

— 4.56, M, = 3.60; F(1, 590) = 18.69, p « 
.05), had a more positive attitude toward the product 
(Mexpert = 5.30, Mnonexpert = 4:79; F(1, 590) = 11.76, p< 
.05), and had a more positive purchase intention (Mexpert = 
4.63, Mnonexpert = 4-04; F(1, 590) = 7.08, p < .05). People 
who viewed an attractive avatar were more satisfied with 
the retailer (Maurice = 4-37, Munattractive = 3-80; F(1, 590) = 
12.04, p « .01), did not have a more positive attitude toward 
the product (Mattractive = 5.17, Munattractve = 4-94; F(1, 
590) = 2.22, p > .05), and had a more positive purchase 
intention (Matractive = 4-57, Munattractive = 411; F(1, 590) = 
4.29, p < .01). The interaction between avatar attractiveness 
and expertise was not significant for retailer satisfaction 
(F(1, 588) = .50, p > .05), attitude toward the product 
(F(1, 588) = .18, p > .05), and purchase intention 
(F(1, 588) = 3.65, p > .05). 

Hs High involvement. Hs, predicted that avatar credi- 
bility would mediate the persuasiveness of the expert avatar 
relative to the nonexpert avatar. We used the 194 partici- 
pants who were highly involved in the product category in 
the mediation analysis. As we summarize in Table 6, the 
presence of an expert, as opposed to a nonexpert, avatar sig- 


TABLE 6 
Study 2: Medlation Analyses 


Test Variables 
Attractiveness Analysis 
AC Avatar attractiveness — satisfaction with retailer 


Avatar attractiveness — attitude toward product 


Avatar attractiveness — purchase intentlon 
Avatar attractiveness — Ilkeability 
>C Likeability, 


Avatar attractiveness — satisfaction with retailer 


Likeability, 


Avatar attractiveness — attitude toward product 


Likeability, 
Avatar attractiveness — purchase intention 


Expertise Analysis 
AC Avatar expertise — satisfaction with retailer 


Avatar expertise — attitude toward product 
Avatar expertise — purchase intention 
Avatar expertise — credibility 

Credibility, 

Avatar expertise — satisfaction with retailer 
Credibility, 

Avatar expertise — attitude toward product 
Credibility, 

Avatar expertise — purchase !ntention 


Notes: n.s. = not significant. 


AB 
ABC 


Moderate Involvement High Involvement 


Predicted t 


Predicted t p p 
Significant 4.31 .00 n.s. 3.47 .00 
Significant 4.31 .00 n.s. 1.49 14 
Significant 3.57 .01 n.8. 2.06 .04 
Significant 6.46 .00 None 5.32 .00 
Significant 5.99 .00 None 9.01 53 
n.s. 1.76 .08 None .63 .00 
Significant 5.81 .00 None 5.21 .00 
n.s. 1.82 .07 None .39 .70 
Significant 2.16 .03 None 4.66 .00 
n.s. 2.38 .02 None 35 14 
n.s. 1.03 .31 Significant 6.22 .00 
n.s. 1.16 27, Significant 3.43 .00 
n.s. 1.67 .10 Significant 2.66 .00 
None 3.33 .00 Significant 6.89 .00 
None 6.81 .00 Significant ^ 7.27 .00 
None .45 .65 n.s. 3:04 .00 
None 6.06 .00 Significant ^ 5.58 OO 
None A .87 n.s. .83 .41 
None 2.72 .01 Significant — 4.06 .00 
None 2.27 .02 n.s. .68 .50 
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nificantly influenced satisfaction with the retailer (t(193) = 
6.22, p < .01), attitude toward the product (t(193) = 3.43, 
p « .01), and purchase intention (t(193) = 2.66, p < .01). 
The second test showed that the expert avatar was more 
credible (M = 4.65) than the nonexpert avatar (M = 3.37; 
t(193) = 6.89, p « .01). Finally, the inclusion of expertise 
and avatar credibility in a regression equation made the type 
of avatar a nonsignificant predictor of attitude toward the 
product (t(192) = .83, p > .05) and purchase intention 
(t(192) = .68, p « .05), whereas satisfaction with the retailer 
remained significant (t(193) = 3.04, p > .05). Sobel tests 
showed that avatar credibility was a significant mediator of 
avatar expertise for satisfaction with the retailer (t(193) — 
5.01, p < .01), attitude toward the product (t(193) = 4.34, 
p < .01), and purchase intention (t(193) = 3.51, p < .01). 


Discussion 


In general, the results of Study 2 are consistent with H4 and 
Hs. At moderate levels of involvement with the product pur- 
chase, participants were sensitive to the attractiveness of the 
avatar but not to the expertise of the avatar. The likeability 
of the avatar mediated the influence of avatar attractiveness 
on satisfaction with the retailer, attitude toward the product, 
and purchase intention. At high levels of involvement with 
the product purchase, participants were sensitive to the 
expertise of the avatar. The perceived credibility of the 
avatar mediated the influence of avatar expertise on satis- 
faction with the retailer, attitude toward the product, and 
purchase intention. Unexpectedly, there was a consequence 
of avatar attractiveness at high levels of involvement. This 
effect may be an influence of the strong manipulation of 
avatar attractiveness. 


General Discussion 


The two studies suggest that avatars can enhance the effec- 
tiveness of a Web-based sales channel. Study 1 showed that 
adding an avatar to Web-based information increased the 
customer's satisfaction with the retailer, attitude toward the 
product, and purchase intention. A follow-up study showed 
that the response to the avatar was robust across different 
types of information content. The avatar itself, not the infor- 
mation provided by the avatar, was the key to persuasion. 
Study 2 showed that active attempts to manipulate the 
attractiveness and expertise of the avatar made the avatar 
more persuasive for certain segments of shoppers. In gen- 
eral, making the avatar more attractive was effective across 
all levels of involvement. Making the avatar more expert 
was effective only at high levels of involvement. Study 2 
also provided evidence of mediation. The attractiveness of 
the avatar influenced perceptions of likeability, and likeabil- 
ity mediated the influence of the avatar’s attractiveness on 
persuasion. Similarly, the expertise of the avatar influenced 
perceptions of credibility, and credibility mediated the 
influence of the avatar’s expertise on persuasion. 

The results raise questions about persuasion in a Web- 
based environment. People are often aware of attempts at 
persuasion and have developed several strategies for resist- 
ing persuasion (Jacks and Cameron 2003; Knowles and 
Linn 2004). First, when people identify an attempt at per- 
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suasion, they can discount the message (Gruder et al. 1978) 
or derogate the source (Tannenbaum, Macauley, and Norris 
1966; Zuwerink and Devine 1996). Second, people can use 
selective attention to attitude-congruent information or 
selective avoidance of attitude-incongruent information 
(Albarracín and Mitchell 2004). Third, people can have 
negative affective responses (e.g., irritation, anger) when 
exposed to a persuasive message, which leads to the rejec- 
tion of the message (Zuwerink and Devine 1996). Fourth, 
people can actively look for a fault in the persuasive mes- 
sage and generate counterarguments (Brock 1967; Killeya 
and Johnson 1998). Finally, people can bolster their own 
attitude by selectively generating or recalling information in 
support of their own viewpoint (Lewan and Stotland 1961; 
Lydon, Zanna, and Ross 1988). These latter two strategies 
are likely to produce enduring resistance. However, they 
require cognitive resources and motivation. 

Our two studies show that avatars can be effective 
instruments of persuasion, but further research needs to be 
conducted to answer the following questions: (1) To what 
extent do avatars have the potential to evade the common 
strategies consumers use to resist persuasive messages? and 
(2) What factors support consumer acceptance of avatars as 
helpful sales agents? In particular, two classes of factors 
should be taken into consideration. First, there are many 
options for the design of virtual characters in Web-based 
environment in terms of function, graphic design, and tech- 
nical organization. It is likely that the persuasive power of 
avatars varies significantly, depending on whether avatars 
deliver a testimonial or act as representatives of the com- 
pany, whether avatars are designed as more or less lifelike, 
and whether avatars provide one-way or reciprocal conver- 
sation. Second, recipient characteristics influence persua- 
sion. That is, different people have different motivations 
and abilities to process information and to resist persuasion. 

Finally, five research issues need discussion. First, in 
our procedure, we did not collect sales response data. Ide- 
ally, an investigation into avatar effectiveness would create 
a Web site with an avatar and a Web site without an avatar 
and show that sales are higher on an avatar-accompanied 
Web site. Second, we generated the results using a single 
product category in which a consultation was beneficial. We 
expect that the influence of an avatar will decline as the pur- 
chase process becomes easier, the product becomes simpler, 
and the buyer's knowledge of the product increases. Third, 
it could be argued that the avatar effects were novelty 
effects because few respondents had experienced avatar- 
accompanied Web sites. This may be true, but it is also the 
case that avatars varied in their effectiveness as a result of 
different levels of involvement. This interaction is inconsis- 
tent with a simple novelty effect. Fourth, we expect that 
avatars are effective because they enhance perceptions of 
reciprocal communication. Although our avatars engaged 
shoppers by taking turns conversationally with them, the 
exchange fell short of a truly reciprocal communication in 
which participants respond to the idiosyncratic conversation 
of their partner. Fifth, it could be argued that the results 
show no incremental conceptual insight relative to the exist- 
ing literature on sales force effectiveness. However, if this 
premise is accepted, it must also be acknowledged that 


avatars may have many of the same benefits as human sales 
agents. This was the goal of the article. 


Appendix A 
Questionnaire Items 


Product Involvement (o. = .90) 


*"For me, customized leisure shoes are (important) (fun) 
(exciting) (relevant)." 


Avatar Attractiveness (o = .90) 


“In my opinion, the virtual consultant is (attractive) (beauti- 
ful) (good looking)." 


Avatar Expertise (o. = .93) 


“In my opinion, the virtual consultant is (trained) (experi- 
enced) (knowledgeable)." 


Entertainment Value of the Web Site (o = .93) 


In your opinion, to what extent do the following statements 
about the content and design of the Web-site apply to you? 


e“T find the Web-site entertaining.” 
«^T like the Web-site.” 
«The Web-site is fun to use.” 


information Value of the Web Site (œ = .85) 


In your opinion, to what extent do the following statements 
about the content and design of the Web-site apply to you? 


“The information offered is useful." 
*“The information offered ıs understandable.” 
«“The information offered is sufficient.” 


Avatar Likeability (c. = .89) 


*"[n my opinion, the virtual consultant is (likeable) (friendly) 
(agreeable).” 


Avatar Credibllity (a = .94) 


eIn my opinion, the virtual consultant is (sincere) (compe- 
tent) (credible).” 


Satisfaction with the Retailer (œ = .94) 
“Considering everything that I have learned so far about the 
company and its products, my opinion is that" 

“The company fulfills my needs.” 

°“It would be advantageous for me to buy products from this 

company." 
e“T am delighted about the company and its products.” 
*"[ am satisfied with the company." 


Attitude Toward the Product (œ = .94) 


“I find the company's customized leisure shoes (useful) 
(agreeable) (favorable) (good)." 


Intention to Buy (o. = .95) 


eT can imagine buying a pair of customized leisure shoes 
from this company.” 

*"The next time I buy a pair of leisure shoes, I will take this 
company into consideration and have them make me an 
offer." 

*"[ am very interested in buying a pair of customized leisure 
shoes from this company." 


Appendix B13 


Low-Expertise Consuftation 

Introduction: Hi, my name is Kim. / I'm a new shoe sales- 
person and I am not yet very familiar with this merchandise. 
I can, however, tell you how to learn more about our awe- 
some shoes. / Our shoes can be made exactly as you want 
them. The shoes will have just the right style and they will 
be comfortable. Your feet will feel wonderful in them. Find 
out more about the advantages of custom-made shoes. 

Page 1: A person's appearance makes a strong first 
impression. Therefore, the appearance of a shoe is often the 
most important factor in a shoe purchase. Unfortunately, 
stylish shoes are often uncomfortable. There's nothing 
worse than sore feet at the end of a long day. We use a cer- 
tain technique for custom-fit production so you can get 
shoes that feel great and look the way you want them to. / 
Are you looking for a women's or men's shoe? 

Page 2: You can design your shoe by using the mouse to 
click on the right. Here is a quick tip. If there is something 
that you are unsure about, click on the “i” and an explana- 
tion will be provided. / To begin with, there is a shoe in 
white material that is already pretty well equipped. J recom- 
mend the Creative Design option for you, even though it 
can be a bit complicated if one is not a professional in using 
this design procedure. There are, after all, over 50 colors 
from which you can choose. Still, it's fun and the result ts 
sure to be a one-of-a-kind shoe. / You can use the Creative 
Design after you have selected all of the other options for 
your shoe. 

Page 3: The inside of the shoe can be made either out of 
a basic fabric or out of a waterproof material. / The soles are 
important for the comfort of the shoe. Therefore, you can 
now choose some features to make the soles of your shoes 
even better. This is important for people that spend a lot of 
time on their feet. / After you made your selections, I can 
calculate a price for you. 

Page 4: Thanks to our online store, I can offer your 
desired shoes at a very inexpensive price. / You have now 
completed the shoe consultation. We would like to ask you 
some questions about this experience. 


High-Expertise Consultation 


Introduction: Hello, my name is Dr. Anne Schneider. / I 
have a doctorate in sports science. I have been an orthope- 
dist and podiatrist for over 15 years. I would like to offer 
you comprehensive advice about our innovative leisure 
shoes. / These shoes can be made according to your needs. 


13A "/" represents a page break. 


Online Consumer Shopping Behavior / 33 


Our shoes provide optimal support to your feet and prevent 
back strain. Find out more about the advantages of custom- 
made shoes. 

Page 1: The fit of a shoe is a very important factor when 
buying shoes. Every day, your feet are exposed to enormous 
strain, which all-too-often causes discomfort. Orthopedi- 
cally defective shoes can cause idiopathic scoliosis as well 
as degenerative symptoms like arthritis. / Our shoes are 
custom-made using innovative technologies. They have 
been designed to provide optimal support to your foot struc- 
ture and thus prevent discomfort. The design is also based 
on your preferences. 

Page 2: You can design your shoe by using the mouse to 
click on the right. All options are explained by clicking on 
the info button “i”. / As a base model, our experts have 
developed a textile shoe in white with a high-quality config- 
uration. I recommend the 3-D Fit option for you. This 


option guarantees that the shoe will fit optimally and offer 
maximum wearing comfort. These are essential prerequi- 
sites for healthy and relaxed walking. 

Page 3: The material inside of the shoe can be made of 
a basic fabric or a special, water-resistant, high-tech mem- 
brane. / Because the quality of the soles determines the 
comfort and walking characteristics of the shoes, you can 
choose to upgrade to our high-quality standard sole. Silicon 
absorption, for example, reduces the incidence of back pain 
up to 20%. / After you made your selections, I will gladly 
calculate an exact price for your shoe. 

Page 4: Thanks to our online customization and a revo- 
lutionary production process, I can offer you customized 
shoes at a very attractive price. / You have now completed 
the shoe consultation. We would like to ask you some ques- 
tions about this experience. 


—————— ———————————Usás—————————— —— -——o 
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Creating a Market Orientation: A 
Longitudinal, Multifirm, Grounded 
Analysis of Cultural Transformation 


Market orientation is a foundation of marketing and is Increasingly important in other fields, such as strategic 
management. Research in marketing has identified the characteristics of market-oriented organizations. However, 
how organizations change to become more market oriented has received less attention. In this article, the authors 
conduct an in-depth, longitudinal, multifirm investigation of firms that have successfully created a market 
orientation. Grounded by this in-depth understanding, they develop a theoretical model to explain how firms create 
a market orientation. The model identifies four path-dependent stages of change. In contrast to current 
conceptualizations, the authors find that creating a market orientation requires dramatic changes to an 
organization's culture and the creation of organizationally shared market understandings. The findings offer new 
insights into how organizations develop a greater market orientation, organizational change, and the nature of 
market orientation, including the role of intraorganizational power and organizational learning in creating and 
sustaining a market orientation. 


T: concept of a market orientation has become Despite the great and growing interest in this marketing 
increasingly important to the study and practice of concept, research on how organizations become more mar- 
management. It is the central concept of marketing ket oriented is surprisingly limited. Much research on mar- 
(e.g., Kotler 2000), it has become increasingly relevant to ket orientation has focused on developing measures of a 
scholars in other fields such as management (e.g., Besanko, firm’s orientation and identifying antecedents and conse- 


Dranove, and Shanley 2000), and it is recognized as signifi- quences of a greater market orientation (e.g., Homburg and 
cant for organizational success (e.g., Collins and Porras Pflesser 2000; Kirca, Jayachandran, and Bearden 2005; 
1994). For example, Intei's chief executive officer (CEO) Kohli and Jaworski 1990; Kohli, Jaworski, and Kumar 


recently announced plans to transform the company so that 1993). These studies provide important insights into the 
“every idea and technical solution should be focused on dimensions of market orientation and its implications, but 
meeting customers’ needs from the outset” (Edwards 2005, they are cross-sectional analyses and therefore cannot offer 
p. 35). Organizations such as Intel presumably hope to reap insights into the dynamics of organizational change 
the rewards that research has shown to be associated with a (Jaworski and Kohli 1993). Using a prescriptive approach, 
greater market orientation (e.g., Narver and Slater 1990). Day (1999) develops a more expansive model of organiza- 


tional change to a greater market orientation. Consistent 

with previous work in marketing, the model focuses on cre- 
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what combination or sequence remains largely unexplored 
in the context of creating a greater market orientation. 

In this article, we explore the process through which 
organizations change to adopt a greater market orientation 
by studying a group of firms over time. Each of these firms 
was engaged in a process to create a greater market orienta- 
tion, though different firms were at different stages of the 
process. To understand the changes that each was undergo- 
ing and to deduce a common process, we studied each firm 
using the longitudinal-processual method of in-depth, quali- 
tative examination, including ethnographic observation, oral 
histories, and analysis of historical documents. To do SO, we 
relied on methods that are common in consumer behavior, 
anthropology, sociology, and organizational research (e.g., 
Glaser and Strauss 1967; Pettigrew 1990; Sherry and 
Kozinets 2001; Thompson, Locander, and Pollio 1989). On 
the basis of this analysis, we identified a common, longitu- 
dinal process that leads to a greater market orientation. We 
identified the stages of this process, the obstacles impeding 
progress from one stage to the next, and the characteristics 
of firms that successfully navigate the process. 

Our analysis reveals a four-stage process of organiza- 
tional change: (1) initiation, (2) reconstitution, (3) institu- 
tionalization, and (4) maintenance. We describe each of the 
four stages (and their linkages and implications) in more 
nuanced detail subsequently, but the process can be briefly 
summarized as follows: Members of the organization recog- 
nize a threat to the organization. Threats take many forms, 
but they are typically not widely recognized by the organi- 
zation. To address the threat, a group of empowered man- 
agers creates a coalition to plot the change process. The 
managers plan the complete transformation of the organiza- 
tion, mobilize the larger organization, and create a cultural 
shift through a process of reconnecting organization mem- 
bers with customers. Through this process, they build com- 
mon experiences and perspectives and, at the same time, 
build a growing consensus for more formal organizational 
changes. These formal changes follow informal ones and 
are used to sustain the new orientation of the organization. 

The process we identify has important implications for 
our understanding of creating a more market-oriented orga- 
nization and for the marketing concept in general. Previous 
work on market orientation identifies three factors as 
antecedents of a greater market orientation: top manage- 
ment focus, interdepartmental cooperation, and reward Sys- 
tems (Kirca, Jayachandran, and Bearden 2005). Each of 
these factors is believed to operate independently to 
enhance market orientation. Our analysis finds these to be 
among a larger set of factors that operate to create organiza- 
tional change. Moreover, we find that these factors are 
embedded in a larger, richer sequence of change processes. 
Our analysis describes this fuller range of factors and the 
sequential process in which they are embedded. For exam- 
ple, the change process we describe depends on the forma- 
tion of an elite group of top managers who are determined 
to change the organization in its most fundamental Ways, 
including the values of the organizational culture. We find 
that reward systems are an important element of the process 
but only after top management has created greater support 
for change throughout the organization. We describe each 
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stage in detail, examine the sequence of changes, and dis- 
cuss the implications of the richness revealed in the change 
process. 

Our results offer new insights into the process of organi- 
zational change. Scholars in organizational behavior, soci- 
ology, and other fields have proposed different mechanisms 
and modes through which organizations change, including 
evolutionary forces and political struggle (Van de Ven and 
Poole 1995; Zald and Berger 1978). However, which 
mechanisms or modes of change may or may not play a role 
in creating a market-oriented organization and how they do 
so remains unresolved. Our analysis describes which 
change mechanisms, which modes of change, and which 
sequences are part of creating a more market-oriented orga- 
nization. For example, we discuss how the process of for- 
mal change in organizational structure and rewards follows 
a process of mass mobilization led by senior management. 
Although the mechanisms we identify are clearly related to 
those previously identified, they differ in important Ways in 
the specific context we examine. We discuss these differ- 
ences and their implications. 

Our results offer new insights into the nature of a mar- 
ket orientation. Marketing scholars have debated whether 
market orientation is a set of behaviors or a culture (e.g., 
Deshpandé and Farley 1998; Homburg and Pflesser 2000; 
Narver and Slater 1998). Our analysis shows that market 
orientation rests fundamentally on cultural values. We iden- 
üfy six key values that are shared by the market-oriented 
organizations in our analysis. These values, which are 
shared throughout the organization, become central to the 
change process. Indeed, an early objective of people who 
transform organizations is to establish new values and 
norms for behavior. These values are central to a market ori- 
entation. Unlike previous studies, our analysis shows that 
the adoption of these values leads to a redistribution of 
power through the organization, a change in the nature of 
organizational learning, and a change in the organization's 
capabilities. Our analysis illustrates that the distribution of 
intraorganizational power and organizational learning are 
essential, overlooked characteristics of market-oriented 
firms. We explore the role of each of these elements in the 
creation of a market-oriented culture. 


Methodology and Research Design 


To discover how firms create a market orientation, we 
observed and analyzed firms seeking to create a greater 
market orientation. We conducted our research using the 
longitudinal-processual method (Pettigrew 1979, 1990). 
This approach combines in-depth, qualitative data collec- 
tion techniques (i.e., ethnographic observation, depth inter- 
views [in particular, oral histories], and historical docu- 
ments) with comparative analysis to develop a process 
model of change. The longitudinal-processual method com- 
pares each organization with itself at different stages of a 
chronological process and compares longitudinal progres- 
sions across organizations to develop insights into a com- 
mon change process. Organizational behavior research has 
used this approach to study multiyear processes, such as 
internal corporate venturing (Burgelman 1983) and the 


development of organizational cultures (Pettigrew 1979). 
Rather than specify possible change mechanisms a priori, as 
previous studies have done, our approach is distinctly 
inductive. Grounded by an in-depth understanding of how a 
sample of firms have changed over time, we develop a 
model for how firms become market oriented. On the basis 
of this analysis, we produce a generalized understanding 
that can be empirically verified in subsequent research 
(Deshpandé 1983; Glaser and Strauss 1967). 

The etbnographic investigation of behavior in complex 
organizations has a long and discontinuous history in the 
United States, dating back to the late 1920s. Periodic calls 
for the revival and sustaining of the tradition have arisen 
from within the social sciences (Whyte 1978) and business 
(Bonoma 1985) disciplines and have fueled recent inquiry 
both from sociologists and anthropologists who are inter- 
ested in business dynamics and from marketing scholars 
who are intrigued by a nonpositivist understanding of 
managerial motivation. Epistemological analyses, method- 
ological tutorials, and exemplary studies (Arnould and 
Thompson 2005; Arnould and Wallendorf 1994; Belk, 
Sherry, and Wallendorf 1988; Deshpandé and Webster 
1989; Sherry 1995, 1998; Sherry and Kozinets 2001) have 
established ethnography as part of marketing's canonical 
toolkit. Our investigation is part of this current wave of 
qualitative research into marketing. 


Research Process 


Figure 1 provides an overview of the way we implemented 
the longitudinal-processual method for this research. Our 
first step was to develop a typology of organizations that 
might offer the greatest insight into the process of creating 
a market orientation. Knowing that four to six firms would 
be the practical maximum we could work with given the 
longitudinal-processual method, we followed a theoretical 
sampling approach (Eisenhardt 1989; Strauss and Corbin 
1998). Assuming a simple longitudinal model of change, 
we considered firms at three stages: beginning the change 
process, currently changing, and having achieved a market 
orientation. We attempted to recruit two firms in each stage. 
We identified potential firms on the basis of academic and 
practitioner recommendations, reviews of the business 
press, and company documents. From that larger pool of 
firms, we generated a pool of willing candidates. 

. MediaCo and EquipmentCo (pseudonyms) were firms 
beginning the change process. Both firms recently launched 
initiatives with the explicit goal of creating a market orien- 
tation. EquipmentCo is a capital equipment manufacturer 
selling to business markets, and MediaCo develops commu- 
nication products and services for consumers on behalf of 
corporate and not-for-profit customers. 

We screened firms that were in the process of changing 
or had completed a change to a market orientation through 
key manager interviews. During this screening, we ascer- 
tained whether the firms experienced increases in market 
orientation over a period of time relative to Kohli and 
Jaworski's (1990, p. 6) definition of market orientation as 
the "organizationwide generation of market intelligence 
pertaining to current and future customer needs, dissemina- 
tion of the intelligence across departments, and organiza- 
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tionwide responsiveness to it" These interviews helped 
mitigate demand characteristics associated with question- 
naires and prevented contamination of prospective field 
sites. 

On the basis of these conversations, we recruited two 
firms that were in the process of change, BenefitsInc (a 
pseudonym) and Motorola Personal Communications Sec- 
tor (PCS), and two firms that had recently completed a 
transformation, Alberto-Culver and Marshfield Door- 
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Systems. BenefitsInc is an employee benefit company that 
was created in the 1980s through the merger of two smaller 
organizations. Within Motorola, Motorola PCS develops 
and manufactures wireless communication devices, which 
are typically sold to wireless carriers that resell them to 
consumers. Alberto-Culver is a multinational manufacturer 
and marketer of consumer and professional hair care and 
beauty products, such as Alberto VO5. Our research 
focused on the experiences of Alberto-Culver North Amer- 
ica. Marshfield DoorSystems is a privately held commercial 
architectural door manufacturer in Marshfield, Wis., which 
was a Weyerhaeuser subsidiary when it began its change 
efforts. 

Data collection included gathering oral histories, 
reviewing historical documents, and engaging in ethno- 
graphic observation.! We obtained oral histories by inter- 
viewing a cross-section of employees, both vertically 
(CEOs and shop floor workers) and horizontally (employ- 
ees from marketing, finance, engineering, and so forth). The 
majority of interviewees joined their organization before it 
began transforming to a market orientation. Oral history 
interviews lasted from 40 minutes to three hours and were 
audiotaped. We conducted more than 70 formal interviews 
during ten months of fieldwork, representing more than 120 
hours of audio recordings. 

We began interviews in an exploratory manner, focusing 
on each individual’s phenomenological interpretation of the 
firm’s transformation and current state (e.g., Glaser and 
Strauss 1967; Thompson, Locander, and Pollio 1989). When 
an informant shared an interpretation of the firm and how it 
had changed, we clarified ambiguities and summarized what 
we heard, providing an opportunity for informants to cor- 
rect or expound on anything we might have misunderstood. 
We then inquired about factual (e.g., a specific event) and 
interpretive (e.g., what the event meant) differences among 
informants within an organization. This provided infor- 
mants with an opportunity to address events they had for- 
gotten to mention, explain why different people within the 
same organization interpreted an event differently, and help 
us make sense of the changes occurring within each organi- 
zation. Having obtained informants’ phenomenological 
interpretations of change, we concluded the interviews by 
explaining the nature of the project in more detail, including 
what we were finding across firms. By sharing this informa- 
tion, we sought to ensure that we elicited any information or 
interpretation that might be relevant to understanding the 
transformation to a market orientation. Leveraging the 
discovery-oriented approach with the vast literature on how 
other types of organizational change occur, we moved from 
a phenomenological to a directed inquiry approach, ensur- 
ing that we explicitly considered prior suggestions of how 
firms might change to become more market oriented 
(Eisenhardt 1989; Strauss and Corbin 1998). 

We reviewed hundreds of historical documents across 
participating firms. External documents included annual 


1The first author conducted all data collection and analysis 
under the direction of his dissertation cochairs. We employ the 
pronoun “we” throughout the article to minimize distraction. 
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reports, company press releases, industry publications, and 
articles from both national and local media. Internal com- 
pany documents included memos, strategic plans, training 
materials, and marketing research. Historical documents 
provided context regarding activities and attitudes in firms 
before and during their transformations, as well as con- 
firmed and elaborated on oral histories. In addition, histori- 
cal internal documents provided deeper insight into the 
actual process of change at each firm. 

We conducted ethnographic fieldwork, observing and 
interacting with firms’ employees, customers, channels, 
suppliers, and consultants. We spent more than 40 days in 
the field, sitting in senior executive, marketing strategy, 
sales review, and consultant meetings and sharing break- 
fasts, coffees, and lunches with informants in company 
cafeterias, boardrooms, and local eateries. We spent time 
observing and speaking with employees in company work 
and break areas. We toured facilities and initiated conversa- 
tions with whomever we met. We even worked as Harley- 
Davidson employees at Daytona Bike Week, where we also 
joined informants for dinners and drinks. Participant obser- 
vation in the field gave insight into the lived experience of 
employees engaged in everyday activities. 

Data collection and analysis occurred over ten months. 
We managed interview transcriptions, field notes, and his- 
torical documents electronically, using QSR International’s 
NVivo software, which allowed us to code and analyze a 
vast amount of data across participating firms. Following 
standard practice for qualitative inquiry, we coded the data 
iteratively, seeking to find common themes across firms. We 
refined those themes on the basis of further fieldwork and 
ongoing discussion (Eisenhardt 1989; Strauss and Corbin 
1998). For the initial interviews, field notes, and historical 
documents, we selectively coded for anticipated change 
mechanisms (e.g., incentives) and openly coded other pas- 
sages that appeared to be important to the change process 
on the basis of informant recollections or our own observa- 
tions. As we added information from additional sources or 
incremental firms, we continued to refine our understanding 
of common processes across firms, which we captured by 
changes to our coding scheme (Eisenhardt 1989; Strauss 
and Corbin 1998). Using this approach, we developed an 
understanding of change at each firm. Simultaneously, we 
developed a common model of change to a market orienta- 
tion by comparing and contrasting our findings across firms 
in a discovery-based approach to theory development 
(Glaser and Strauss 1967). When we understood how a 
given firm had changed and our comparisons raised no fur- 
ther questions, we agreed that our understanding had 
reached saturation, and we concluded fieldwork with that 
firm (Glaser and Strauss 1967; Strauss and Corbin 1998). 

An advantage of the grounded theory methodology and 
a theoretical sampling paradigm is the flexibility to alter 
research design on the basis of an evolving understanding 
of the change process (Glaser and Strauss 1967; Strauss and 
Corbin 1998). For example, after a period of fieldwork, we 
identified the fourth stage of change, unanticipated in our 
original design: maintaining a market orientation. On the 
basis of this discovery, we recruited Harley-Davidson as a 
seventh participating firm. Harley-Davidson, a $4 billion 


company that provides motorcycles, branded products, and 
Services, underwent tremendous organizational change dur- 
ing the 1980s to become market oriented. 

During our fieldwork, it also became evident that Bene- 
fitsInc, EquipmentCo, and MediaCo were not generating, 
disseminating, or responding to market intelligence consis- 
tent with a market orientation (Kohli and Jaworski 1990), 
and they had not made any discernable progress toward cre- 
ating a market orientation. Change efforts at BenefitsInc 
were isolated to branding and brand management activities. 
Although change efforts at EquipmentCo and MediaCo 
began more than a year before our fieldwork, these efforts 
did not seem to be having much impact at either firm. 
Progress appeared to be impeded by a lack of sufficient 
political power to implement change and a lack of under- 
standing throughout each organization regarding the nature 
of the desired change. Nonetheless, an understanding of the 
change efforts provides a valuable contrast with firms that 
succeeded in creating a greater market orientation: 

We concluded our fieldwork when we achieved theo- 
retical model saturation, a point at which additional field- 
work appeared unlikely to change our model significantly 
(Glaser and Strauss 1967; Strauss and Corbin 1998). 
Finally, we prepared our findings and sent copies to all 
interviewees for member checks, both to help verify the 
accuracy of our representation and interpretation of each 
firm’s experiences and to provoke any additional insights 
our informants might harbor (Belk, Sherry, and Wallendorf 
1988; Lincoln and Guba 1985). 


Results: A Four-Stage Model 


Our analysis shows that the process of creating a market 
orientation occurs over four distinct stages: (1) initiation, 
(2) reconstitution, (3) institutionalization, and (4) mainte- 
nance. The stages are path dependent, and each stage 
includes multiple steps or activities, which we describe in 
this section. 


Stage 1: Initiation 


Given the vast literature suggesting the many advantages of 
a market orientation, investigations into the antecedents of a 
market orientation have overlooked what initiatés market 
orientation change efforts. Our research finds that change is 
precipitated by powerful stakeholders recognizing a threat, 
who then create coalitions to plan and implement change 
efforts. 

Recognition. The impetus for change was the recogni- 
tion of an external threat. One dimension of such a threat 
was financial. All firms failed to meet financial perfor- 
mance targets. For example, Marshfield DoorSystems was 
losing millions of dollars on $53 million in revenue the year 
the company began its transformation. Some firms faced 
additional challenges. Competitors or changing technology 
threatened their business models, their industry leadership 
position, or their organizations' legacies. For example, hav- 
ing created the cellular phone industry, Motorola saw its 
leadership eclipsed by Nokia in 1988. Losing the leading 
sales position was a blow to the essence of how employees 


viewed themselves and their roles within Motorola as a 
world-class engineering powerhouse. 

Within each participating firm, powerful organizational 
stakeholders (owners and/or senior executives) recognized 
these threats and began the process of transformation. The 
number of stakeholders initiating change varied across 
companies. In the cases of Marshfield DoorSystems and 
Alberto-Culver, a single executive initiated the process. At 
Harley-Davidson and Motorola PCS, a small group of pow- 
erful stakeholders began the process. In every case, stake- 
holders faced organizations that, for the most part, did not 
agree with their sense of how significant a change was 
needed or the nature of the change required. 

Preparation. In the preparation step, dissatisfied and 
powerful stakeholders initiated backstage preparations for 
the transformation, spearheaded by a new leader. In some 
cases, the new leader was also the stakeholder who recog- 
nized the need for change; in other cases, the stakeholders 
appointed a new leader. Preparation activities included 
development of a guiding coalition based on a shared set of 
values and agreement on a broad plan for change. Coalition 
membership was based on two criteria: a genuine respect 
for the ability of every organization member to contribute to 
the organization's future and a belief that market needs 
should drive organizational change. For example, a Weyer- 
haeuser corporate vice president appointed Bill Blanken- 
ship president of Marshfield DoorSystems. Before officially 
taking over, Bill Blankenship recruited Jerry Mannigel, a 
Weyerhaeuser general manager who was well regarded by 
local management and the union and with whom Bill 
Blankenship had previously worked. Jerry Mannigel then 
recruited other coalition members, including a successful 
production manager who had previously left Marshfield 
DoorSystems in frustration. 

Across firms, these guiding coalitions developed trans- 
formation plans that would engage the entire organization 
in the change effort. Plans focused on organization culture 
and process changes rather than on specific end-state goals 
or objectives, such as market share or return on assets. For 
example, four weeks after taking over at Marshfield Door- 
Systems, Bill Blankenship and Jerry Mannigel held a senior 
management meeting to outline a new set of values and 
norms expected of organization members. These included 
empathy, respect for others, collaboration, teamwork, and a 
market orientation. The plan for change included six 
process-focused change initiatives: (1) value proposition 
and value delivery systems, (2) communication systems, (3) 
safety, (4) continuous improvement, (5) organizational 
alignment, and (6) information technology. 

In contrast, firms that were failing to create a greater 
market orientation did not have guiding coalitions based on 
shared values or agreement on a plan among people 
expected to create change. Instead, leaders relied on exist- 
ing management structures, new hires, or consultants to cre- 
ate change, and their plans focused on improving their 
firms' marketing capabilities and cultures. Plans included 
creating new brand identities at BenefitsInc and Equip- 
mentCo, hiring experienced marketers to infuse marketing 
into the organization at BenefitsInc, creating a corporate 
marketing council to advise and oversee business-unit 
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activities at MediaCo, and marketing and cultural change 
training at MediaCo and EquipmentCo. 


Stage 2: Reconstitution 


Reconstitution is the organizationwide implementation of 
the plan for change. It comprises five steps: (1) demarca- 
tion, (2) value and norm development, (3) reconnection 
with the market, (4) removal of dissenters and hiring of 
believers, and (5) collaborative strategy. 


Demarcation. A milestone in the change process occurs 
when the guiding coalition presents its plan to the entire 
organization. Within participating firms, informants uni- 
formly mentioned the same milestone as the point at which 
people knew about the change program. Demarcation 
events were characterized by several common attributes: (1) 
They were public and open, and all organization members 
simultaneously witnessed the event; (2) the challenge to the 
firm was explained as authentic and compelling; (3) the 
plan for change and the values, norms, and behaviors the 
coalition sought to inure were clearly communicated; (4) 
the change process was framed as market focused, not mar- 
keting centric; (5) and the guiding coalition’s actions were 
showcased in harmony with the desired culture. 

The ceremony at Marshfield DoorSystems is an excel- 
lent example of the dramatic nature of such events. In Janu- 
ary 1994, Bill Blankenship and Jerry Mannigel had been in 
charge for three months when they called a companywide 
meeting. People were worried. No executive team had ever 
called a companywide meeting. People entering the 
makeshift hall inside the plant noticed that something was 
more than amiss; it was odd, out of context, and blatantly 
ridiculous. Railroad crossing posts—lights and all—flanked 
the podium where Bill Blankenship was standing. An old- 
fashioned, steam-driven passenger train, replicated on a 
banner that was strung up above and behind the podium, ran 
15—20 feet across and 3—5 feet high. Pasted on the locomo- 
tive was a picture of Bill Blankenship, as engineer, looking 
out the window. Clearly printed beneath the train were the 
words, "There's a new train in town.” People’s recollections 
of that day were consistent: 


I'll never forget the first meeting we had when he was 
introducing himself. He had this banner up,... [and then 
he said,] “There’s a new train in town. You can get on the 
train or you can choose not to. If you choose not to, when 
the train leaves the station, you won’t be on it.” 


It’s one of his classic lines. “There’s a new train in town” 
Some people are going to come along for the ride. Some 
people are going to drive. Some people are going to 
decide to stay at the station. And some people—it will be 
decided—they will stay at the station. You did have 
options. It’s your call. What do you want to do? Are you 
gonna come along? You gonna help us? 


The new train in town was really more about how we're 
all going to march to a similar step. Not necessarily the 
exact same one, [but] this is the direction we’re going. 
We're going to do business differently. The analogy with 
the train is: You can get on the train. You can get run over 
by the train. Every once in a while, the train stops and lets 
some people off, and you have a choice: What kind of a 
passenger you want to be? Everyone agreed they didn't 
want to be run over by it. So most people jumped on board 
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right away, but [the train] had to stop a couple of times 
and let people off. 


The train analogy captured people's attention, and the 
companywide meeting marked the first time senior manage- 
ment publicly outlined the transformation plan. That every- 
one was informed simultaneously was noteworthy on its 
own. At the same time, management opened the organiza- 
tion's books to the union, so the union could review Marsh- 
field DoorSystems's financial condition firsthand. More 
than eight years later, informants recalled the ceremony and 
related actions as a turning point for the company, when 
management unambiguously involved everyone to partici- 
pate in the transformation of the firm. 

EquipmentCo, MediaCo, and BenefitsInc did not have 
salient demarcation events. Instead, their firms communi- 
cated change efforts over time through formal channels; 
senior managers were notified first, followed by middle 
managers, and so forth. Informants recalled participating in 
specific workshops or training sessions, but they did not 
recall any overall plan for change, other than increasing 
organizational and/or marketing effectiveness. Finally, 
informants at the three firms that were failing to achieve a 
market orientation discussed how executives often acted in 
conflict with the espoused values and norms suggested by 
change efforts, and several senior managers publicly dis- 
agreed with official change efforts. 


Value and norm development. The guiding coalitions 
perceived their existing firms' cultures as major impedi- 
ments to organizational change and long-term success. 
Organization cultures before transformation were diverse, 
but they can be characterized as sharing the following 
attributes: a bureaucratic and internal focus; a reliance on 
historically successful approaches to solve new problems; 
highly structured routines parsed out by function and his- 
torical origin; employees identifying more with their func- 
tion, job class, location, or other subgroup than with the 
overall organization; low levels of trust between groups 
within the organization; a lack of a common understanding 
of what the firm was trying to accomplish and how; passive- 
aggressive behaviors due to internal norms to be nice, cou- 
pled with the simultaneous use of covert methods to accom- 
plish personal or group goals; and transactional leadership, 
defined by the extensive use of explicit rewards to manage 
employee behaviors. 

Recognizing these challenges, the guiding coalitions 
attempted to create a new set of organizational values that, 
in combination with subsequent activities, would create a 
more market-oriented culture. The number and nature of 
these values varied across firms, from Alberto-Culver with 
ten values (and the mnemonic “HOT CC FIRST” to help 
employees remember them all) to Harley-Davidson, which 
communicated its values implicitly through a focus on cus- 
tomers and internal cooperation. Among participating firms, 
we identified six common cultural values. We list these in 
Table 1. Following Schein’s (1985) organizational culture 
framework, we provide the basic assumption underlying 
each value and the behavioral norms emanating from each 
value as well. 

As an organizational value, the market as the raison 
d'&tre provides common meaning for all organization mem- 


TABLE 1 
Values, Assumptions, and Norms of Market-Oriented Firms 


Value " Assumption 


Market as the ralson d'étro 


We come together as an organization to 
serve the market and make a living. 


Behavioral Norms 


Every decision and action must consider 
how it affects the market. 


Collaboration Working together, we can achieve more, Work Is done collaboratively by teams. 
faster and better, than apart. Teams are jointly responsible for 
outcomes. 

Respect/empathy/perspective People are basically good and have Consider the perspectives, needs, 

taking reasons for their actions. training, expertise, and experlences of 
others when reacting to or interpreting 
their actions. 

Keep promises To succeed, everyone must do hls or her Each employee Is responsible for 

; part. following through on commitments to 
others. 

Openness Honestly sharing information, Proactively and honestly share 
assumptions, and motives allows others Information, assumptions, and motives 
to understand and effectively collaborate with others. 

with us. 
Trust Everyone is committed to the same goal. Trust that your fellow employees are 


Therefore, we can have positlve 
expectations about their intentions and 
behaviors. 


telling the truth and will follow through on 
commitments.  ' 


—————————————————————————————————À 


bers and is broadly inclusive, creating one socially con- 
structed in-group that encompasses all organization mem- 
bers. Such an inclusive in-group definition fosters positive 
attributions and behaviors among organization members 
and creates a cultural foundation for firms that allows for 
the ongoing adaptation of missions and visions in response 
to changing market conditions. In addition, we found the 
market as the raison d’&tre to be the central culthral value 
providing a rationale for the other five values of trust, open- 
ness, promise keeping, respect, and collaboration, all of 
which are required for an effective market orientation. As 
such, the combination of the six cultural values creates an 
organizational environment that is supportive of'organiza- 
tionwide collaboration, leveraging the experiences and 
capabilities of all members, creating a shared understanding 
of problems, allowing for the creation of more effective 
solutions to problems, and assisting in effective implemen- 
tation of solutions through tighter collaboration. 

The guiding coalitions used three methods to encourage 
adoption of these values and norms and, consequently, 
behaviors consistent with these values and norms. First, the 
guiding coalitions created change by explicitly 'modeling 
behaviors aligned with the desired values and norms. Sec- 
ond, they used rewards and recognition to encourage behav- 
iors that were aligned with the desired values and norms, 
and they penalized (including removal from the firm) fail- 
ure to adhere to the desired values and norms. Finally, they 
inculcated desired values and norms in organization mem- 
bers through participation in subsequent transformation 
activities. 

Beginning with demarcation, the guiding-coalition 
members' behaviors and organization rituals played an 


important role in changing firms' cultures. For example, fol- 
lowing the guiding coalition's leveraged buyout of Harley- 
Davidson from AMF, there were several symbols of man- 
agement's intention to create a culture of trust and 
collaboration focused on the needs of Harley-Davidson cus- 
tomers. These included sharing detailed financial informa- 
tion with the union, as one informant explained: 


[U]ltimately, when you know how desperate the situation 
is and you truly need assistance—whether it’s from the 
salaried workforce, or the union workforce, or your lend- 
ing institutions, or your dealer network, or the govern- 
ment, or whomever it may be ... to cut through the cyni- 
cism, I think you’ve got to show them the facts.... If 
you’re really in trouble and you need someone to extend a 
helping hand, whomever that may be, you damn well bet- 
ter be honest with them. The way you’re honest with them 
I$ opening up the books and sharing management respon- 
sibility, working with the union, [and] looking at the 
union as a fellow stakeholder. 


Other culturally consistent symbols at Harley-Davidson 
included the elimination of assigned parking at headquar- 
ters; participation of senior executives in demo rides, rallies, 
and local rides (alongside other employees); one cafeteria at 
headquarters that everyone had access to; and the policy 
that everyone needed to obtain their personal motorbikes 
through the dealer network as any customer would. 

The guiding coalitions also used rewards and recogni- 
tion to create cultural change. The most prevalent recogni- 
tion was interpersonal, with the guiding-coalition members 
showing approval, support, or disapproval of particular 
employee comments or actions in various venues, such as at 
Harley-Davidson demo rides, in Motorola PCS war-room 
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meetings, and on Marshfield DoorSystem’s factory floor. 
By demonstrating appropriate behaviors in front of the 
guiding-coalition members, employees signaled their adher- 
ence to change efforts, to which the guiding-coalition mem- 
bers reacted favorably by complimenting or paying atten- 
tion to employees. 

More public and permanent recognitions that tran- 
scended time and place were also used to engender cultural 
change. Alberto-Culver used public awards extensively, as 
one informant explained: 


Everybody goes into a room and the awards start getting 
handed out. The guys who are glued to their chairs and 
never getting up and getting an award with red faces are 
very, very identifiable. Then those that need a wheelbar- 
row to take all the awards back to their office are also very 
visible. So there is a separation between the haves and the 
have-nots. It ıs a very, very public thing. 


Regardless of the mix of methods organizations used, 
we found that their effectiveness at creating cultural change 
varied directly with their "authenticity." Authenticity refers 
to the perceived congruency among management actions, 
rituals, symbols, and the values and beliefs espoused by the 
guiding coalition. When discussing the change process with 
the guiding-coalition members at Harley-Davidson, Marsh- 
field DoorSystems, Motorola PCS, and Alberto-Culver, we 
found that they exhibited an almost evangelical passion and 
commitment to the change process and espoused cultural 
values. Their emotional commitment was unambiguous and 
entirely consistent with informant recollections. 

For BenefitsInc, a firm that was not realizing a market 
orientation, we found no cultural change efforts. Con- 
versely, MediaCo and EquipmentCo engaged in firmwide 
cultural change efforts, but these efforts were managed by 
an organizational development consulting firm (both com- 
panies used the same consulting firm). Al] employees were 
required to participate in cultural change training. However, 
informants perceived these formal cultural change efforts as 
insincere. For example, executives received three or more 
days of extensive training a year before the lowest-level 
employees received a half-day of training, contradicting the 
notion that all employees have something to contribute. 
Informants also noted that some executives openly ques- 
tioned the value of the training, and other executives 
claimed to support the desired culture but did not follow the 
espoused norms. One MedicaCo informant summarized his 
frustration: 

I think there's this big disconnect. You have those things, 
the values and visions of the company from [the CEO's] 
mouth. But that's about as far as they fall. I mean, what 
are those things? I mean, what do they really mean?... 
[M]onth, after month, after month, after month of culture 
transformation exercises and then different [planning] 1ni- 
tiatives,... that's where you create credibility gaps with 
senior leadership who are trying to lead. Because people 
say, “Wait a minute! This doesn't sync up! You tell me one 
thing but you're doing another." 


Reconnection with the market. Organization researchers 


assert that cultural change requires changing organization 
members' underlying assumptions about how the firm 
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works and what behaviors are appropriate (Schein 1985). 
Value differences between more effective and less effective 
organizations exist not in their labels but in what those val- 
ues mean and the appropriate norms and behaviors tied to 
them (Argyris 1990). The same appears to be true for creat- 
ing a market-oriented firm, the greatest challenge being to 
determine exactly which behaviors align with the market as 
the raison d'étre. 

For example, EquipmentCo invested heavily in cultural 
training compared with other firms in our study. Neverthe- 
less, organization members were vexed by the ambiguity of 
what a customer focus meant and, thus, what behaviors 
exhibited a customer focus. A sales manager disclosed, 
“I’m frustrated because I don't think ... we clearly under- 
stand the market that we’re trying to sell [our products] 
to.... We keep talking about this value customer. Who is 
that guy? What does he look like? You know? How does he 
act? I can't tell you.... My frustration really comes from I 
don't know what winning is. I don't know what winning in 
a value environment is. Because we haven't defined that. 
Tell me the 1500 guys that you want me to sell, and I'll sell 
'em, but I don't know which 1500 right now." 

Informants at Harley-Davidson, Marshfield Door- 
Systems, Alberto-Culver, and Motorola PCS recalled simi- 
Jar frustrations before their transformations. This suggests 
that an organizationally shared market understanding differ- 
entiates more market-oriented firms from those earlier in 
the change process and differentiates more market-oriented 
firms from the same firms before beginning the transforma- 
tion process. Thus, to create a greater market orientation, 
firms create an organizationally shared market understand- 
ing, enabling organization members to determine appropri- 
ate behaviors that are consistent with the value of the mar- 
ket as the raison d'étre. 

Firms create shared understandings—or shared market 
schemas—by sending cross-functional teams into the mar- 
ket to meet with customers, channels, and influencers. Over 
time and across teams, those experiences coalesce around a 
shared understanding of the market. Teams explicitly repre- 
sent this understanding in a market metaphor, such as “a 
Harley Rider" for Harley-Davidson. Teams then vicariously 
share their experiences with other organization members 
using stories and artifacts from the field, explaining the 
meaning encapsulated in the market metaphor. It is through 
this process that firms create a shared market understanding. 

For example, as the first step in developing a value 
proposition at Marshfield DoorSystems, cross-functional 
teams were sent out to visit architects, contractors, and dis- 
tributors. Informants described these visits as transforma- 
tive. An informant with more than 20 years of industry 
experience shared the following: 


I went on a few trips.... We would ask contractors, "Why 
are things like that? Why is shipping on time important?" 
[It] was really very interesting for me.... Usually only 
sales guys got to see the customers.... I got to understand 
some of the issues, like delivering in the nght week is 
what it's all about. Like issues with doing remodeling and 
having to have service elevators scheduled. [The contrac- 
tors] need precision in big cities like Boston. These are 
considerations I never knew about before. It’s important 


because now we're talking about the end-use customer. 
That end-use customer wants it in there at a certain point 
in time. This has a great impact on the architect [and the] 
contractor that specified your door and the end-use cus- 
tomer who 1s going to pay for it. If you do it well, they'll 
be back. If you don’t do it well, they will remember 
that,... and of course, it all has to be correct. 


The informant was aware that delivering products to 
specification and on time was important, but meeting dis- 
tributors, building owners, general contractors, and archi- 
tects created a new sense of urgency and importance around 
these expectations. The informant developed an apprecia- 
tion for why these things were important and the impact of 
failing to meet such expectations. When he discussed his 
firm’s value proposition, he relayed these personal stories, 
providing context and understanding for the value proposi- 
tion. This allowed a listener to appreciate the perspectives 
of the people he met, leading to empathy for their situation. 

During the 1980s, Harley-Davidson initially sent 
employees into the field to staff events because they did not 
have the money to hire outside people. Soon after, Harley- 
Davidson began to realize benefits similar to those at 
Marshfield DoorSystems, and eventually, all employees 
were expected to work in the field to understand customers 
and their needs (e.g., Fournier et al. 2000). Motorola and 
Alberto-Culver sent smaller groups of people into the mar- 
ket, who similarly brought back market insights to create 
organizationally shared understandings in their firms. 

At Motorola, the Global Design Planning team was 
responsible for managing fieldwork as part of the product 
development teams’ uncovering consumer needs and trends. 
One team member explained, “The [Global Design Plan- 
ning] team does behavior research.... Behavioral research 
uses a lot of social science methodology—anthropology 
methodology.” Using a project example, the tnformant 
explained, “No one knew what [a leadership phone] meant, 
other than the definition we had with StarTac.... [So] we 
actually went out and did interviews and made a research 
brief. We went out and interviewed the leaders, the cutting- 
edge people.” 

To share this field information, team members brought 
back stories and artifacts, including video recordings and 
pictures. This allowed others to experience the market vic- 
ariously and build a shared understanding, much like the 
employees at Marshfield DoorSystems who heard stories 
from returning colleagues. A guiding-coalition informant, 
focused on design, summarized the transformative nature of 
this process at Motorola: 

This 1s a journey of enlightenment, right? What is enlight- 

enment? Enlightenment is when you truly understand the 

true cultural points of resonating with what their. needs 
are. What does that mean? That means there’s a huge 
installed base of knowledge within the culture about the 
world it shares. There's a constant feeling of that knowl- 
edge with ethnography that many other points [miss]. It's 
that point of enlightenment. It's also at the point where 
you're truly innovating cross-functionally, not only with 
technology and design but also with the business that 

you're creating... Enlightenment, in that context, is a 

higher art of brand building and value creation. 


It comes back down to this notion of collaboration and 
shared understanding as being really fundamental; you 
have to have opening the kimono itself be a core function 
of the company and build a shared vision of where you're 
going, share the enthusiasm, and that requires a level of 
intensity and communication that's really never to be 
underestimated. 


In addition to product-specific research, Motorola PCS 
created a brand compass metaphor that highlighted four 
attributes across market segments: communication, business 
use, social use, and price. The brand compass was a trian- 
gle, with communication as the base, business and social as 
the sides, and price increasing from entry level at the base 
to premium at the top of the triangle. Pictures of five arche- 
typal consumers were superimposed on the triangle, sym- 
bolizing five broad consumer segments: everyday commu- 
nication, easy business, corporate business, networked 
entertainment, and personal style. Although we did not 
observe creation of the brand compass, its use in almost 
every conversation within Motorola PCS was striking, par- 
ticularly the stories people offered to contextualize each 
segment. All products were first defined within the brand 
compass metaphor and then elaborated on the basis of their 
specific attributes, such as corporate e-mail access or 
fashion/style consciousness. The brand compass allowed 
people to discuss various handset and service initiatives 
within a common understanding of what they were trying to 
accomplish collectively. 

Reconnecting with the market is not defined by field- 
work alone, but also by how such experiences are shared in 
the field from multiple perspectives and then shared within 
the organization to create shared meaning. Informants from 
each of the firms that were failing to realize a market orien- 
tation recalled sporadic field visits with customers con- 
ducted by functionally homogeneous teams that did not 
share the results internally. A BenefitsInc informant 
recalled, 


That reminds me, the first boss I had here believed that 
people should leave the office. He always thought the 
people at [divisional headquarters] really never left the 
office. So he wanted people to go out and do customer 
interviews. [We found an academic specializing in cus- 
tomer visits and had] an internal interviewing training 
program to teach people who are not market researchers 
how to conduct an interview.... [My boss] got all the 
people in marketing ... and some sales people too,... and 
they went and interviewed customers.... There were 
teams, three or four at a time.... It worked out pretty 
well,... but the follow-up through some type of methodi- 
cal *what did we find" didn't really ever get done, at least 
not that I know of. 


Removal of dissenters, hiring of believers. Although.ini- 
tial cultural transformation efforts are sufficiently effective 
early in the transformation process, at some later point, 
coaching cultural dissenters to change is abandoned and 
removal of such dissenters becomes the focus. Simultane- 
ously, organizations modify their recruitment processes, 
henceforth requiring that individually held values of new 
members closely match those of the organization. These 
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changes suggest a higher-level change from creating a new 
culture to preserving and nurturing an existing culture. 

Experiences at Alberto-Culver are revealing. At one 
point, 75% of President Carol Bernick’s direct reports were 
new. Some were part of her initial guiding coalition, subse- 
quently elevated to new positions, and others were new to 
the organization, recruited on the basis of functional com- 
petencies and ability to fit into the culture. As one informant 
recalled, 


[Bernick] made some changes fairly early on in the mar- 
keting area. She made a change fairly early on with her 
financial guy.... So those two areas changed sooner than 
later. I can’t remember exactly on the sales side. I think 
she changed it once and then changed it again with Dick 
Hynes. So I mean, she built her team. The most recent 
change has been in [research and development] ... a year 
and a half ago. She really feels strongly, and I absolutely 
apree with her that she needs people who really under- 
stand the culture and who operate within the culture.... 
[(O]bviously you've got to perform ın your function, but 
you have to understand and support and drive the culture 
into the organization. 


In conversations with the newer executives, it was clear 
that their values were aligned with the firm's values. The 
Alberto-Culver culture played to new executives' strengths 
and fit what they were looking for later in their careers, fol- 
lowing successful careers with much larger, bureaucratic 
organizations. When we asked newer executives about col- 
laborative team meetings with invitees from across the com- 
pany, they were often puzzled by the question and said they 
had always worked that way. Thus, apart from removing 
people who did not fit the new culture, much of Alberto- 
Culver’s success at becoming more market oriented was 
attributed to bringing in people who already believed in the 
culture and were able to add to it through their unique expe- 
riences and competencies. 

Regarding firms that were failing to realize a market 
orientation, there was no comparable removal or hiring of 
people based on cultural fit. On the one hand, because their 
firm cultures appeared much weaker, it is difficult to imag- 
ine how they could make personnel decisions based on cul- 
tural fit. On the other hand, it appears that the lack of con- 
sequences for defying the desired cultural norms impeded 
cultural change and thus resulted in weak overall organiza- 
tional cultures. 


Collaborative strategy. At this transformation point, 
organizations collaboratively address tasks in a more 
market-oriented manner, precisely because they agree on 
what the market is, what its unmet needs are, and how to 
work together to meet those needs. Furthermore, the collab- 
orative development of strategy leverages a firm’s collective 
knowledge and capabilities, leading to more creative and 
realistic strategies (i.e., they can be implemented). In addi- 
tion, the organizationwide comprehension of strategic goals 
and their relation to the market provides context for why 
specific tasks are important, increasing the likelihood of 
success. This collaborative process also leads to higher lev- 
els of commitment to the strategies and the firm, which 
again increases the likelihood of success. 
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Indicative of this process, a Motorola executive 
explained the change in developing strategy as paradoxical; 
there were more people involved than ever before, but it 
was more productive and definitive than pretransformation 
efforts: 


What you end up doing is creating a camaraderie that ... 
may not have any effect at all on the definition of the 
product, but ... you've got a team that can then work 
through normal process, which 1s more condensed. It's 
more focused... What's fun 1s ... you make sure that 
you're open-minded to those big sessions—that you could 
[reveal and learn new] stuff,... and that's a really wonder- 
ful surprise and great things can happen... At one 
point,... we had 300 people involved in developing the 
strategy, and it was kind of like, "What the hell are you 
doing?"... but ıt built a portfolio that was focused and 
clear and driven when we got done with ıt. 


Similarly, Marshfield DoorSystems informants 
explained that their new culture and value proposition made 
strategy development and implementation much easier 
because, in the words of one informant, “There was one 
picture that we all focused on, and everyone knew it!” A 
shop floor employee recalled, “Basically, when Bill came, 
he turned the mill over to the people and said, “This is your 
baby—4nake it work!’... [Today,] the union and the com- 
pany are working together instead of against each other.” 

Although the process of developing and implementing 
market-based strategies was more efficient and effective 
across firms because of a new shared understanding of the 
market, this shared understanding did not replace traditional 
marketing research. As a Harley-Davidson marketer 
observed, “Market research just meant more.... [I]t made it 
much easier when presenting findings or ideas within 
Harley-Davidson. People would say, ‘Oh yeah! I remember 
this one guy." 

Conversely, this new collaborative approach changed 
the role of formal organization structures. From this stage 
forward, collaborative cross-functional teams conducted the 
vast majority of work; any individual could be on multiple 
teams, and no two teams comprised the same members. 
This concept was exemplified by a Motorola informant's 
observation that “one of the other things Mike says 50 times 
a week is, ‘Make the matrix work. I know we're a matrix 
organization. I did it on purpose.... I know it's not easy.... 
Get your ass down there and figure it out.’” 

Without the benefit of a shared market understanding 
and agreed-on norms for behavior, the three firms that were 
failing to achieve a market orientation were demonstrably 
frustrated in their marketing strategy efforts. Introducing 
the new brand identity at BenefitsInc was laborious as a 
result of different conceptions of what the business was ver- 
sus the proposed brand promise. After the new brand iden- 
tity was implemented, senior managers believed that it 
would lead to organizational change, but a member of that 
effort confided that there was now "this sense of imbalance 
in terms of what the marching orders are versus what can be 
realistically achieved. I think, underneath it all, employees 
understand that big difference, and if they understood that, 
*Hey, you know what, there's a realistic mission here, and 
we're quick to accomplish it, they'll do it. Otherwise,... 


you're told, ‘OK, it’s D-day, scale that cliff.’ [Then the 
employees say,] ‘Well, with what? What kind of backing do 
I have?” 

Even with the cultural transformation workshops and 
assistance from outside business consultants, MediaCo and 
EquipmentCo informants shared a frustration with strategy 
development, and no one seemed to understand how it all fit 
together. A MediaCo informant commented, "There's a 
[strategic] plan, and it's completely divorced from what 
we're doing.... [A]lthough we suffer from that like every- 
one else, I've never seen anything ... so completely decou- 
pled [from reality].... We don't have a planning process that 
actually links our [strategic] plan to our operational plan. In 
fact, we don't have operation plans; we have budgets." 
Similarly, an EquipmentCo informant shared, *[W]e were 
talking yesterday about the planning process in general. We 
got in these kinds of debates of when we have to have our 
plans due, and I was like, my team, it's very hard for us to 
develop our plans before the [product] centers have their 
plans in place, because what we need from them is, ‘What 
target markets are you going after? What segments?... 
What kind of market share growth do you want:..? What 
are the key value drivers...?’” 


Summary. The reconstitution stage consists of taking the 
new concept for the firm public with a demarcation cere- 
mony. After the plan becomes public, the effort turns to 
value and norm development, reconnecting with the market, 
personnel changes, and the development of a collaborative 
strategy. As a result of these efforts, the reconstitution stage 
creates the culture, understanding, and processes that lead 
to the organizationwide generation of, dissemination of, and 
responsiveness to market intelligence described. Elements 
of this process or ethos have been discussed by other 
researchers (e.g., Allen, McQuarrie, and Barr 1998; Gouil- 
lart and Sturdivant 1994; Kohli and Jaworski 1990; Shapiro 
1988). For example, others have advocated cross-functional 
customer visits (Allen, McQuarrie, and Barr 1998). Our 
analysis shows how these elements are woven into a 
process. Moreover, we show that successfully creating an 
organization that behaves in such a way depends fundamen- 
tally on new values and norms that are reflected in organi- 
zational learning, personnel decisions, and the collaborative 
strategy development process. 


Stage 3: Institutionalization 


By the time firms reach Stage 3, they have undergone dra- 
matic, fundamental changes. These changes, however 
important, remain informal to a significant degree. That is, 
the organizational culture and resulting behaviors have 
changed much more dramatically than the formal 'organiza- 
tion structure and supporting attributes. In Stage 3, organi- 
zations institutionalize these changes. The speed and effec- 
tiveness at which organizations successfully institutionalize 
these changes depends heavily on their experience in prior 
stages. Firms that realize greater levels of market success 
develop increasing comfort working in a market-oriented 
manner, accelerating their institutionalization. Simultane- 
ously, the guiding coalitions create explicit symbols, arti- 
facts, and rituals to retain and further encourage a market- 


oriented culture, characterized by (1) formalization, (2) 
alignment of rewards, (3) indoctrination and training, and 
(4) power shift. At the time of our fieldwork, Marshfield 
DoorSystems, Alberto-Culver, and Harley-Davidson had 
progressed through the institutionalization stage. 


Formalization. The guiding coalitions explicitly formal- 
ized their organizations' evolving market-oriented cultures 
through symbols, rituals, and artifacts. These included orga- 
nization structure changes at Harley-Davidson, the imple- 
mentation of high-performance work systems at Marshfield 
DoorSystems, and the explicit defining of cultural values at 
Alberto-Culver and Harley-Davidson. 

In 1993, Harley-Davidson moved to a circular organiza- 
tion structure, formally institutionalizing its collaborative 
culture. The functional leadership group was composed of 
Harley-Davidson's top management with three subgroups: 
create demand circle, produce product circle, and provide 
support circle. Even with a history of participation and col- 
laboration, a Harley-Davidson executive explained how the 
circle concept further raised expectations and opportunities 
for collaboration over time: 


Circle meetings are pretty functional, although I don't 
think they always have been. I think we had to find that 
fine line about what is circle-worthy and what is not. 
There was a time ... I was coordinating some function and 
the circle ... sat there and argued about how long the 
cocktail hour should be. OK, so you've got a bunch of 
[vice presidents] sitting there arguing about how long the 
cocktail hour should be. So you could fall into a trap. I 
think they had to find their way about how detailed do we 
get in our understanding.... In that respect, it's important 
to recognize what do you need other people's support and 
input on and what don't you. What 1s within your own 
function? There is no book that you could have followed 
that said, "This is a circle thing, and this is not" It's a 
judgment call. When everybody came in, it was just that— 
some people wanted to know everything about what 
everybody else is doing—so for everything, “Well, the cir- 
cle sbould decide that,” and other people said, “Um, I 
think that’s my job to decide that..." When Rich [Teer- 
link] put us all in the circles, along with Jim Paterson,... 
they just said, "Here you go. And here's the philosophy 
around ıt.” So I think there's a lot of struggling to figure 
out where that 1s. That's where a lot of trust and commu- 
nication has to happen. You've got to assume that every- 
one is doing their job and doing it well. Otherwise you 
could make everything a circle 1ssue.... So I think we still 
struggle with that sometimes, but we'll catch ourselves 
now.... So we had to find the point where there's con- 
structive interaction and where there's just simply med- 
dling. Meddling is not productive. But it is impossible to 
live in a silo because we're so integrated with each other 
There's very little that I do that somebody else doesn't 
feel, and vice versa. 


Alignment of rewards. The intrinsic motivation from 
serving the market and working collaboratively toward 
common goals appears to be the primary driver for 
employee efforts in earlier stages. Greater levels of intrinsic 
motivation are created as organizational cultures become 
stronger and firms begin to experience success. However, at 
a point when employees feel empowered and the organiza- 
tion is successful, employees perceive working harder as 
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primarily benefiting the organization’s owners and top man- 
agement. This creates perceptions of injustice due to the 
incongruity between the culture and the allocation of 
rewards. Within participating firms, the implementation of 
group-based variable compensation schemes addressed 
these concerns and further institutionalized a market- 
oriented culture. 

For example, despite increasing levels of success in the 
late 1980s and early 1990s, Harley-Davidson’s management 
perceived a growing tension between the firm’s culture and 
its formal incentive structures, which had remained largely 
unchanged since the early 1980s. Teerlink and Ozley (2000, 
p. 145) note, “Employees argued that, at the end of the day, 
the ‘real’ work of Harley-Davidson was the work that the 
company recognized in a paycheck. This ‘touchy-feely’ 
stuff seemed okay,... but the organization would resist the 
kinds of behaviors that were being proposed until the com- 
pany recognized and rewarded those behaviors.” 

To address these issues, the company first implemented 
a new personnel development and evaluation system, 
matching individual expectations and performance directly 
with the company’s mission, values, and business pro- 
cesses. An executive commented, “This was a vision of the 
way people needed to be engaged in an organization ... that 
allows for theoretical alignment of an individual’s job with 
the long-term direction of the company” (Teerlink and 
Ozley 2000, p. 106). This alignment also provided flexibil- 
ity in adapting to changing market and organizational con- 
ditions. Employees were expected always to do what was 
best for the organization under the broad umbrella of its val- 
ues and objectives, without having to make trade-offs 
between individual compensation and the organization’s 
welfare. 

In parallel, everyone’s compensation was altered to 
include a variable component that corresponded with 
Harley-Davidson’s success (or lack thereof). Management 
defined performance metrics at the business unit or plant 
level (the largest collaborative group with control) and pro- 
gressively expanded the program to include all salaried 
employees by 1990 and all employees by 1993. Variable 
compensation bonuses were paid to everyone or no one on 
the basis of the goal levels achieved. 


Indoctrination and training. Over time, the saliency of 
shared understandings from the reconstitution stage 
degrades. More significantly, as firms recruit believers and 
remove dissenters, an ever-growing number of new employ- 
ees lack the shared experiences to understand assumptions 
underlying the culture. In addition, employees and man- 
agers often lack the knowledge or skill levels required to 
meet escalating expectations as firms become even more 
market oriented. To address these challenges, firms institute 
indoctrination programs for new employees and ongoing 
training programs that reinforce and refine the culture 
through skill development for current employees. 

For example, in 1990, Harley-Davidson instituted 
employee orientation programs. An informant recalled, 
“When I came here there was an awful lot of formal train- 
ing.... Within a few months of me being here, we had a new 
employee orientation. It was like the first one they had had. 
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It lasted a half a day. And it was everybody who had started 
over the last six months. Now they do it, it starts every 
week, and it lasts three days, but it was a smaller company 
then.” In addition, the informant received a copy of Well 
Made in America: Lessons from Harley-Davidson on Being 
the Best (Reid 1990), which documents Harley-Davidson’s 
transformation from the early 1980s: “I read that one too 
before I started. They gave it to you before you started. Or 
at least they gave it to me.” 

Around the same time, the firm launched the Harley- 
Davidson Leadership Institute to reinforce the culture, pro- 
vide a broader understanding of the firm, and familiarize 
employees with tools for working within the firm. Employ- 
ees spoke enthusiastically about the Leadership Institute. 
One informant grabbed a Leadership Institute binder above 
his desk and, while thumbing through it, explained, “Have 
you seen this? Here are our values. Here’s our issues, our 
stakeholders, our visions. You’ve heard these before, 
right?... So I won’t recite them to you—because I can. And 
then you’ve got mission and operating philosophy, our 
objectives, and then my area's functional strategy.... [WJe’d 
spend time on what is this and what is [that] and how does 
it impact us in our area and what are you going to do about 
it? Make sure we're all connected here.” Another informant 
commented, “[Such training] is probably more valuable 
today, to allow people to interact and understand events and 
come to some philosophy that was practical for them. It 
takes that philosophy and makes it practical for them. 
"What does 'tell the truth' mean to you? What does the 
vision statement [mean to you]? What are stakeholders? A 
shareholder is a stakeholder—no more, no less than some of 
these other stakeholders.’ To really get those issues out on 
the table and have the discussions about them." 


Power shift. As firms realize higher levels of market ori- 
entation, the power to make decisions and act on behalf of 
the firm shifts from the guiding-coalition members to all 
organization members. Employees are responsible for mak- 
ing decisions and acting appropriately within the agreed-on 
framework of each firm, as well as ensuring everyone else 
is doing the same. Within participating firms, the guiding- 
coalition members viewed these behaviors as evidence of a 
successful transformation. 

Indicative of this change, a Marshfield DoorSystems 
informant commented, 


We had [a reengineering blitz] that took place about ten 
days ago....[T]here was a group 1n the shipping depart- 
ment. They were doing a blitz on the palletizing and pack- 
aging area. We had seen a large number of claims, com- 
plaints, and frustrations about products not being 
packaged properly. Our biggest problem was distributors 
complaining that “It wasn't packaged how I ordered ıt.” 
So we had this blitz going on,... hourly people along with 
the supervisor.... As part of this blitz, they sat down and 
interviewed one of our salespeople and asked, “Why 1s 
this such a problem?" Something magical came out if it! 
The salesperson said, “You’ve got to understand, the dis- 
tributors are paying for this special handling, and they’re 
not getting what they're paying for" Then one of the 
hourly people said, “Wait, let me see. Am I to understand 
this right? We know they're paying for palletizing, but you 
mean some kinds of packaging cost more than others?" 


The salesperson said, “We have twenty different ways of 
palletzing, and there's twenty different prices. Whatever 
you ask for has different prices." The people in palletizing 
didn't know that! They said, "Really? We just thought, 
some just wanted it this way, [and] some [others] wanted 
it that way. A lot of times we're busy, [so we figure] what 
the hell is the difference? I’m doing the next pallet; I'll do 
it just the same....” [So the team realized,] “Wait a sec- 
ond! We're creating value! They're willing to pay for 
this!” And you keep these learning things going on. So we 
had two different schemes: the sales people wanted to try 
more options; shipping wanted to get it down to [one 
option]. They didn't realize the two were causing prob- 
lems for each other. The salespeople didn't understand 
some of the associated costs, even though they were 
charging differently; it wasn't appropriate relative to what 
the costs really were. So the solutions came out and the 
rest of that week they spent their time understanding every 
single one of the options. This is the pure cost of it. 50 we 
have a mark-up or discount and everybody knows it. Now 
they know why, so when they get a request for palletizing 
in a certain manner, [they understand that] this is what the 
customer wants. 


In addition to decisions about how to serve the market 
better, employees are expected to enforce the culture by 
pointing out other organization members' cultural infrac- 
tions. In other words, the power used by the guiding coali- 
tion to guide the transformation is distributed progressively 
among all organizational members to maintain that culture. 
For example, a Harley-Davidson informant explained, “We 
know when we have permission to do things and call people 
on things.... The issue is that training here gave us permis- 
sion to call people on things; it gave us the community 
which we work in;... it's not a personal attack, but part of 
getting things done.” Similarly, at Marshfield DoorSystems, 
we observed a union member challenge executive behavior 
regarding the cultural value of openness during an all- 
company meeting, commenting, “We used to get communi- 
cation all the time. We used to have meetings every morn- 
ing... I think there is an issue of communication 
here....What are the monthly profit numbers? Shipping on 
time? We used to get this weekly.” Not only was this accept- 
able employee behavior, a senior executive apologized for 
the oversight, promised to fix the problem, and suggested 
that the employee follow up if no action was evident. 


Summary. 'The institutionalization of a market orienta- 
tion requires formalization of organizational structures and 
processes, alignment of rewards, and cultural indoctrination 
through training. These changes institutionalize: a power 
shift from the guiding coalition to the larger organization. 
With the exception of the power shift, previous research has 
focused on similar formal changes as a way to develop a 
market orientation (e.g., Day 1999). We find that, indeed, 
these changes are central. However, unlike previous studies, 
we find that these actions follow a more fundamental cul- 
tural shift. Thus, formal changes institutionalize and rein- 
force earlier cultural changes. Furthermore, the natural con- 
sequence of a strong culture is that it relies on individuals 
distributed throughout the organization to ensure cultural 
enforcement rather than relying on power concentrated 


within an elite guiding coalition, as was the case for insti- 
gating cultural change. - 


Stage 4: Maintenance 


A curious phenomenon occurs after organizations achieve a 
greater market focus: Employees develop increasingly 
divergent interpretations about why their firms changed and 
the attributes responsible for their firms' success. Employ- 
ees who join firms after transformation voice the most sig- 
nificant interpretation variations. Organization members 
who participated in the transformation also offer variations, 
and the degree of variation is positively correlated with the 
amount of time since the transformation. 

Specific individuals or firm attributes are the most com- 
mon explanation for a firm's market orientation and suc- 
cess. Individual-focused interpretations create a mythologi- 
cal figure or group of figures out of the guiding coalition's 
leader or members. Such interpretations transform the 
messy, progressive, and confusing process of collabora- 
tively becoming market oriented, engaged in by hundreds or 
thousands of people, into the more appealing root metaphor 
of a savior-driven transformation (e.g., Turner 1975). Simi- 
Jarly, attribute-focused interpretations reconstruct history to 
fit current circumstances and enhance the inherent capabili- 
ties or importance of one of three causes: mythological fig- 
ures, collective organization members' ability to face adver- 
sity, or pure luck. Such attribute-focused interpretations 
offer much simpler explanations than the actual series of 
changes responsible for firms' transformations. Although 
more easily comprehended, such attribute-focused interpre- 
tations create non-market-focused suggestions for how to 
handle future challenges, such as recruiting or revealing a 
new mythological leader, bearing down and working harder, 
or hoping for good luck. 

Given these alternative explanations for market orienta- 
tion and performance, organizational leaders develop pro- 
cesses and model behaviors in an effort to maintain a mar- 
ket orientation. Three processes reinforce a market-oriented 
culture: cultural screening of new members, culture mainte- 
nance rituals, and ongoing market connection activities to 
update market schemas and validate market-oriented 
process schemas. Firms also exhibit two characteristics pro- 
tecting their market-oriented cultures from outside influ- 
ences: Cultural "flame keepers" approve organizational 
changes based on cultural consistency, and there is a vigi- 
lance against management fads and fashions. 


Cultural screening of new members. Alberto-Culver 
relied heavily on personal interactions to identify new 
employees, with Carol Bernick directing executive recruit- 
ing and approving managerial hires. Similarly, Marshfield 
DoorSystems relied on employee referrals to identify 
recruits who fit the culture, and senior executives were 
involved in every hiring decision. With a workforce of 8000 
employees and incredible growth, Harley-Davidson 
screened new members using a prescribed process. To 
maintain a culture of “Harleyness,” Harley-Davidson's 
recruiting process was extremely rigorous, focusing primar- 
ily on a cultural fit. In the words of one informant, *[W]hen 
you bring people into the organization that don't understand 
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the tradition, your product suffers terribly. The neat thing 
about Harley-Davidson is we’ve managed to keep people 
coming in who understand the tradition. So the culture at 
the Motor Company absorbs them because they’re ready to 
be absorbed and there’s no conflict.” 


Culture maintenance rituals. Firms couple screening of 
new members with cultural indoctrination rituals that pro- 
vide a historical context for how firms progressed to their 
current states and cultures. In contrast to previous indoctri- 
nation efforts, seasoned employees also partake in similar 
rituals, reminding them of the history and assumptions 
underlying their market-oriented culture. 

For example, an executive informant explained that 
Harley-Davidson was considering some type of cultural 
tune-up program for all employees: 


[S]ome people have gone through it, and you have to be 
aware of that and do 1t as more of a refresher rather than 
relive the whole experience. So how do we do that? I think 
that's what we, as the senior leadership, are going to start 
talking about. I know that [another executive] has a focus 
on that—that we really need to start looking at that busi- 
ness process model and make it a refresher so it's not just 
the words on the wall. 


Ongoing market connections. As organizations and mar- 
kets evolve, cross-functional field visits appear to be crucial 
for maintaining a market orientation. Although all three 
firms conducted surveys and focus groups and performed 
other research activities to monitor the market and elaborate 
on their shared schemas, they also performed ongoing or 
periodic cross-functional field visits to ensure that their 
organizationally shared market understandings were up-to- 
date. As an Alberto-Culver marketing executive noted, “We 
do workshops around it—and I'll sit through some too—of 
the stuff in the field. I mean, I don't do the interview or 
stuff, but I'll certainly [be there]. Have to see some of the 
stuff with my own eyes. As much experience as I have in 
this category, it continues to change and people's beliefs 
change. So you have to stay with it and understand it, 
because sometimes my experience can be detrimental. I 
have a lot more baggage than most. I have to understand 
when something's outdated and when things change. So I 
try to get out there as often as I can." 

Marshfield DoorSystems was updating its composite 
division's value proposition during our fieldwork. As with 
the original value proposition work, the firm was sending 
cross-functional teams into the field to interview buyers and 
so forth to develop a new value proposition based on their 
shared experiences. 

With demo rides and rallies, Harley-Davidson had the 
greatest opportunity for and level of ongoing market con- 
nections. À company veteran commented, 


Even when I go to rallies now—I hadn't been to a rally in 
eight years when I signed up for Daytona this year when I 
met you—I came back with a greater sense of contribution 
to the business, visioning, and focus than I had before, and 
as [a high-level manager/executive], I’m hearing our 
needs, but it’s good to get those needs either validated or 
repositioned hearing it from the customer.... [Regarding 
dealers], I've had my project managers going out in the 
field to- understand what could/will collaborative inven- 
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tory management [deliver]. They see customers. They see 
stocking in the dealerships. Our people see more about 
our business because we drive it by processes.... It's not 
just checking a box; it's driven by the market. I don't have 
to go to demo rides anymore, but I just wanted to.... 
Every opportunity you get to leave these four walls, you 
always learn something. I also go to a dealer almost every 
other month. It impacts how I direct resources, how I get 
projects approved, how I run projects, everything. 


In addition, the company's program encouraging 
employee motorcycle ownership provided a venue for 
employees to experience dealerships and how Harley- 
Davidson's support programs affected end users. One infor- 
mant explained, 


We encourage employees to own and use the motorcycles. 
Another point that I think is interesting about Harley is 
that we don't have an executive motorcycle fleet, but we 
do have a program that encourages people to buy a motor- 
cycle. But the cornerstone of the program is that you go 
through the same thing a customer does. You have to go to 
a dealer, you have to get in line with everybody else. 
You've got to order a bike. I mean everything gets han- 
dled;... you make the best deal you can get. So the 
employees, in order to use the product, have to have the 
customer experience. We're not insulated behind a privi- 
leged fleet of bikes. 


Cultural flame keepers. As firms adapt to changing mar- 
ket needs, employees are continually faced with procedural 
and other choices that could affect firms' cultures. Address- 
ing the cultural impact of such changes rests with a small 
number of cultural flame keepers within each organization, 
whose implicit or explicit role is to ensure that procedures 
or programs adopted in an effort to improve the organiza- 
tion do not negatively affect the organization's core market- 
oriented culture. 

For example, during a Marshfield DoorSystems senior 
staff meeting, the group discussed designing a system to 
track and reward employee suggestions for process 
improvements. A few executives had recently visited 
another company with such a system and were adamant that 
Marshfield DoorSystems should adopt the identical system 
and require all workers to participate. One of the original 
guiding-coalition members questioned whether such a sys- 
tem was consistent with the firm's core values. Following 
an elaboration on the cultural differences between the firms, 
the executive summarized his concern, “[Company X's] 
firm culture won't work here at Marshfield where there is a 
culture of trust, where you treat people like adults." All par- 
ticipants acknowledged the point and resolved to develop a 
more culturally appropriate system or not to implement one 
at all. 


Vigilance against management fads and fashions. 
Another characteristic of firms maintaining a market orien- 
tation is an organizationwide skepticism toward manage- 
ment fads and fashions. Similar to the constant referencing 
back to shared market schemas regarding the appropriate- 
ness of actions, employees reference their shared market 
and process schemas before considering changes to any 
process suggested by management fads and fashions. 


For example, during a discussion about customer rela- 
tionship management (CRM) systems, a Harley-Davidson 
informant talked about the abundance of disparate customer 
databases available from dealers, Harley Owners Group, 
demo rides, registrations, and so forth. However, tying them 
together was not necessarily the right thing to do for 
Harley-Davidson, as the informant explained: 


Currently, each dealer has a great database of customer 
information. The dealer has it, not us. We have great data- 
bases too that they don’t. The goal of the future would be 
to find a way to share what we know about our mutual 
customers and enhance the dealer’s ability to sell and 
focus on more direct customer relationships, as well, as us 
coming up with the right marketing programs...! The 
point is that we have a lot of data, but what do we do to 
turn it into actionable information,... and that goes back 
to a larger strategy of “What is our CRM strategy? How 
do we want to use customer information?” That’s the 
biggest thing, and that’s yet to be determined. That’s the 
age-old question; one of the things that I’ve written down 
is I hear people say: “The real power of Harley-Davidson 
is the power to market to consumers who love the prod- 
uct.” I mean, who else can say that? We're singing to the 
chou, right? People love us. What a job! We get to market 
to people who love us! I mean, car manufacturers can 
market to me. I don’t love them.... The tough part is, if 
it's true, if the power of the Harley-Davidson brand is due 
to the power that we get because we market to consumers 
who love us, the question is, "Should we change that? 
Why would we get closer? Why would we appear intru- 
sive? Why would we be farther away? Why would we 
change what we're doing?" So at other companies, they're 
now struggling with what is their [CRM] strategy; you 
know, you read it in every document you pick up: CRM, 
CRM, CRM.... Our people love us already. So the chal- 
lenge here ... is they love us already, we don't want to be 
intrusive. 


Discussion 


Our research shows that the creation of a market-oriented 
firm involves several interdependent changes at the individ- 


ual, group, and organization levels that occur over several 
years. The four stages of that process appear in Figure 2, 
which shows that each of the stages has a distinct character 
and is composed of several steps. Stage 1 focuses on plan- 
ning for change by a small group of powerful executives. 
Stage 2 is characterized by guiding-coalition efforts to build 
a consensus for change, based on new organizational values 
and norms and shared market experiences. If successful, the 
firm moves to Stage 3, in which it institutionalizes changes 
in organizational structure, processes, and formal rewards. 
In Stage 4, having achieved a greater market orientation, the 
organization creates mechanisms to sustain it. 


Creating a Market Orientation Through Cultural 
Change 


Our analysis, as reflected in this model, illustrates that cre- 
ating a market-oriented organization is essentially a process 
of cultural transformation. Guiding coalitions create cul- 
tural change by inculcating organization members with 
market-oriented values and providing them with transfor- 
mative market experiences. Cultural values are initially 
inculcated through management actions that exemplify 
desired values and through the recognition and rewarding of 
behaviors that are aligned with desired values. These man- 
agement actions and rewards bear much closer resemblance 
to culture change levers that focus on intrinsic motivation, 
group dynamics, and social acceptance (e.g., O’Reilly and 
Chatman 1996; Schein 1985) than the teleological mecha- 
nisms of management direction and extrinsic pay for perfor- 
mance, as existing market orientation research suggests 
(e.g., Jaworski and Kohli 1993). 

Our analysis reveals that creating cultural change 
requires direct or vicarious market experience, providing 
organization members with shared meaning and purpose. 
We find that organizationwide sharing and making collec- 
tive sense of market experience is responsible for effecting 
a change to a market-oriented culture. The cross-functional, 
organizationwide nature of such activities creates an organi- 


FIGURE 2 
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zation affiliation around serving the market as the most 
salient and attractive association for all employees. In other 
words, shared market experiences provide shared under- 
standing and meaning to the cultural value that differenti- 
ates a market-oriented culture, namely, the market as the 
raison d'étre. Although previous research has investigated 
the relationship between shared meaning and performance 
(Hult, Ketchen, and Slater 2004; Moorman 1995), our find- 
ings support the notion that shared market experiences con- 
tribute to a larger cultural change process to a market orien- 
tation (Narver, Slater, and Tietje 1998). Our findings also 
contrast with prevailing cultural change models in the orga- 
nizational development literature (e.g., French and Bell 
1999). Specifically, we find that firms that rely on organiza- 
tional development consultants to facilitate change through 
behavioral interventions but do not reconnect with the mar- 
ket to create shared meaning and purpose do not realize a 
greater market orientation. 

Beyond shared meaning and purpose, market experi- 
ence provides opportunities for functionally or otherwise 
isolated employees to engage in perspective taking with 
others. As a result, employees engage in more generous 
attributions toward colleagues, facilitating 1ncreases in the 
cultural values of trust, openness, respect, promise keeping, 
and collaboration. These consequences parallel findings on 
perspective taking (e.g., Galinsky and Moskowitz 2000) and 
have been suggested as market orientation antecedents 
(interdepartmental connectedness and lack of conflict) in 
cross-sectional research (Jaworski and Kohli 1993; Kirca, 
Jayachandran, and Bearden 2005). Thus, within a larger 
process of creating a market orientation, our research offers 
insight into how firms create cooperation and reduce inter- 
departmental conflict previously not addressed. 

Overall, however, our model for creating a market ori- 
entation differs considerably from the existing literature, 
which proposes three antecedents to a market focus: top 
management focus, interdepartmental cooperation, and 
reward systems (Jaworski and Kohli 1993; Kirca, Jayachan- 
dran, and Bearden 2005). One implication of this view is 
that actions taken to increase the level of any one 
antecedent will lead to a greater market orientation, all else 
being equal. In contrast, our results reveal a much richer, 
more complex process of organizational change. Our analy- 
sis shows that, indeed, top management focus, interdepart- 
mental cooperation, and reward systems are important fac- 
tors influencing organizational change, but the sequence of 
these factors in a larger process is essential to successful 
organizational change. According to our analysis, top man- 
agement focus is required to begin and guide the process. 
Empowered stakeholders recognize a threat and begin the 
process of forming a coalition to transform the organiza- 
tion. Cooperation among functions is essential, but it 
requires top management support, common experiences, 
and a shared meaning structure to be effective. Organiza- 
tional rewards are important as well, but our analysis 
reveals that this is the case only after top management sup- 
port has been created, the challenge to the organization has 
been presented, the organization has been reconstituted, and 
an organizational consensus has been created for formaliz- 
ing a new, successful firm with largely new values and pur- 
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pose. Thus, rather than being three independent antecedents 
of change, top management focus, interdepartmental coop- 
eration, and reward systems operate in a particular sequence 
as part of a larger, more complex cultural change process. 


Market Orlentation 


Our findings offer new insights into the nature of market 
orientation, the role of organizational learning, power 
within the organization, and what sustains a market orienta- 
tion. Most fundamentally, our results offer a new perspec- 
tive on the debate about whether market orientation is sim- 
ply a set of behaviors or a culture. Researchers advancing a 
behavioral perspective suggest that the activities of a market 
orientation are separate from organizational culture (e.g., 
Deshpandé and Farley 1998; Kohli and Jaworski 1990), 
whereas others suggest that it is an organizational culture 
that encourages such behaviors (e.g., Homburg and Pflesser 
2000; Moorman 1995; Narver and Slater 1998). Our analy- 
sis provides further support for the cultural perspective. For 
example, as does Moorman (1995), we find that organiza- 
tional culture plays an important role in organizational 
information processes. More important, our analysis pro- 
vides new insight into the nature of the cultural values on 
which a market orientation is based. We find that firms cre- 
ating a market orientation embrace six cultural values: trust, 
openness, keeping promises, respect, collaboration, and 
viewing the market as the raison d'étre. The first five values 
encourage individuals to act as a cohesive whole in address- 
ing the market. The market as the raison d'étre provides 
individuals a rationale to collaborate toward a common pur- 
pose, thus supporting and reinforcing the other five values. 
These values are developed early in the transformation 
process, and they provide a foundation for changes that 
follow. 

These cultural values are the basis for market-oriented 
behaviors, namely, the generation of, dissemination of, and 
responsiveness to market intelligence (Kohli and Jaworski 
1990). Specifically, among the least and most market- 
oriented firms—and as each firm became more market- 
oriented—more market-oriented firms exhibited much 
stronger organizational cultures. Characteristics indicative 
of a strong culture include organizationwide adherence to 
values and norms, homogeneity of language and meaning, 
elaborate methods for selecting and indoctrinating new 
organization members, dispersion of power among organi- 
zation members, distribution of rewards and recognition 
based on behaviors consistent with the culture, and the 
organization as the primary group identification of members 
(e.g., O’Reilly and Chatman 1996; Schein 1985). Thus, the 
market-oriented behaviors we observe are the natural result 
of deeply held, shared cultural values. These cultural val- 
ues, however, appear to be necessary, but not sufficient, for 
the presence of market-oriented behaviors. 


Intraorganizational Power 


Our research demonstrates that market-oriented organiza- 
tions are characterized by a particular distribution of intra- 
organizational power. Power in organizations has been stud- 
ied extensively (e.g., Pfeffer 1981; Van de Ven and Poole 


1995). This work considers an organization a collection of 
individuals or coalitions, both advancing their own agenda 
with differing degrees of success in achieving their desired 
outcomes. In marketing, analyses of interorganizational 
power play a central role in the study of distribution sys- 
tems (e.g., Stern and Reve 1980). However, research on 
market orientation has not incorporated the notion of power. 
Therefore, it implicitly assumes that all individuals within 
the organization share a common goal (e.g., Day 1999; 
Jaworski and Kohli 1993). 

In contrast, our research finds that interpersonal and 
intraorganizational power play a significant role in the 
change to a market orientation, particularly with regard to 
instigating change, guiding change, and distributing the use 
of individual power to maintain a market orientation. Insti- 
gation of change to a market orientation begins with a 
market-oriented coalition gaining control through a'dialecti- 
cal struggle of ideas and power. Having won this struggle, 
guiding coalitions wield their power to guide cultural 
change through value inculcation, transformative market 
experiences, hiring of fellow believers, and removal of cul- 
tural dissenters. As organizations institutionalize a market- 
oriented culture, power devolves from guiding-coalition 
members to a more egalitarian distribution among organiza- 
tion members. This change from concentrated to distributed 
power is possible because of the growing strength of a 
market-oriented culture, and it is also necessary for acting 
collaboratively in a market-oriented manner. Thus, our 
research illustrates that the distribution of intraorganiza- 
tional power is an essential, but overlooked, dimension of 
market orientation. 


Organizational Learning and Schema 
Development 


Organizational learning plays an important, if — 
ated, role in the creation of a market orientation. As it is 
typically conceived, organizational learning is the collec- 
tion, interpretation, and encoding of the organization's 
experience (e.g., Huber 1991; Nonaka 1994). For example, 
when launching a new marketing strategy, an organization 
will perceive some outcomes, interpret the causes of those 
results, and then encode those lessons in the organization 
through the creation, modification, or elimination of organi- 
zational rules, processes, or principles. Analyses of market 
orientation have typically assumed that market orientation 
and organizational learning are independent (e.g., Baker 
and Sinkula 1999; Slater and Narver 1995). Although 
market-oriented firms may indeed be learning organiza- 
tions, organizational learning is not considered necessary 
for a market orientation. 

In contrast, our research implies that ET E TE 
firms fundamentally are learning organizations. Through 
time, members of the organization share common experi- 


ences, and those common experiences become formalized , 
as organizationally shared market and process schemas. ` 


These schemas enable organization members to communi- : 


cate and collaborate effectively in the process of gathering, 


disseminating, and reacting to market intelligence. Creating | 
and using shared schemas is central to organizational : 


l 


researchers’ conceptualization of learning organizations 
(Huber 1991; Nonaka 1994). It is with these shared 
schemas that organizations can effectively gather, dissemi- 
nate, and act on explicit knowledge (Nonaka 1994). 

The importance of organizational learning for market 
orientation extends beyond simply encoding the lessons of 
history. As a result of creating a market orientation, organi- 
zations also develop the capacity to evolve. They do so by 
creating processes for monitoring and changing schemas; 
this is referred to as a “third-order” change (Bartunek and 
Moch 1987). The creation of such process schemas is 
essential for a firm to maintain a market orientation in 
dynamic markets. By continually verifying and updating 
market schemas over time through shared experiences, 
market-oriented firms gain more experience with a market- 
oriented culture, and by operating in a market-oriented 
manner, the culture continues to strengthen while becoming 
increasingly adept at monitoring and reacting to market 
changes. We believe that this capability is an integral char- 
acteristic for maintaining a market orientation. Organiza- 
tional learning and the schemas that result—market and 
process schemas that firms use to adapt to the market—are 
important, yet often-overlooked aspects of a market orienta- 
tion revealed by our analysis. 


Organizational Change 


The process of organizational change that we describe dif- 
fers in important ways from the processes described in the 
organizational behavior and sociology literature. Within the 
organizational behavior literature, scholars have identified 
four principal motors of change: (1) life cycle—predefined 
steps in an organizational life cycle, (2) evolutionary—evo- 
lution through experimentation with new methods, (3) 
dialectical—political coalitions creating and frustrating 
change, and (4) teleological—management actions (Van de 
Ven and Poole 1995). At least 12 major research streams 
have been identified that incorporate one, two, or three of 
these change motors (Van de Ven and Poole 1995). Sociolo- 
gists have identified other modes of change, such as coup 
d’etat, insurgency, and mass mobilization (Zald and Berger 
1978). This literature describes these mechanisms and 
modes of change in great detail but does not offer any 
insight into how they might combine to enable an organiza- 
tion to realize a greater market orientation. 

Our research illustrates that creating a market-focused 
organization requires several of these change mechanisms 
or modes in a specific sequence. The change process begins 
with the formation of an elite group of insurgents who 
mobilize the masses to create a consensus for organizational 
change that is later formalized and sustained. This process 
suggests a unique combination of change mechanisms and 
motors. It begins with an insurgency, moves to mass mobi- 
lization, switches to a teleological change process, and con- 
tinues with an indoctrination process to sustain the orienta- 
tion. Although similar to previous descriptions—each of 
these stages is recognizable on the basis of similarities to 
other situations—the combination and sequence is unique. 

Moreover, each stage has unique characteristics that 
have not been previously identified. Consider the insurgents 
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who start the process. The sociological literature discusses 
an insurgency as a group of disempowered members of an 
organization who seek to topple the empowered. In con- 
trast, we observed an elite insurgency, an insurgency of the 
powerful. Likewise, mass mobilization is typically used by 
disempowered individuals or groups to overwhelm the pow- 
erful. In our analysis, however, the insurgents use their 
power to create a greater enthusiasm for change by mobiliz- 
ing the masses within the organization. Contrary to the con- 
ventional stereotype, they are “elitist revolutionaries” who 
plot to transform the culture and to create a new order that 
devolves power to the masses. 


Conclusion 


Market orientation is a centrally important idea to market- 
ing and a growing number of fields. Although the concept 


of market orientation has received considerable attention, 
how organizations develop a greater market orientation has 
received little attention. Our analysis of four firms success- 
fully making the transformation shows that organizations 
create a market orientation by engaging in a four-stage 
process of cultural transformation: (1) initiation, (2) recon- 
stitution, (3) institutionalization, and (4) maintenance. This 
process imbues the organization with a set of cultural values 
that support market-oriented activities, an organizationally 
shared understanding of the market, and organizational 
learning capabilities. Our results show that these cultural 
values are central to the new organization that emerges from 
this process and that the intraorganizational distribution of 
power and organizational learning play central, though 
overlooked, roles in creating and sustaining a market 
orientation. 
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Joann Peck & Jennifer Wiggins 


It Just Feels Good: Customers’ 
Affective Response to Touch and Its 
Influence on Persuasion 


Prior research has assumed that touch has a persuasive effect only if it provides attribute or structural information 
about a product. Under this view, the role of touch as a persuasive tool is limited. The main purpose of this research 
is to investigate the persuasive influence of touch as an affective tool in the absence of useful product-related 
information. The authors find that for people who are motivated to touch because it is fun or interesting, a 
communication that incorporates touch leads to increased affective response and increased persuasion, 
particularly when the touch provides neutral or positive sensory feedback. People who are not motivated to touch 
for fun will also be persuaded by a communication that incorporates touch when they are able to make sense of 
how the touch is related to the message. The authors explore the effectiveness of different types of touch in 
generating an affective response, and they replicate the effects on attitudes and behavior in a real-world setting. 
This research suggests that the marketing implications of touch are more substantial than previously believed. The 
authors present research implications for direct marketing, product packaging, point-of-purchase displays, and print 


advertising. 


ave a persuasive influence on customers' attitudes 
and behavior. Touching a product has been found to 
increase attitudes and purchase intentions toward the prod- 
uct and to increase the confidence in the evaluation of these 
products (Peck and Childers 20032). The need to touch in 
product evaluation has been linked to the placement of 
products in stores (Underhill 1999) and to the inability of 
certain products to be sold online (Citrin et al. 2003; 
McCabe and Nowlis 2003). 

Most applications in marketing focus on touch that pro- 
vides specific attribute information about the product. For 
example, the packages of both Paper Mate Dynagrip pens 
and Ove Glove have portions of the plastic cut out, allowing 
shoppers to explore and examine the grip of the pen and the 
unique fabric that makes the gloves heatproof. DuPont cre- 
ated a one-page advertisement for paper used to make 
overnight courier envelopes that stated, “Go Ahead: Tear 
this Page in Half’ (this appeared in Fortune Magazine, 


T^ opportunity to touch products has been shown to 
h 
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December 29, 1997). Attempting the near impossible task 
of tearing the page conveys the product benefit of the 
strength of the paper. Such efforts have been shown to have 
an effect on purchase behavior; for example, providing 
unwrapped rolls of toilet paper in a point-of-purchase dis- 
play that allowed customers to feel and compare the tex- 
tures of different brands resulted in large increases in sales 
for the store brand in a supermarket chain (Britain’s ASDA; 
see Lindstrom 2005). 

For touch to have an influence in marketing decisions 
and evaluations, must it provide product attribute informa- 
tion, or can hedonic aspects of touch also be persuasive? 
The importance of hedonic benefits to consumers in mar- 
keting has been recognized in the areas of sales promotions 
(e.g., Chandon, Wansink, and Laurent 2000) and retailing 
(e.g., Arnold and Reynolds 2003). In this article, we exam- 
ine whether the hedonic benefits of touch influence deci- 
sions independent of the information gathered through 
touch. We suggest that touch can create an affective 
response, which can influence a customer’s decision- 
making process even though the touch adds no product- 
related information to the decision. 

If the hedonic aspects of touch can increase persuasion, 
the use of touch in marketing may be more broadly applica- 
ble than previously believed. Thus far, efforts have been 
limited to touch that provides attribute information about a 
product; it is often the case that this kind of touch can be 
used effectively only in contexts in which customers can 
physically evaluate the product. However, the use of touch 
as a hedonic tool has the potential to be applied to a broad 
set of products and even services and in a wide variety of 
contexts that were previously unrecognized in studies of 
touch, including package design, print advertising, direct- 
mail advertising, and point-of-purchase displays. 
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Background and Hypotheses 


Instrumental and Autotellc Need for Touch 


Peck and Childers (2003b) find that the effects of touch are 
stronger for some people than for others. They identify indi- 
vidual differences in need for touch (NFT), that is, a per- 
son's preference for the extraction and utilization of infor- 
mation obtained through touch. Prior research (see Citrin et 
al. 2003; Peck and Childers 2003a) has found that some 
people prefer to evaluate products through touch and are 
more frustrated when shopping if they do not have the 
opportunity to touch products. The NFT is conceptualized 
as having two dimensions: instrumental NFT and autotelic 
NFT (Peck and Childers 2003b). People who are high in 
instrumental NFT use touch to gather information about a 
product to help them make judgments. They are more adept 
at gathering information through touch, and the opportunity 
to touch products provides them with access to relevant 
information they cannot gather through other means, such 
as reading descriptions of products or visually inspecting 
products (Peck and Childers 2003a). A customer who is 
high in instrumental NFT touches a sweater to learn if the 
material is thick enough to provide warmth. In addition to 
customers' *rational" information-gathering motives, some 
people shop for the sensory experiences (Holbrook and 
Hirschman 1982; Sherry 1990). In contrast, people who are 
high in autotelic NFT engage in touch because it is fun, 
interesting, or enjoyable, an experience that is more hedonic 
than instrumental. A person who is high in autotelic NFT 
often feels an irresistible need to engage in exploratory 
touch and is focused on the sensory aspects of touch as an 
end in and of itself (Peck and Childers 2003b). The cus- 
tomer who fingers the sleeve of a cashmere sweater that he 
or she has no intention of purchasing, simply because the 
cashmere feels pleasant to touch, is likely to be high in 
autotelic NFT. 

If touch that does not convey a product attribute can 
affect persuasion, we expect that this type of touch will be 
more effective for certain people. Specifically, we posit that 
a message that incorporates a touch element without prod- 
uct attribute information will be more persuasive for people 
who are high in autotelic NFT than a message with no touch 
element. However, we expect that people who are low in 
autotelic NFT will not find touch as inherently interesting 
or as irresistible. Consequently, we expect that a touch ele- 
ment that does not provide attribute information: will not 
influence people who are low in autotelic NFT. This leads 
to our baseline hypothesis: 


H,: A message that incorporates a touch element (versus a 
message with no touch element) will increase persuasion 
for people who are high in autotelic NFT but will not 
influence persuasion for those who are low ın autotelic 
NFT.! 


INote that persuasion includes measures of attitude toward the 
message and attitude toward the organization as well as behavioral 
measures. For simplicity, we use the term "persuasion." 


Autotelic NFT and the Sensory Feedback of the 
Touch Information 


It is probable that the type and valence of sensory feedback 
provided by the touch element influence its persuasiveness. 
Touch that produces positive sensory feedback has been 
shown to increase attitudes more for high-NFT people than 
for low-NFT people. In one study, high-NFT people exhib- 
ited a larger increase in attitudes when they touched a soft 
sweater that was pleasant to touch than when they touched a 
rough sweater that was less pleasant to touch (Peck 1999). 
The pleasant sensory experience of touching the soft 
sweater appears to have increased persuasion more for the 
high- than for the low-NFT participants. The affective 
nature of autotelic NFT suggests that people who are high 
in autotelic NFT are likely to have a stronger affective 
response to touch than those who are low in autotelic NFT. 
This implies that people who are high in autotelic NFT, in 
particular, are likely to be more susceptible to the increase 
in persuasion that comes from a pleasant touch experience. 

Positive affective responses have been shown to influ- 
ence attitudes and behavior. In advertising research, induc- 
ing a positive mood in viewers or generating a positive 
affective response has been found to increase attitude 
toward the ad (see Aaker, Stayman, and Hagerty 1986; 
Batra and Ray 1986; Brown, Homer, and Inman 1998; 
Burke and Edell 1989), time spent viewing the ad (Olney, 
Holbrook, and Batra 1991), and attitude toward the brand 
(Brown, Homer, and Inman 1998; Burke and Edell 1989; 
Holbrook and Batra 1987). Positive affective responses 
have also been shown to influence behavior directly. For 
example, experiencing pleasure (Cunningham 1979; Forbes 
and TeVault 1975; Isen and Levin 1972; Strahilevitz and 
Myers 1998) has been shown to increase people's likeli- 
hood of donating to charity significantly. Positive feelings 
have also been shown to increase people's willingness to 
participate in an experiment and to help people in need 
(Isen 1987). Finally, positive affect has been linked to the 
hedonic and experiential aspects of consumer behavior, 
increasing variety-seeking behavior, experiential shopping, 
and hedonic consumption (Cohen and Areni 1991; 
Hirschman and Stern 1999; Kahn and Isen 1993). This sug- 
gests that engaging in touch that creates a positive affective 
response is likely to lead to more positive attitudes and 
greater behavioral intentions toward a product. 

Therefore, we expect that people who are high in 
autotelic NFT will exhibit an increase in persuasion when 
they are exposed to marketing messages that incorporate a 
touch element with positive sensory feedback. In contrast, 
people who are low in autotelic NFT are more likely to 
process touch information that is included in a marketing 
message in the same way that they process any other infor- 
mation in the message and, therefore, will be unlikely to 
experience a persuasive effect of positive sensory feedback. 
Thus, touch should influence people who are low in 
autotelic NFT only if it provides information that helps 
them interpret the message. Exposure to a touch element 
that does not provide useful information is unlikely to 
increase the attitudes of such people. This leads to our next 
hypothesis: 
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H>: Compared with a message with no touch element (control 
condition), a message that includes a touch element with 
positive sensory feedback will increase persuasion more 
for people who are high (but not low) in autotelic NFT 
than will a message that includes a touch element with 
neutral or negative sensory feedback. 


Congruence Between the Touch Element and the 
Message 


Consumers’ responses to additional information that is 
incorporated into a marketing message, such as a picture or 
a touch element, have been shown to be influenced by the 
congruence between the additional information and the 
message. Heckler and Childers (1992) define “congru- 
ence” as having two components: expectancy, or the degree 
to which the additional information falls into the pattern or 
structure of the message, and relevancy, or the degree to 
which the additional information contributes to or detracts 
from the theme of the message. Information that is unex- 
pected, such as humor, has been shown to increase favor- 
able evaluations and recall of an advertisement (Lee and 
Mason 1999). Conversely, irrelevant information has been 
shown to have a negative effect on both the evaluation and 
the recall of an advertisement (Heckler and Childers 1992; 
Lee and Mason 1999). This implies that incorporating a 
touch element into a communication in which it would be 
unexpected, such as a print advertisement or direct-mail 
brochure, may lead to a favorable evaluation of the message 
and increase attitude toward the message, depending on the 
relevancy of the touch element to the message. 

Lee and Mason (1999) suggest that the difficulty with 
irrelevant information is that a person is unable to perceive 
the connection between the information and the message 
and, therefore, becomes frustrated. This implies that if 
people who are low in autotelic NFT are processing a touch 
element as part of the overall message, an irrelevant touch 
element will create confusion and frustration and thus 
detract from their attitude toward the message. However, if 
people who are high in autotelic NFT are simply respond- 
ing affectively to the touch element and not processing it as 
part of the message, the negative effect of irrelevancy is 
likely to be attenuated. This leads to our third hypothesis: 


H3,: For people who are high in autotelic NFT, a message that 
includes a touch element will be more persuasive than a 
message that does not include a touch element, regardless 
of the congruency between the touch element and the 
message. 

Hay: For people who are low in autotelic NFT, a message that 
1ncludes a touch element will be more persuasive than a 
message that does not include a touch element only if the 
touch element is congruent with the message. 


2The term “congruence” has also been used to describe the rela- 
tionship between a piece of information that is being processed 
and a person's information-processing style (see Higgins 2000). 
Our definition of congruence emphasizes the congruence between 
the touch information and the message into which it is incorpo- 
rated and does not imply congruence with the person who is pro- 
cessing the message. 
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Study 1 


The purpose of Study 1 was to test H;—-H3 and to determine 
the effects of incorporating a touch element that does not 
provide product attribute information into a communica- 
tions message on people who are both high and low in 
autotelic NFT. Study 1 was a 2 (touch element: present ver- 
sus absent) x 2 (autotelic NFT: high versus low [determined 
by a median split]) design; the first factor was manipulated 
between subjects, and the second factor was measured 
between subjects. Nested within the touch-element condi- 
tion were two levels of congruence between the touch ele- 
ment and the message (congruent and incongruent [deter- 
mined by a pretest]) and three levels of sensory feedback 
(positive, neutral, and negative [determined by a pretest]); 
both were manipulated between subjects. 


Varlables and Procedure 


Three hundred forty-five undergraduate students partici- 
pated in exchange for extra credit in a marketing class. Each 
participant read a pamphlet with the same message that 
requested that the participant make a donation of time or 
money to an arboretum located in the Midwest (see Appen- 
dix A). We asked participants to list their thoughts as they 
read the pamphlet. In the touch-element conditions, a touch 
element was attached to the front of the pamphlet. We used 
six touch elements in all; we varied the sensory feedback or 
valence provided by the touch element and the congruence 
between the touch element and the message across 
conditions. 

We evaluated congruence of the touch element with the 
message and sensory feedback through pretests. We pre- 
sented each participant in the congruency pretest (n — 56) 
with three of the six sample touch elements (we counterbal- 
anced the order and found no order effects) and the pam- 
phlet with the arboretum appeal and asked them to rate the 
congruence of each sample touch element with the appeal. 
We measured congruence using two questions: "The 'fit' of 
this touch sample with the pamphlet is...?” with endpoints 
“very bad/very good” and “very unfavorable/very favor- 
able,” each of which were seven-point scales. We averaged 
the two measures of congruence (all r’s > .85) to obtain one 
measure of congruence for each touch element. Participants 
rated three touch elements as highly congruent with the 
message of the pamphlet: a feather (M = 5.18), tree bark 
(M = 5.52), and sandpaper (M = 4.14), and none was sig- 
nificantly higher than the others on congruence (ps > .05). 
Participants rated three other touch elements as being low 
on congruence with the message: a soft silver swatch (M = 
2.42), a slightly textured black-and-gold swatch (M = 1.90), 
and steel wool (M = 2.62), and none was significantly dif- 
ferent from the others (all ps > .05). The three congruent 
touch elements were all significantly higher on congruency 
than the incongruent touch elements. 

A second pretest evaluated the valence of the sensory 
feedback provided by touching each touch element. Thirty- 
Six participants rated how pleasant each of the touch ele- 
ments was to touch on a seven-point scale. For the congru- 
ent touch elements, participants rated the feather as 
extremely positive (M = 5.68), the tree bark as neutral (M = 


4.01), and the sandpaper as negatively valenced (M = 2.90). 
Using paired valence t-tests, we obtained the following 
results: feather versus tree bark: t(35) = 5.84, p < .01; 
feather versus sandpaper: t(35) = 8.45, p < .01; and tree 
bark versus sandpaper: t(35) = 2.42, p < .01. Among the 
incongruent touch elements, participants rated the. soft sil- 
ver swatch as extremely positive (M = 5.59), the slightly 
textured black-and-gold swatch as neutral (M = 4.04), and 
the steel wool as negatively valenced (M = 2.90). Again, 
using paired valence t-tests, we obtained the following 
results: soft silver swatch versus black-and-gold swatch: 
t(35) = 6.53, p < .01; soft silver swatch versus steel wool: 
t(35) = 10.60, p < .01; and slightly textured black-and-gold 
swatch versus steel wool: t(35) = 7.80, p < .01.3 After read- 
ing the pamphlet, participants in Study 1 completed a ques- 
tionnaire, which included measures of their attitudes toward 
the message and the arboretum and their willingness to 
donate time or money to the arboretum.4 We used familiar- 
ity with the arboretum and prior donation behavior in terms 
of time and money as covariates in the analyses; we found 
no significant effects on the results. Next, under the guise of 
a different study, participants completed the autotelic com- 
ponent of the NFT scale (6 items) from the 12-item NFT 
scale (Peck and Childers 2003a; for scale items, see Appen- 
dix B).5 The NFT scale measures dimensions of both instru- 
mental NFT and autotelic NFT. As Peck and Childers 
(2003b) recommend, depending on the underlying theory, 
researchers might choose to employ just one dimension of 
the NFT scale. Finally, we thanked participants and 
debriefed them. 


Results 


We found support for H,, which predicted that a touch ele- 
ment would increase persuasion for people who are high in 
autotelic NFT but not for those who are low in autotelic 
NFT. We measured both attitude toward the pamphlet and 





3We conducted one final pretest to ensure that vision did not 
influence the touch valence ratings. Sixteen participants evaluated 
the valence of the six touch elements without vision. We put each 
touch element in an enclosed box and instructed the participant to 
reach in and touch the six touch elements, which we presented one 
at a time with the order counterbalanced. We found no significant 
order effects, and the valence ratings mirrored those we found in 
the presence of vision. 

4Because we used undergraduate students, we also included a 
measure that stated, “This pamphlet would encourage other people 
to be more likely to donate time or money to the Arboretum.” This 
question elicited slightly higher means than the students’ own will- 
ingness to donate, but we obtained the same pattern of results, so 
we do not report these. 

>We were concerned that the autotelic dimension of the NFT 
scale would be too focused on product touch to capture the effects 
in this context. We administered an additional scale, which 
included items modified from the autotelic NFT scale to move 
away from referring to actual product touch (for a list of autotelic 
NFT items and additional touch items [called “Funtouch”], see 
Appendix B). We also analyzed all studies using the Funtouch 
items and found the same results as we did for the autotelic NFT 
items. For simplicity, we report only the autotelic NFT results. 


attitude toward the organization on three seven-point scales 
(“What is your overall feeling toward the pamphlet/organi- 
zation?" anchored by "very unfavorable/very favorable," 
“very bad/very good,” and "very negative/very positive"). 
We averaged these (all as > .90) for a measure of attitude 
toward the pamphlet and attitude toward the organization. 
With attitude toward the pamphlet as the dependent 
variable, the interaction between whether a touch element 
was present (yes/no) and autotelic NFT was significant 
(F(1, 339) = 5.25, p < .05, e? = .02). There was no signifi- 
cant main effect for the touch/no touch element or for 
autotelic NFT (ps > .05). Using planned contrasts, we found 
that participants who were high in autotelic NFT had a 
more positive attitude toward the pamphlet when a touch 
element was present than when it was absent (Ms = 5.32 
and 4.83, respectively; F(1, 339) = 4.93, p < .05, œ = .04; 
see Table 1), whereas a touch element had no effect on par- 
ticipants who were low in autotelic NFT (M - 5.13 for 
touch and 5.40 for no touch; F(1, 339) = 1.19, p > .05). We 
obtained the same pattern of results for the likelihood of 
donating time or money to the organization (see Table 1); 
the interaction between touch/no touch and autotelic NFT 
was significant (F(1, 339) = 6.16, p < .05, œ = .02), but no 
main effects were significant (ps > .05). Participants who 
were low in autotelic NFT were unaffected by a touch ele- 
ment (Ms = 3.34 for touch and 3.54 for no touch; F(1, 
339) = .37, p > .05), and participants who were high in 
autotelic NFT were significantly more willing to donate 
time or money when a touch element was included in the 
appeal (Ms = 3.65 and 2.75; F(1, 339) = 9.25, p < .05, 92 = 
.05). The presence of the touch element did not affect atti- 
tude toward the organization; there were no significant main 
effects or interaction effects, implying that the persuasive 
influence of the touch element may be restricted to the spe- 
cific message that incorporates the touch element and not 
extended to the organization that is sponsoring the message. 

H, predicted that for people who are high (but not low) 
in autotelic NFT, a message that included a touch element 
with positive sensory feedback (versus a message with no 
touch element [control condition]) would increase persua- 
sion more than a message that included a touch element 
with neutral or negative sensory feedback. We found partial 
support for this hypothesis. Using planned comparisons, for 
both attitude toward the pamphlet and likelihood of donat- 
ing time or money, we found that participants who were 
high in autotelic NFT were most persuaded by touch infor- 
mation that provided positive sensory feedback compared 
with the no-touch-element control condition (attitude 
toward the pamphlet: Ms = 5.49 and 4.83; F(1, 329) = 6.32, 
p < .05, œ = .05; likelihood of donating time or money: 
Ms = 3.87 and 2.75; F(1, 329) = 10.09, p « .05, w2 = .04; 
see Table 1). À touch element with neutral sensory feedback 
also significantly increased persuasion for participants who 
were high in autotelic NFT compared with the control con- 
dition (attitude toward the pamphlet: Ms = 5.32 and 4.83; 
F(1, 329) = 4.21, p < .05, Q2 = .04; likelihood of donating 
time or money: Ms = 3.79 and 2.75; F(1, 329) = 9.38, p< 
.05, œ = .03), whereas a touch element providing negative 
sensory feedback did not significantly influence persuasion 
for participants who were high in autotelic NFT (all ps » 
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TABLE 1 
Attitude Toward the Pamphlet and Likelihood of Donating Time or Money x Autotelic NFT, Congruence, 


and Valence: Study 1 Means 
BENE LL. 1!..'—L.: 0. 


Low Autotelic NFT High Autotelic NFT 








Valence Incongruent Congruent Total Incongruent Congruent Total 
A: Attitude Toward Pamphlet 
Negatlve 4.678 5.06 4.908 5.15 5.21 5.17 
n= 19 n= 29 n= 48 n= 31 n=21 n= 52 
Neutral 5.25 5.60 5.41 5.29 5.34b 5.32b 
n = 24 n=21 n= 45 n=25 n=28 n= 53 
Positive 4.748 5.53 5.10 5.48b 5.50b 5.49b 
n=26 n=22 n= 48 n=21 n=24 naz45, 
Total 4.908 5.36 5.28b 5.36) 
n= 69 n=72 n=77 n=73 
Touch-element total 5.13 5.326 
n= 141 n= 150 
No-touch-element total 5.40 4.83 
n= 24 n=28 


Negative 2.74 3.14 
Neutral 3.04 . 3.86 
Positive 3.35 3.95 
Total 3.07 3.60 


Touch-element total 
No-touch-element total 


2.98 3.29 3.38 3.33 

3.42 3.88¢ 3.71¢ 3.79¢ 

3.62 3.90¢ 3.83¢ 3.87¢ 
3.65° 3.66¢ 

3.34 3.65¢ 

3.54 2.75 


aCell means are significantly different from the control (no touch element) mean of 5.40 for low autotelic NFT. 

bCell means are significantly different from the control (no touch element) mean of 4.83 for high autotelic NFT. 

cCell means are significantly different from the control (no touch element) mean of 2.75 for high autotelic NFT. 

Notes: All comparisons are based on planned contrasts. All scales are seven-point scales. For likelihood of donating time or money, cell means 
are significantly different from the contro! (no touch element) mean of 3.54 for low autotelic NFT. 


.05). For participants who were low in autotelic NFT, nega- 
tive sensory feedback significantly decreased persuasion for 
attitude toward the pamphlet compared with the no-touch- 
element control condition (Ms = 4.90 and 5.40; F(1, 329) = 
3.94, p < .05, a = .02). However, for likelihood of donating 
time or money, persuasion was unaffected (p » .05; for 
means, see Table 1). In addition, for participants who were 
low in autotelic NFT, positive or neutral touch elements did 
not influence persuasion compared with the no-touch- 
element condition. 

Although not specifically hypothesized, we can exam- 
ine the effect of valence within the touch-element condi- 
tions. Within the touch-element condition, we performed a 
3 (valence) x 2 (congruency) x 2 (autotelic NFT) analysis; 
both attitude toward the pamphlet and likelihood of donat- 
ing time or money were dependent variables. For both 
dependent variables, there was a significant main effect for 
valence of the touch element (attitude toward the pamphlet: 
F(2, 279) = 3.16, p « .05, a = .02; likelihood of donating 
time or money: F(2, 279) = 4.84, p < .05, œ = .04). There 
was also a significant main effect of congruence, and a two- 
way interaction between congruency and autotelic NFT was 
significant for both dependent variables (we discuss this 
further with the congruence results); no other interactions 
were significant. When we used a touch element, a posi- 
tively valenced or neutral valenced touch element was more 
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persuasive than a touch element that provided negative sen- 
sory feedback. For attitude toward the pamphlet, a neutral 
or positively valenced touch element resulted in a signifi- 
cantly more positive attitude toward the pamphlet than a 
negatively valenced touch element (negative [M = 5.04] 
versus positive [M = 5.29] touch element: F(2, 279) = 2.85, 
p < .05, à? = .02; negative versus neutral [M = 5.36] touch 
element: F(2, 279) = 3.65, p < .05, m2 = .02). The difference 
between the mean attitude toward the pamphlet for a neutral 
touch feedback element and the mean attitude toward the 
pamphlet for a positive touch feedback element was not sig- 
nificant (p » .05). 

With likelihood of donating time or money as the 
dependent variable, we found parallel results. A neutral or 
positive feedback from the touch element resulted in a sig- 
nificantly greater likelihood of donating time or money to 
the organization than a touch element that provided nega- 
tive feedback (negative [M = 3.16] versus positive [M = 
3.74] touch element: F(2, 279) = 623, p < .01, œ = .04; 
negative versus neutral [M = 3.62] touch element: F(2, 
279) = 4.84, p < .01, a = .04). As with attitude toward the 
pamphlet, the difference between the mean likelihood of 
donating time or money to the organization with a touch 
element that provided neutral feedback and a touch element 
that provided positive feedback was not significant (p » 
.05). 


In addition to valence, we examined the effect of the 
congruency between the touch element and the message. 
We expected that for participants who were high in-autotelic 
NFT, the congruency between the touch element and the 
message would not matter; both would be more persuasive 
than the no-touch-element control (H3,). We found support 
for this hypothesis. For both dependent variables, pam- 
phlets with both a congruent and an incongruent touch ele- 
ment were more persuasive than the control condition (con- 
trol versus congruent: attitude toward the pamphlet: Ms = 
4.83 and 5.36; F(1, 329) = 4.61, p < .05, o? = .02; likeli- 
hood of donating time or money: Ms = 2.75 and 3.66; F(1, 
329) = 7.52, p < .05, à? = .04; control versus incongruent: 
attitude toward the pamphlet: Ms = 4.83 and 5.28; F(1, 
329) = 3.91, p < .05, a = .02; likelihood of donating time 
or money: Ms = 2.75 and 3.65; F(1, 329) = 8.46, p < .05, 
Q? = .04). 

For participants who were low in autotelic NFT, we 
expected that a message with a congruent touch element 
would be more persuasive than a message with an incongru- 
ent touch element, compared with the no-touch-element 
condition (Ha45). We found partial support for this. An 
incongruent touch element significantly decreased attitude 
toward the pamphlet compared with the no-touch-element 
control condition (Ms = 4.90 and 5.40; F(1, 329) = 4.08, p « 
.05, e? = .02), but it did not significantly influence likeli- 
hood of donating time or money (Ms = 3.07 and 3.54; p > 
.05). In addition, the presence of a congruent touch element 
did not significantly influence persuasion compared with 
the control condition (ps > .05; for means, see Table 1). 

Again, within the touch-element conditions, a 3 
(valence) x 2 (congruency) x 2 (autotelic NFT). analysis 
yielded not only a significant main effect for valence of the 
touch element (as we previously reported) but also a signifi- 
cant main effect of congruence (attitude toward the pam- 
phlet: F(1, 279) = 5.06, p < .05, œ = .02; likelihood of 
donating time or money: F(1, 279) = 2.65, p = .10, œ = 
.01). Most important, the expected interaction between con- 
gruency and autotelic NFT was significant for both depen- 
dent variables (attitude toward the pamphlet: F(1, 279) = 
3.59, p < .05, à? = .02; likelihood of donating time or 
money: F(1, 279) = 3.66, p « .05, a = .02); no other inter- 
actions were significant. In the touch-element condition, 
comparing congruent and incongruent touch elements 
yielded different results for participants who were:high and 
those who were low in autotelic NFT. For those who were 
high in autotelic NFT, the congruency of the touch element 
to the communication did not influence either attitude 
toward the pamphlet (Ms = 5.28 and 5.36; F(1, 329) = .06, 
p > .05) or likelihood of donating time or money to the 
organization (Ms = 3.34 and 3.54; F(1, 329) = .04,ip > .05). 
However, participants who were low in autotelic NFT had a 
more positive attitude toward the pamphlet when the pam- 
phlet included a congruent touch element than when it 
included an incongruent touch element (Ms = 5.36 and 
4.90; F(1, 329) = 7.70, p « .05, o = .03), and they were 
also more likely to donate time or money to the organiza- 
tion in the congruent-touch-element condition than in the 
incongruent-touch-element condition (Ms = 3.60 and 3.07; 
F(1, 329) = 5.98, p < .05, a? = .02; see Table 1). It appears 


that for participants who were high in autotelic NFT, both 
congruent and incongruent touch elements had a positive 
effect on both attitude toward the message and likelihood of 
donating time or money, but for participants who were low 
in autotelic NFT, a touch element that was incongruent with 
the message had a potentially negative effect. 

A content analysis of the participants’ thoughts about 
the congruent and incongruent touch elements supported 
the predictions of H4, and H3,. Of the participants, 69 
(20%) expressed confusion about the touch element and 
frustration at being unable to make sense of the connection 
between the touch element and the message. Of these par- 
ticipants, 52 were in the incongruent-touch-element condi- 
tion, suggesting that participants indeed had difficulty mak- 
ing sense of an incongruent touch element and became 
frustrated as a result. Notably, there was no significant dif- 
ference between participants who were high and those who 
were low in autotelic NFT in expressing confusion; 34 par- 
ticipants who were high in autotelic NFT and 35 partici- 
pants who were low in autotelic NFT expressed confusion 
about the touch element. However, although the participants 
who were low in autotelic NFT exhibited a decrease in atti- 
tude toward the message in the incongruent condition, the 
participants who were high in autotelic NFT demonstrated 
no difference in attitude between the congruent and the 
incongruent conditions, suggesting that though the partici- 
pants who were high in autotelic NFT were confused by the 
incongruent touch element, this confusion did not influence 
their attitudes. 


Discussion of Study 1 


For participants who were high in autotelic NFT, incorpo- 
rating a touch element that conveys no product attribute 
information into a message increased persuasion. This 
effect occurred regardless of the congruence between the 
touch element and the message and was stronger for a touch 
element that provided neutral or positive sensory feedback 
than for a touch element that provided negative sensory 
feedback and compared with the no-touch-element control 
condition. However, participants who were low in autotelic 
NFT showed no differences in persuasion between the 
touch and the no-touch-element conditions; they responded 
negatively to the inclusion of a touch element that was 
incongruent with the message of the appeal. The results of 
Study 1 are consistent with the theory that touch creates an 
affective response for people who are high in autotelic NFT 
that, in turn, influences persuasion, whereas people who are 
low in autotelic NFT process touch as part of the message, 
and touch is persuasive only if it provides useful informa- 
tion. This implies that the relationship among touch, affec- 
tive response, and persuasion may be a moderated media- 
tion; that is, for people who are high in autotelic NFT, touch 
creates an affective response that mediates the relationship 
between touch and persuasion, but for people who are low 
in autotelic NFT, this mediation effect does not occur (see 
Figure 1). This leads to our fourth hypothesis: 


H4: The increased persuasion resulting from a message that 
incorporates a touch element is mediated by affective 
response for people who are high in autotelic NFT but not 
for those who are low in autotelic NFT. 
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FIGURE 1 
Moderated Mediation Affect of Touch on Affective Response and Persuasion 


Persuasion 





Study 2 


The purpose of Study 2 was to measure the affective 
response in both people who were high and those who were 
low in autotelic NFT to a message that incorporated a touch 
element and to determine whether this affective response 
mediated the relationship between touch and persuasion. 
We also attempted to find additional support for H,, which 
predicted that a touch element would be persuasive for 
people who are high in autotelic NFT but not for those who 
are low in autotelic NFT. Study 2 was a 2 (autotelic NFT: 
high versus low [determined by a median split]) x 2 (touch 
element: present versus absent) design; the first factor was 
measured between subjects, and the second factor was 
manipulated between subjects. 


Varlables and Procedure 


Two hundred four undergraduate students participated in 
the study in exchange for extra credit in'a marketing class. 
Each participant read a pamphlet with the same message 
that requested that he or she make a donation of time or 
money to a fictional charity called Spread the Warmth, 
which provided blankets to needy families during the winter 
(see Appendix A). In the touch condition, a 4 x 4 inch 
swatch of black fleece fabric that was similar to the texture 
of a blanket was attached to the pamphlet; the control con- 
dition had no touch element. After reading the pamphlet, 
participants completed a questionnaire that included mea- 
sures of their affective response to the pamphlet (for scale 
items, see Appendix C), attitude toward the pamphlet, atti- 
tude toward the organization, and willingness to donate 
time or money to Spread the Warmth.6 Next, under the 
guise of a separate study, we administered the six-item 


6In Study 2, we measured affective response using the two 
scales in Appendix C. We obtained parallel results for both scales. 
For simplicity, we report only the results of the first scale. In Study 
3, we measured affective response using the emotional reaction 
scale because we determined that this scale had greater relevance, 
given the task in which participants were engaged. 
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autotelic NFT scale. Finally, we thanked the participants 
and debriefed them. 


Results 


With attitude toward the pamphlet as the dependent 
variable, there was a main effect for whether the touch ele- 
ment was present (F(1, 200) = 5.25, p « .05, 92 = .05), a 
main effect of autotelic NFT (F(1, 200) = 13.54, p « .01, 
œ? = .08), and a significant interaction between the two 
(F(1, 200) = 4.24, p < .05, à? = .04; for means, see Table 2). 
Participants who were high in autotelic NFT had a more 
positive attitude toward the message and a greater likeli- 
hood of donating time or money when the pamphlet 
included a touch element than when it did not (attitude 
toward the message: Ms = 6.04 and 5.46; F(1, 200) = 10.39, 
p < 01, €? =.06; likelihood of donating time or money: 
Ms = 5.70 and 4.56; F(1, 200) = 20.75, p < .001, o = .09), 
in support of H}. In contrast, participants who were low in 
autotelic NFT had no significant difference in attitude 
toward the message between the touch and the no-touch- 
element conditions (Ms = 5.27 and 5.24; F(1, 200) = .02, 
p > .05). Participants who were low in autotelic NFT 
showed a significant increase in their likelihood of donating 
time or money in the touch condition (Ms = 5.00 and 4.36; 
F(1, 200) = 5.41, p < .05, a = .03). For both dependent 
variables, there was a significant difference between people 
who were high and those who were low in autotelic NFT in 
the touch condition, suggesting that those who were high in 
autotelic NFT responded more strongly to the touch ele- 
ment than those who were low in autotelic NFT (attitude 
toward the message: Ms = 6.04 and 5.27; F(1, 200) = 15.67, 
p < .01, o? = .07; likelihood of donating time or money: 
Ms = 5.70 and 5.00; F(1, 200) = 6.78, p < .05, m? = .04; see 
Table 2). Attitude toward the organization was unaffected 
by the touch element (p > .05). 

We conducted Study 2-to examine directly the differ- 
ence in affective response to the touch element between 
people who were high and those who were low in autotelic 
NFT. As expected, we found that participants who were 
high in autotelic NFT had a stronger affective response to 


TABLE 2 
Attitude Toward the Message and Behavioral Intentions x Touch and No-Touch-Element Condtlons x High 
and Low Need for Autotellc Touch: Study 2 Means (Standard Deviatlons) 


Low Autotelic NFT 


No Touch 
Element 


Attitude toward the pamphlet |. 5.24 
Attitude toward the organization 6.22 


Affective response 


Likelihood of donating time or money 4.368 


High Autotellc NFT 


Touch No Touch Touch 
Element Element Element 
n= 39 n = 50 n = 60 
5.27b 5.468 6.048, b 

(1.02) (1.01) (.63) 
5.94a 6.33 6.358 
(.89) (.74) (.84) 
4.725 4.54a 5,178 b 
(1.03) (1.14) (1.08) 
5.008 c 4.56b 5.705 c 
(1.23) (1.53) (.79) 


Notes: Numbers with the same superscript in the same row are significantly different at p = .05, means are based on seven-point scales. 


the message in the touch condition than in the no-touch- 
element condition (Ms = 5.17 and 4.54; F(1, 200) = 9.56, 
p < -05, œ = .05), and those who were low in autotelic NFT 
had no significant differences in affective response to the 
message, depending on the presence of a touch element 
(Ms = 4.72 and 4.33; F(1, 200) = 3.06, p > .05; see\Table 2). 
This is consistent with the proposed view that people who 
are high, but not those who are low, in autotelic NFT are 
influenced by affective response to touch. 

We hypothesized that a touch element that influences 
persuasion would be mediated by a person's affective 
response to the touch element. However, we expected 
autotelic NFT to moderate tbis relationship (see Figure 1). 
Using LISREL 8 (Jóreskog and Sórbom 1993), we first esti- 
mated a path model using the full sample. For attitude 
toward the pamphlet, the path model supported complete 
mediation. The path from the touch element to affective 
response was significant (D = .23, t = 3.41, p « .05), the path 
from affective response to attitude toward the pamphlet was 
significant (B = .50, t = 8.13, p < .05), and the path from the 
touch element to attitude toward the pamphlet was not sig- 
nificant in the presence of the mediator (B = .08, t = 1.33, 
p > .05). The results were similar to the dependent variable 
of likelihood of donating time or money to the organization. 
The full model showed evidence of partial mediation. The 
path from the touch element to affective response was sig- 
nificant (B = .23, t = 3.41, p « .05), as was the path from 
affective response to the likelihood of donating time or 
money (B = .53, t = 9.45, p « .05). The path from the touch 
element to donating time or money remained significant in 
the presence of the mediator, indicating that in addition to 
mediation, there was a direct effect of the touch element on 
donating time or money to the organization (B = .22, t = 
3.90, p « .05). 

To gain further insight, we estimated simultaneous path 
models for both participants who were high and those who 
were low in autotelic NFT for each of the two dependent 
variables. In both cases, partial mediation was supported for 
those who were high in autotelic NFT, but mediation was 
not supported for those who were low in autotelic NFT. 
With attitude toward the pamphlet as the dependent 


variable, for those who were high in autotelic NFT, the path 
from the touch element to affective response was significant 
(B = .26, t = 2.91, p « .05), as was the path from affective 
response to attitude toward the pamphlet (B = .40, t = 5.23, 
p « .05). Finally, the path from the touch element to attitude 
toward the pamphlet remained significant (B = .20, t = 2.59, 
p < .05), in support of partial mediation. In contrast, for par- 
ticipants who were low in autotelic NFT, the path from the 
touch element to affective response was not significant (B — 
.19, t= 1.40, p > .05); thus, mediation of affective response 
for participants who were low in autotelic NFT was not sup- 
ported. The path from affective response to attitude toward 
the pamphlet was significant (B = .81, t= 5.38, p « .05), and 
there was no significant effect for the touch element on atti- 
tude toward the pamphlet (B = —.12, t = .06, p > .05). 

The results were parallel for the dependent variable of 
the likelihood of donating time or money to the organiza- 
tion. For participants who were high in autotelic NFT, the 
path from the touch element to affective response was sig- 
nificant (B = .21, t = 3.11, p < .05), as was the path from 
affective response to the likelihood of donating time or 
money (D = .32, t= 4.17, p < .05). In addition, the path from 
the touch element to the likelihood of donating time or 
money was significant (B = .32, t = 4.17, p < .05). However, 
for participants who were low in autotelic NFT, the path 
from touch to affective response was not significant (B = 
.17, t = 1.56, p > .05), in support of differential processes 
for people who are high and those who are low in autotelic 
NFT. For participants who were low in autotelic NFT, the 
path between the touch element and the likelihood of donat- 
ing time or money was also not significant (B = .13, t = 
1.56, p » .05). Not surprisingly, the relationship between 
affective response and donating time or money to the orga- 
nization was significant (D = .70, t = 8.90, p < .05). 


Discussion of Study 2 

Study 2 replicated the finding that participants who were 
high in autotelic NFT had a more positive attitude toward 
messages that incorporated touch elements than toward 
messages that did not incorporate touch elements. However, 
this difference did not occur among those who were low in 
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autotelic NFT. We also found that incorporating a touch ele- 
ment increased likelihood of donating time or money 
among both participants who were high and those who were 
low in autotelic NFT. This study also directly examined the 
affective response of those who were high and those who 
were low in autotelic NFT when encountering a touch ele- 
ment. Only participants who were high in autotelic NFT 
had a significantly stronger affective response to the mes- 
sage with the touch element than to the message with no 
touch element. In effect, for participants who were high in 
autotelic NFT, the touch element evoked an affective 
response beyond that which the message evoked. We found 
that this affective response mediated the relationship 
between touch and persuasion for participants who were 
high in autotelic NFT but not for those who were low in 
autotelic NFT. 


Study 3 


In Study 3, we tested whether the findings of Studies 1 and 
2 could be replicated in a real-world situation, and we 
investigated the generalizability of our experimental find- 
ings. We partnered with a midwestern children’s museum, a 
“hands-on” museum targeted to children aged eight and 
under. As a nonprofit organization, the museum often uses 
direct-mail appeals to solicit donations and museum mem- 
berships. Many of its publicity and fundraising materials 
stress the museum’s unique hands-on features that encour- 
age children to touch, making a touch element relevant to 
its message. However, the museum had never before incor- 
porated a touch element into one of its direct-marketing 
efforts. Working with the fundraising staff at the museum, 
we designed a brochure to solicit new memberships. The 
cover of the brochure featured a picture of a cuddly looking, 
spotted, cartoon dinosaur with a child reaching out to touch 
it (see Figure 2). We then incorporated a touch element into 
half of the brochures. Because Studies 1 and 2 suggested 
that touch elements that provide positive sensory feedback 
are more persuasive, we used a touch element that was soft 
and pleasant to touch. For people in the touch-element con- 
dition, the largest spot on the dinosaur was a circle of red 
faux fur, which was found in a pretest to be soft and pleas- 
ant to touch. For people in the no-touch-element condition, 
the spot was printed like all the other spots on the dinosaur. 

Study 3 was a 2 (autotelic NFT: high versus low [deter- 
mined by a median split]) x 2 (touch element: present ver- 
sus absent) design; the first factor was measured between 
subjects, and the second factor was manipulated between 
subjects. 


Variables and Procedure 


We sent the mailing to a purchased list of approximately 
2000 families in zip code areas with children under the age 
of eight who were not currently members of the children’s 
museum. One thousand people on the mailing list received 
a brochure with a touch element, and the other thousand 
received the same brochure without a touch element. Each 
of the 2000 recipients of the brochure also received a ques- 
tionnaire in the same package, which included questions 
about the recipient's attitude toward the message and likeli- 
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FIGURE 2 
Brochure for Study 3 





hood of donating time or money, control questions about 
the recipient's prior donations to the museum, and the 
autotelic NFT scale. We included prior donation behavior 
and prior visits to the museum as covariates; neither was 


significant (p > .05). 
Sample Description 


Of the 2000 questionnaires, 116 were returned, for a 
response rate of 5.8%. According to the Direct Marketing 
Association (2003), the overall average response rate for 
direct mail, including mailings to both house and prospect 
files, is 2.5496, suggesting that our response rate is reason- 
able. To help eliminate the concern of nonresponse bias, we 
compared questionnaires returned in the first week (n — 79, 
or 6896) with those returned in the subsequent three weeks; 
we found no significant differences in demographics or in 
independent or dependent variables. Of the responses, 61 


TBecause of survey length restrictions the organization 1mposed, 
we included only autotelic NFT. 


(53%) were from the touch-element condition, and 55 
(47%) were from the no-touch-element condition. The 
majority of our respondents were between the ages of 35 
and 44 (53%), followed by ages 25-44 (37%), 45-54 (9%), 
and 60-64 (1%). The mailing-list company stated that 55% 
of its list were between the ages of 35 and 44 (compared 
with our 53%). Female respondents significantly: outnum- 
bered male respondents (102 women [88%] and 14 men 
[12%]). ! 


Results 


We expected that the touch element would increase attitude 
toward the message and behavior for participants who were 
high but not for those who were low in autotelic NFT. We 
hoped that we would be able to obtain a measure:of actual 
membership resulting from the mailing. Unfortunately, only 
3 of the 116 respondents that returned the survey became 
members of the museum, so we were unable to use this 
measure. We measured behavioral intentions through a 
question that asked, “After reading the brochure, how likely 
are you to become a member of the Children’s Museum?" 
with endpoints “very unlikely” (1) and “very likely” (7). 
There was a main effect of the touch element on both atti- 
tude toward the pamphlet and likelihood of becoming a 
member of the museum (attitude toward to pamphlet: Ms — 
4.73 and 5.34; no touch element versus touch element: F(1, 
112) = 7.42, p < .05, a? = .04; likelihood of becoming a 
member: Ms = 3.25 and 3.92; F(1, 112) = 4.68, p < .05, 
a? = .03). However, this was qualified by a significant inter- 
action. In support of H,, the expected interactions between 
the presence or absence of a touch element and: autotelic 
NFT for both attitude toward the brochure and likelihood of 
becoming a museum member were significant’ (attitude 
toward the brochure: F(1, 112) = 10.57, p < .01, œ? = .09; 
likelihood of becoming a member: F(1, 112) = 5.23, p < 
.05, @ = .05). 

Using planned contrasts, we found that participants who 
were high in autotelic NFT had a more positive attitude 
toward the pamphlet and would be more likely to become a 
member of the museum when a touch element wás present 
than when it was absent (attitude toward the pamphlet: 
Ms = 5.58 and 4.26; F(1, 112) = 6.44, p < .05, w = .05; 


likelihood of becoming a member: Ms = 4.39 and 3.43; 
F(1, 112) = 4.01, p < .05, a? = .04; see Table 3). In contrast, 
as we expected, participants who were low in autotelic NFT 
were not influenced by the touch element (attitude toward 
the pamphlet: F(1, 112) = 1.53, p > .05; likelihood of 
becoming a member: F(1, 112) = .55, p > .05). For attitude 
toward the pamphlet as the dependent variable, we found no 
main effect of autotelic NFT (p » .05); however, we did find 
a significant main effect of autotleic NFT on likelihood of 
becoming a member (Ms = 4.03 and 3.13 for participants 
who were high and those who were low in autotelic NFT, 
respectively; F(1, 112) = 6.51, p < .05, œ = .06). As in 
Studies 1 and 2, the presence of a touch element did not 
influence attitude toward the organization (all ps » .05). 


Discussion of Study 3 


Study 3 replicated the results of Study 1 in a real-world set- 
ting, using a more heterogeneous population than the previ- 
ous studies. Thus, we are more confident that our results 
generalize to the general population. A touch element that 
provided positive sensory feedback incorporated into a real 
marketing brochure with a congruent message increased 
attitude toward the brochure and behavioral intentions 
among recipients who were high in autotelic NFT without 
diminishing attitude toward the brochure or behavioral 
intentions among those who were low in autotelic NFT. 
This demonstrates that incorporating a touch element into a 
message indeed results in a net increase in persuasion, even 
if the touch element does not provide any additional instru- 
mental information. 


General Discussion 


In three studies, we found that the incorporation of touch 
into marketing messages can have a positive effect on per- 
suasion for people who are high in autotelic NFT. When a 
touch element was used, a positively valenced or neutral 
element was more persuasive than when a touch element 
that provided negative sensory feedback was used. For par- 
ticipants who were high in autotelic NFT, compared with a 
no-touch-element control condition, a positively valenced 
or neutral touch element increased persuasion, and a nega- 


TABLE 3 
Attitude Toward the Message and Behavioral Intentlons x Touch and No-Touch-Element Conditions x High 
and Low Need for Autotellc Touch: Study 3 Means (Standard Deviations) 





Low Autotelic NFT 


High Autotellc NFT 





No Touch Touch No Touch Touch 
Element Element Element Element 
n = 32 n= 23 n = 23 n= 38 
Attitude toward the pamphlet — 5.07 5.00 4.262 5.588 
(1.18) (1.66) (.84) (1.02) 
Attitude toward the organization 5.58 5.62 5.96 5.91 
(1.45) (1.45) (1.22) (1.14) 
Likelihood of becoming a member , 3.12 3.13 3.43a 4,394 
. (1.11) (1.82) (2.02) (1.53) 


Notes: Numbers with the superscript in the same row are significantly different at p = 05; means are based on seven-point scales. 
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tively valenced touch element did not influence persuasion. 
However, for participants who were low in autotelic NFT, 
compared with the no-touch-element condition, a touch ele- 
ment that provided negative feedback decreased persuasion, 
whereas a touch element that provided positive or neutral 
feedback did not influence persuasion. 

In Study 2, we examined the process by which a touch 
element that provided positive sensory feedback influenced 
persuasion. We found that the persuasive effect occurred 
because of an affective or emotional response to the experi- 
ence of touch. Participants who were high in autotelic NFT 
experienced an emotional response due to the touch ele- 
ment, but participants who were low in autotelic NFT did 
not exhibit this same response. This emotional response to 
the touch element mediated the relationship between the 
presence of the touch element and persuasion 

In addition to the types of sensory feedback the touch 
element elicited, we examined the influence of the congru- 
ency of the touch element with the overall message. For 
participants who were high in autotelic NFT, a touch ele- 
ment increased persuasion, regardless of whether the touch 
element was congruent with the overall marketing message. 
However, for participants who were low in autotelic NFT, 
incorporation of a touch element that was not congruent 
with the message actually decreased the persuasiveness of 
the message. 

Note also that in both Study 2 (the "Feel the Warmth” 
study) and Study 3 (the children’s museum field study), we 
found a main effect of the presence of a touch element on 
persuasiveness. This is likely because both of these studies 
used a touch element that provided positive sensory feed- 
back, which participants perceived as fitting the persuasive 
message. 


Limitations and Further Research 


Although this research found strong consistent effects that 
suggest that touch has a positive affective influence on per- 
suasion, we should recognize some limitations. We were 
not successful in our attempt to obtain a behavioral measure 
of persuasion. Further research should examine the effects 
of touch on actual behavior. In addition, we were unable to 
find an effect of touch on attitude toward the organization. 
Although an increase in attitude toward the message and 
behavioral intentions are positive outcomes for marketers, 
the focus on building relationships with consumers that has 
become prevalent in the marketing literature suggests that 
persuasive elements would be more valuable if they could 
contribute to a broader attitude toward the organization. 
Further research may be able to determine whether there are 
conditions under which the persuasive effects of touch can 
be extended to the organization that sponsors the message. 
For example, repeated exposures to the message may influ- 
ence attitude and behavior toward the organization. 
Another possible limitation is that we examined only 
the process of affective response using a touch element that 
was positively valenced. In particular, we used softness 
because it has been associated with a pleasant sensory feel- 
ing and has been used in touch research (Bushnell and 
Boudreau 1991; Essick, James, and McGlone 1999). Fur- 
ther research should continue to explore process issues by 
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examining the affective response to neutral and negatively 
valenced touch elements. It could also be the case that the 
emotional response to the touch element is moderated by 
other factors. In the current research, participants who were 
high in autotelic NFT exhibited a stronger affective 
response to touch elements than did participants who were 
low in autotelic NFT. We know from previous research 
(Peck and Childers 2003b) that touch information is also 
more accessible for high- than for low-NFT people. Thus, 
not only do high-NFT people respond more strongly to 
touch information, but they also may weigh it more heavily 
when forming evaluations. Thus, it seems likely that a touch 
element could be effective for high-NFT people under both 
high- and low-involvement conditions. Further research 
should be conducted to examine the issue of involvement as 
it relates to high- and low-NFT people and the processes of 
evaluation and persuasion. 


Implications 


Theoretically, this research extends touch research in mar- 
keting. Previous research has focused solely on product 
touch, which has been found to influence persuasion when 
it provides instrumental attribute or structural information 
about the product (e.g., McCabe and Nowlis 2003; Peck 
and Childers 2003a). Our research found that touch can also 
be used as a persuasive element outside of the product touch 
context by providing an enjoyable hedonic experience for 
the consumer. In addition, this research examined the 
process by which hedonic touch influences persuasion and 
the differences between people who are high and those who 
are low in autotelic NFT. For people who are high in 
autotelic NFT, incorporating a touch element into a persua- 
sive communications message creates an affective response 
that increases attitude toward the message and behavioral 
intentions. For people who are low in autotelic NFT, a touch 
element does not generate a significant increase in their 
affective response. 

This research implies that touch could be incorporated 
into marketing messages in a variety of contexts. People 
who are high in autotelic NFT will experience affective 
responses and greater persuasion when they receive a mes- 
sage that incorporates touch, especially when the touch ele- 
ment provides neutral or positive sensory feedback. How- 
ever, to avoid decreasing persuasion for people who are low 
in autotelic NFT, marketers must ensure that the touch ele- 
ment is congruent with the overall marketing message, and 
it should not be negatively valenced. That said, the various 
touch elements we used suggests that congruency between 
the touch element and the message can be interpreted more 
broadly than previous research on congruency has sug- 
gested. For example, in our study, participants evaluated 
sandpaper as being congruent with an arboretum. A similar 
piece of sandpaper was used in an actual direct-mail 
brochure that a charity used to solicit donations to help chil- 
dren living in poverty in India. On the front of the brochure, 
there was a two-inch square of sandpaper, and underneath it 
were the words, “TOUCH THIS...” When the pamphlet 
was opened, there was a picture of a boy, and underneath 
the picture were the words, “To Feel 9-year-old Mallesh’s 
hand.” The experience of touching the sandpaper likely cre- 


ates an image of the child’s suffering and generates an 
affective response of sympathy, which may lead people to 
make donations to the charity. This suggests that marketers 
can use touch in a variety of contexts, provided that people 
who are low in autotelic NFT will be able to find some way 
to make sense of the touch element. 

Touch can be used as a persuasive pol beyond 
brochures that request charitable donations. In some direct- 
marketing situations, it may be possible to segment cus- 
tomers on the basis of autotelic NFT. Many direct mar- 
keters, such as Land's End, currently provide customers 
with the opportunity to request touch information, such as 
fabric swatches. It is likely that a person who requests a fab- 
ric swatch is high in autotelic NFT because prior research 
has found a correlation between instrumental and autotelic 
NFT (e.g., Peck and Childers 2003a, b). Although it may be 
too expensive to mail a touch-enhanced catalog to all cus- 
tomers, a direct marketer could customize mailings to cus- 
tomers who request fabric swatches. 

Regarding the area of product packaging, the current 
research suggests that the packaging opportunities are not 
limited to providing touch attribute information. New print 
technologies are being introduced that provide tactile 
effects, which encourage consumers to touch (Kaleido 
2004). It is likely that people who are high in autotelic NFT 
are more likely to approach and examine such packages 
than are people who are low in autotelic NFT, because the 
former enjoys the sensory experiences of touch. A product 
package that is interesting to touch may increase sales of 
the product even if the opportunity to touch does not pro- 
vide additional product attribute information. Some evi- 
dence suggests that tactile elements of product pàckaging 
can even contribute to the overall brand image of a product. 
For example, Lindstrom (2005) discusses Coca-Cola's use 
of the nostalgic glass bottle to reinforce its brand image and 
suggests that it is the tactile sensation, the feel of the bottle 
in the customer's hand, that is associated with the brand. 

Note that in this research, we investigated the sensory 
aspects of touch independent of product attribute informa- 
tion. However, product touch that conveys attribute infor- 
mation may also provide interesting sensory feedback, 
which would likely affect attitude toward the product 
beyond the touch attribute information. For example, the 
courier envelopes we mentioned previously convey the 
product benefit of the strength of the paper. This research 
suggests that the smooth, pleasant feel of the paper can also 
increase product evaluation. 

Touch also has significant implications for in-store and 
point-of-purchase displays. People who are high in autotelic 
NFT are drawn to opportunities to touch and are likely to 
respond to opportunities to touch clothing, paper goods, and 
other products that provide positive sensory feedback, even 
if they are not in the process of evaluating the product. A 
display that encourages touch may lead customers to inter- 
act with products that they otherwise would have: ignored, 
which in turn may increase impulse and unplanned pur- 
chases (Peck and Childers 2006). 

Finally, this study suggests that touch can be used along 
with pictures, photos, color, humor, and other elements to 
increase the persuasiveness of print advertising. Touch 


elements are unexpected information and have been shown 
to increase the persuasiveness of advertisements, provided 
that they are congruent with the message (Lee and Mason 
1999). Recent trends in advertising have focused on the 
experiential and aesthetic aspects of communication (e.g., 
Schmitt 1999; Schmitt and Simonson 1997). Incorporating 
touch may be the next step in adding a hedonic or 
experiential aspect to advertising and other marketing 
communications. 


Appendix A 
Text of Messages for Studies 1 
and 2 


Study 1 

A little piece of nature, nestled in the middle of the city, the 
Arboretum is a perfect place for a stroll, curling up under a 
tree with a good book, watching birds and local wildlife, or 
an evening of stargazing. 

But the Arboretum is more than just a park. It is-a 
research and teaching facility that provides a place for 
people to develop a positive relationship with nature. 

Every day, the Arboretum brings more people back in 
touch with nature with environmental tours, class and lec- 
tures, and special outreach programs at local schools. It is 
an exceptional resource for learning, sharing, discovering, 
and enjoying—a cherished gem of nature in an urban 
setting. — - 

The Arboretum also provides great opportunities to get 
in touch with nature directly with hiking, biking and jog- 
ging trails, and even trails for skiing and snowshoeing. 

The Arboretum has been a pioneer in the restoration, 
and management of ecological communities since the 
1930's. The Arboretum strives to conserve, restore and pre- 
serve the natural lands of the city so that residents can enjoy 
the beauty of nature for years to come. 

As part of this goal, we recently embarked on a capital 
campaign to further improve the Arboretum through a new 
addition to the visitor's center and the installation of the 
four-acre Native Plant Garden. This project will help the 
Arboretum improve and expand programs for university, 
public and professional audiences. 

You can help to bring more people in touch with nature 
by becoming a Friend of the Arboretum. We gratefully wel- 
come your gift because every gift enhances our ability to 
provide quality programs and experiences for everyone who 
enjoys and learns from the Arboretum. 


Study 2 


Feel the warmth of a warm winter blanket. 

Winters can be icy cold, especially at night. There's 
nothing like curling up under a thick, cozy, warm blanket to 
keep out the harsh winter chill. But for some families, that 
warm, cozy feeling is just out of reach. 

Spread the Warmth helps families in need to keep warm 
this winter by providing them with new and gently used 
blankets. But we need your help to spread the warmth 
before the first big winter chill hits our state this year. 
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Spread the Warmth needs volunteers to deliver blankets 
in our city and donations to help us to purchase enough 
blankets to keep everyone warm. Please help us spread the 
warmth this winter. 


Appendix B 
Autotelic NFT Items 


Autotelic NFT (6 Items) 


1. When walking through stores, I can't help touching all 
kinds of products. 


2. Touching products can be fun. 


3. When browsing in stores, it is 1mportant for me to handle 
all kinds of products. 


4. I like to touch products even if I have no intention of buying 
them. 


5. When browsing in stores, I like to touch lots of products. 
6. I find myself touching all kinds of products in stores. 


We measured all items on a seven-point Likert-type scale: 
Qt = .92 (source: Peck and Childers 2003b). 


Funtouch (3 Items) 


1. I enjoy touching various textures. 
2. I am a person who likes to touch. 
3. Touching in general is fun. 


We measured all items on a seven-point Likert-type scale: 
a = .94. 


Appendix C 
Affective Response Items 


Affective Response (4 items) 


1. This mailing was very enjoyable. 
2. This mailing was very likeable. 

3. This mailing was very persuasive. 
4. This mailing was very interesting. 


We measured all items on a seven-point Likert-type scale: 
a = .86 (source: Zinkhan and Martin 1983). 


Emotional Reaction (10 items) 


Here is a list of emotional reactions you may have experi- 
enced while reading the mailing (touching the swatches). 
Please indicate how much you felt each of these emotional 
reactions. 


«Interested 
«Moved 
«Captivated 
eInquiring 
eConfident 
eDelighted 
eEnthusiastic 
*Appealed 
eSatisfied 
«Amused 


We measured all items on a five-point scale, with endpoints 
“not at all” and “a lot”: œ = .84 (source: Derbaix 1995). 
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Xueming Luo & Naveen Donthu 


Marketing’s Credibility: 
A Longitudinal Investigation 
of Marketing Communication 
Productivity and Shareholder Value 


Practitioners and scholars have questioned marketing communication’s credibility because it has not been linked 
to firm shareholder value. Using secondary longitudinal data, the authors show that the impact of marketing 
communication productivity (MCP) on shareholder value is positive and nonlinear. In particular, the influence of 
MCP on the forward-looking performances of Tobin’s q and stock return is curvilinear with an inverted U shape. The 
authors also investigate the moderating role of research-and-development investment and competitive intensity. 
The findings have implications for marketing managers regarding how to model and benchmark MCP and the 
dangers of blindly improving the productivity of advertising and sales promotion expenditures. 


arketing scholars and practitioners have long been tion investments.” Kraft Foods has suggested moving away 
concerned about the accountability of marketing from guesswork and gut feelings and toward the systematic 
communication expenditures (e.g., advertising, assessment of advertising productivity (Dittus and Kopp 
sales promotions). In their classic article, Aaker and Car- 1990). Achenbaum (1992, p. 25) calls attention to the “cri- 
man (1982) report that firms greatly overadvertise. In an sis of advertising productivity” because many customers 
echo of this claim, Prasad and Sen (1999), in their review of tend to zap past, zip through, and tape over marketers’ mul- 





studies since 1982, find a pattern of continued overadvertis- timillion dollar selling and marketing messages. More 
ing across firms. More recently, Rust and colleagues (2004) recently, it has been noted that “there’s hardly a marketing 
comprehensively review the literature and pinpoint various executive today who isn’t demanding a more scientific 


researchers’ concerns about the productivity of advertising approach to help defend marketing strategies from the chief 
and sales promotion expenditures. In particular, to rebuild financial officer” (BusinessWeek 2004). Thus, reversing the 
confidence in advertising and marketing investment, Rust marketing productivity crisis represents a high-profile, hot- 
and colleagues (2004, p. 76) suggest the urgent need to button issue for both business practitioners and marketing 
“show how marketing adds to shareholder value, [because] scholars. 
the perceived lack of accountability has undermined mar- However, several challenges hinder marketers from 
keting’s credibility, threatened marketing’s standing in the fully understanding and effectively improving marketing 
firm, and even threatened marketing’s existence as a distinct communication accountability. First, an appropriate mea- 
capability within the firm.” surement is needed of marketing communication productiv- 
The trade press has identified a similar problem. ity (MCP), which we define as the ratio of marketing out- 
According to McLaughlim (1997, p. 5), "the most sacred puts (sales level, sales growth, and corporate reputation) to 


word in advertising had been creativity; now it's account- marketing communication expenditures (broadcast, print, 
ability.” Clients are demanding the most cost-efficient and outdoor advertising and sales promotion expenditures). 
media planning with measurable returns. Kumar and A methodologically sound measure of MCP remains chal- 
Petersen (2004, p. 28) point out that “company [chief finan- lenging because firms often target their advertising and 
cial officers] are charged with the task of getting the most sales promotion expenditures to promote multiple outcomes 
out of company resources [such as] marketing communica- simultaneously, such as both visible sales and market per- 


formance and invisible customer equity and corporate brand 
Xueming Luo is Assistant Professor of Marketing, Department of Market- TePVtation. Thus, the traditional approach of using a single 
Ing, University of Texas at Arlington (e-mail: luaxm@uta.edu). Naveen outcome-based measure (e.g., simple ratio of sales to mar- 
Donthu is Katherine S. Bernhardt Research Professor of Marketing, keting expenditures) may not offer a valid assessment of 
Department of Marketing, Georgia State University (e-mail: ndonthu@ MCP. In addition, although a vast literature pertains to sales 
gsu.edu). The authors thank the anonymous JM reviewers for their helpful responses to advertising (e.g., Vakratsas and Ambler 1999), 
guidance and insightful suggestions. Ruth Bolton served as editor for this the theorized shape of the response curve (i.e., linear, non- 
article. linear concave, or convex) has been, at best, inconsistent 
To read or contribute to reader and author dialogue on this article, visit and inconclusive (see Holstius 1990). In this sense, a sub- 


http;//www.marketingpower.com/[mblog. jectively specified measure based on the conflicting 
advertising-sales function that uses regression techniques is 
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inappropriate for assessing MCP. Extant marketing studies 
have not addressed these challenges sufficiently with rigor- 
ous methodological techniques. 

Second, there needs to be a proper assessment of the 
dynamic contributions of MCP to the firm’s market value 
over time. Specifically, in assessing the value of marketing 
communications, most prior research has related advertising 
and sales promotions to accounting performance measures, 
such as profits (e.g., Dekimpe and Hanssens 1995;-Mueller 
1990). However, Aaker and Jacobson (2001, p. 485) chal- 
lenge this practice by proposing that accounting-based mea- 
sures of return on investment (ROI) and book equity alone 
“cannot adequately explain firm value, because they fail to 
capture the benefits of investing in intangible assets such as 
brands.” Similarly, Ambler (2004) notes that ROI is driven 
by short-term considerations and often fails to reflect the 
dynamic impact of marketing assets. In addition, because 
accounting measures such as ROI are backward looking in 
nature (1.e., capture firm value in terms of historical or con- 
current performance), they are not robust for evaluating the 
future contribution of MCP (Rust et al. 2004).! Therefore, 
the long-term value of the invisible outcomes (e.g., corpo- 
rate reputation, brand equity) of a marketing expenditure is 
less likely to be evident according to accounting book val- 
ues. Instead, research should address how MCP adds to a 
firm's market value in terms of forward-looking stock mar- 
ket performance, such as Tobin's q (Dutta, Narasimhan, and 
Rajiv 1999; Rao, Agarwal, and Dahlhoff 2004) and stock 
returns (Aaker and Jacobson 1994; Mizik and Jacobson 
2003). Unfortunately, prior literature has rarely related 
MCP to the market-based measure of firm value and thus 
has failed to address the real, long-term credibility of mar- 
keting communication expenditures. 

Against this background, we attempt to (1) develop a 
theory-based measurement of MCP and (2) assess MCP's 
association with firm shareholder value empirically. In par- 
ticular, we define MCP on the basis of economic growth 
theory and the resource-based view (RBV) and measure it 
in a time-series-based context using the Malmquist model- 
ing method (Caves, Christensen, and Diewert 1982; 
Malmquist 1953). This nonparametric method satisfies the 
challenging requirements of measuring MCP because it 
captures the multiple objectives of marketing communica- 
tions in multiple periods and without a prior assumption 
about the response function. Unlike most regression 
approaches that offer a comparison with average performers 
under the assumption of central tendencies, the Malmquist 
technique benchmarks each firm's productivity against the 
best practices that perform above average (Füre et al. 1992). 
In addition, its dynamic time-series nature accommodates 
the lagged, carryover effects of advertising and sales pro- 
motion. Our application of the Malmquist time-series tech- 
nique not only extends prior work on cross-sectional and 
time-invariant counterpart models (cf. Horsky and Nelson 
1996; Murthi, Srinivasan, and Kalyanaram 1996) but also 


1Using the backward-looking performance measure of ROI, the 


impact of advertising is mixed, according to the literature (e.g., 
positive: Chauvin and Hirschey 1993; Mizik and Jacobson 2003; 
negative: Erickson and Jacobson 1992; Holstius 1990). 


provides a direct answer to a series of calls for scientific 
measures of marketing productivity (McLaughlim 1997; 
Rust et al. 2004). 

Furthermore, we develop and test a theoretical frame- 
work that predicts that firms that are more efficient in con- 
verting advertising and promotion investments into sales 
and corporate reputation may create greater shareholder 
value over time. Because MCP facilitates the level and 
speed of future cash flows while reducing their volatility 
and vulnerability (Srivastava, Shervani, and Fahey 1998, 
1999), we suggest that it also should promote shareholder 
value in terms of the forward-looking market value mea- 
sured by Tobin's q and stock return. 

Another important insight of our framework is that we 
do not predict a monotonic linear relationship between 
MCP and firm market value but rather nonmonotonic curvi- 
linear influences of MCP. Because of the possible conflict 
between productivity and customer satisfaction (Anderson, 
Fornell, and Rust 1997), the unrestricted pursuit of 
improvements in MCP may actually harm firm shareholder 
value. There is a limit to the incremental performance con- 
tributions of MCP, and departing from the optimal point 
(too low or too high increases of MCP) may lead to reduced 
Tobin’s q and stock return. We also hypothesize about why 
and how the impact of MCP changes across firms with dif- 
ferent levels of research and development (R&D) invest- 
ment and competition intensity. 

Next, we present the hypotheses development for the 
impact of MCP and the moderated relationships, followed 
by details of our suggested process for measuring MCP. 
Using secondary longitudinal data from Fortune’s large 
companies, we find that MCP is positively related to 
Tobin's q and stock return. For practitioners, this finding 
helps justify advertising and sales promotion budgets and 
supports marketing's long-term credibility. However, the 
supported inverted U-shaped relationship between MCP 
and firm market value suggests some "dangers" of blindly 
improving MCP. Our work represents a first step in measur- 
ing and valuing marketing communication investments, 
which must be taken before the productivity crisis can be 
reversed. 


MCP and Shareholder Value 


Neoclassical production theory in economics posits that, in 
general, productivity represents the conversion of economic 
inputs (e.g., labor, capital) into desirable outputs (i.e., sales, 
profits) or the ratio of the outputs of an economic activity to 
the inputs required by that activity. According to Caves, 
Christensen, and Diewert (1982) and Farrell (1957), in a 
production setting, managers attempt to improve the value 
of a firm by operating it in the most productive manner. 
That is, managers maximize the firm's desirable outputs or 
outcomes with the minimal amount of inputs or resources. 
This definition of productivity in an economic production 
context has been well accepted in marketing (Bucklin 1978; 
Murthi, Srinivasan, and Kalyanaram 1996; Sevin 1965). 
Following this stream of research, we define MCP as 
the optimally weighted ratio of marketing outputs (sales 
level, sales growth, and corporate reputation) to marketing 
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communication expenditure (advertising media spending in 
broadcast, print, and outdoor and sales promotion 
expenditure).2 

Figure 1 presents our theoretical framework of MCP 
and shareholder value. This framework provides a base 
from which we predict (1) how MCP adds to a firm’s share- 
holder value in a curvilinear manner and (2) the moderating 
role of contextual factors (R&D and environmental compe- 
tition intensity) for returns on MCP. 


Curvilinear Influences of MCP 


In this section, we offer the logic for the dynamic impact of 
MCP and predict that MCP has nonlinear influences on 
shareholder value. That is, an initial increase of MCP will 
enhance the forward-looking performance of Tobin’s q and 
stock returns, but beyond an optimal point, further increases 
in MCP will become harmful. 

An initial increase of MCP over time is positively asso- 
ciated with a firm’s market return (i.e., Srivastava, Shervani, 
and Fahey 1999). Marketing communication productivity 
can accelerate cash flows in several ways. Efficient market- 
ing communication investments offer the same levels of 
sales and brand reputation with less advertising and promo- 
tion spending. The saved expenditures, which represent 
extra working capital, then can be invested in other high net 
present value projects that address long-term performance 
(Anderson, Fornell, and Rust 1997; Mittal et al. 2005). Over 
time, this cost saving and higher productivity may acceler- 
ate cash flows (Srivastava, Shervani, and Fahey 1999). Fur- 
thermore, because the time for market acceptance may be 
reduced through learning and the persuasion effects of pro- 
ductive and successful advertising, the product adoption 
and diffusion rate might be facilitated (Erickson and Jacob- 
son 1992; Vakratsas and Ambler 1999). In addition, supe- 
rior marketing communication programs build customer 
equity, which leads to higher future repurchases and mar- 
gins (Keller 1998; Keller and Aaker 1993; Rust 1986). 
Indeed, productive advertising and promotion help promote 
product awareness, initial product trials, and repeated pur- 
chases (Chauvin and Hirschey 1993; Luo and Donthu 2001; 
Szymanski, Bharadwaj, and Varadarajan 1993). From 
another perspective, investors may regard an improvement 
in efficient marketing and advertising expenditures as good 
news in the financial markets, which can have a positive 
impact on the long-term implications of future cash flows 
(e.g., Hiemstra and Jones 1994). Furthermore, productive 
advertising and sales promotion programs may reduce the 
vulnerability and volatility of cash flows through reduced 
brand-switching intentions and decreased customer churn 


——— MÀáá———————————— 

2Our MCP modeling approach fits with Penrose’s (1959) origi- 
nal work on the RBV, which argues that the process of converting 
resources determines a firm's competitive advantage more than 
does the level of resources. In addition, because the RBV pertains 
to best performances and above-average rents (Hansen, Perry, and 
Reese 2004), our model coincides with the RBV neatly ın that it 
compares each firm with the best-practices frontiers. Our MCP 
measures examine the productivity (not level) of advertising and 
promotion expenditure. 
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rates (Reichheld 1996; Vakratsas and Ambler 1999). Cus- 
tomers who retain a higher brand identity with the firm tend 
to be more responsive to future advertising and promotions. 
Dynamically, this tendency may lead to reduced market 
risks for the firm (Srivastava, Shervani, and Fahey 1999). 
Because MCP results that are too low and employed with 
inefficient and unproductive marketing communications 
investments mean overspending in advertising and promo- 
tions (Aaker and Carman 1982; Rust et al. 2004), the 
wasted and inefficient resources likely hamper the speed 
and level of cash inflows to the firm. 

Thus far, we have argued that increasing MCP posi- 
tively affects firms' market returns. However, we expect that 
this impact may not be strictly linear. Just as in the case of 
too low MCP with overspending, an extremely high 
increase of MCP with minimum spending in advertising 
and promotion programs may not be optimal. As Anderson, 
Fornell, and Rust (1997, p. 129) point out, because firms 
that try to achieve high productivity (i.e., reduce labor costs, 
cut employee training expenditures) experience lowered 
levels of customer satisfaction, their "future profitability 
may be threatened." This potential conflict between produc- 
tivity and customer satisfaction suggests that the unre- 
stricted pursuit of high productivity and efficiency through 
relentlessly "lean" marketing and advertising expenditures 
can lead to a backlash that involves compromises and hin- 
drances in customer equity efforts, initiatives to achieve 
premeditated levels of customer awareness and corporate 
exposure, and new product innovations (Dutta, Narasimhan, 
and Rajiv 1999; Szymanski, Bharadwaj, and Varadarajan 
1993). These consequences eventually lead to diminished 
future performance returns of MCP. In addition, because 
firms tend to enhance productivity by first cutting the 
expenses that are most wasteful, subsequent efforts to 
improve MCP by further cutting advertising and promotion 
expenditures may cause harm over time and lead to reduced 
market returns. In other words, there is a limit to the posi- 
tive incremental return to MCP. At the margin, increasing 
MCP beyond an optimal point may eventually result in 
negative returns. This line of reasoning for the curvilinear 
influence of MCP is also supported by recent empirical 
accounting and finance research, which finds nonlinear 
relationships between market value and various predictive 
variables (Byrd and Hickman 1992; Hiemstra and Jones 
1994; Ng 2005; Sougiannis 1994). For example, Ng (2005) 
finds a curvilinear relationship between corporate owner- 
ship and Tobin's q. According to Hiemstra and Jones 
(1994), trading volume has a nonlinear influence on stock 
returns. Indeed, Hiemstra and Jones argue (p. 1642) that 
"changes in stock market volatility can only be properly 
modeled with nonlinear models." 

Taken together, this discussion suggests that departing 
from an optimal point—that is, increases in MCP that are 
too low or too high—leads to decreases in the forward- 
looking market value, exhibiting an inverted U-shaped 
relationship. 

H;: MCP has an inverted U-shaped influence (positive linear 

but negative quadratic coefficients) on the forward- 


looking performance measures of Tobin's q and stock 
return. 
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The Moderating Role of R&D Intensity and 
Competition 


We expect that the association between MCP and share- 
holder value may not be universal but rather may change 
according to the organizational context. We turn to the RBV 
perspective (Barney 1986; Penrose 1959; Wernerfelt 1984) 
and Srivastava, Shervani, and Fahey’s (1998) market asset 
framework to gather the logic surrounding some moderators 
of this association. 

A firm’s R&D intensity may change the strength of the 
association between MCP and market value. In the litera- 
ture, R&D and marketing are deemed to be complementary 
resources (Dutta, Narasimhan, and Rajiv 1999; Srivastava, 
Shervani, and Fahey 1998). Because the RBV considers 
firms different bundles and configurations of strategic 
resources, the synergistic interaction between superior mar- 
keting and R&D may create additional long-term value, 
beyond the direct impact of efficient marketing programs. 
Productive marketing and advertising investments likely 
obtain valuable feedback from the market and customers, 
which facilitates R&D and new product innovations. In 
turn, this offers enhanced cash flow to the firm. In addition, 
intense R&D in the new product development process can 
“ensure speedy and successful commercialization of tech- 
nologies and products at a low cost” (Dutta, Narasimhan, 
and Rajiv 1999, p. 551; see also Gupta, Raj, and Wilemon 
1987). Thus, superior MCP coupled with higher R&D 
intensity over time may lead to even greater speed and lev- 
els of cash flow, along with lower vulnerability and volatil- 
ity, which can promote greater market value in the long run 
(Griffin and Hauser 1996; Srivastava, Shervani, and Fahey 
1999). Working from the RBV perspective, Dutta, 
Narasimhan, and Rajiv (1999) find that R&D capability 
facilitates the positive influence of marketing capability and 
brings complementary rents into the firm dynamically, 
beyond the contribution of superior marketing efficiency 
alone. 

In summary, on the basis of the RBV and prior findings, 
we suggest that in conjunction with higher R&D intensity 
in the firm over time, MCP leads to greater firm market 
value. 


H5: MCP has a stronger influence on the forward-looking per- 
formance measures of Tobin’s q and stock return in firms 
with higher (versus lower) R&D intensity. 


Finally, we predict that environmental competition 
intensity expands the complementary rents of MCP and 
intense R&D, as we state in H». In particular, in a competi- 
tive market, buyers may choose from more substitutes of 
products and services. To lock in buyers, firms strategically 
build brand loyalty and create exit barriers that facilitate the 
speed and level of their cash flows. Similar to Schumpeter's 
(1934) notion of continuous entrepreneurial innovations due 
to market competition, in more competitive markets, firms 
are motivated (reactively or proactively) to leverage not 
only efficient advertising programs but also R&D invest- 
ments over time to dynamically upgrade and renew cash- 
cow brands to shield themselves from head-to-head compe- 
tition for more entrepreneurial rents. The more competitive 
the market, the more firms may rely on marketing and R&D 
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skills simultaneously to ensure the successful commercial- 
ization of their existing and new technologies, products, and 
services (1.e., Griffin and Hauser 1996; Mizik and Jacobson 
2003). 

In short, the complementary effect of marketing and 
R&D (Dutta, Narasimhan, and Rajiv 1999) may be 
expanded in more competitive markets, which would lead 
to greater cash flows and firm market value. Therefore, it is 
likely that MCP has the strongest influence on Tobin's q 
and stock returns when firms have higher R&D intensity 
and operate in highly competitive markets? 

H3: MCP has the strongest influence on the forward-looking 

performance measures of Tobin's q and stock return in 


firms that have high R&D intensity and are in more 
competitive markets. 


Measuring MCP: The Modeling 
Process 


We now detail a process of modeling MCP that is based on 
the time-series Malmquist approach. We present a flow 
chart of this process in Figure 2. As we show in the flow 
chart, there are two significant stages involved in our mod- 
eling process. Stage 1 focuses on measuring MCP, and 
Stage 2 focuses on valuing MCP (as we detail in the 
"Analyses and Results" section). In Stage 1, we first define 
MCP as the conversion ratio of marketing communication 
inputs to outputs, which requires a logical identification of 
the multiple inputs to multiple outputs. 

For the inputs of MCP, we focus on two main cate- 
gories: advertising media spending and sales promotion 
expenditures. Advertising media spending includes three 
dimensions: expenditures in broadcast, print, and outdoor. 
Specifically, a firm's advertising expenditures in broadcast 
media equals the sum of television and radio expenditures, 
including network television spots on ABC, CBS, NBC, 
Fox, WB, UPN, and Pax TV and television spots on 575- 
plus stations in the top 100 U.S. markets, which encom- 
passes more than 180 programs per month and 37 cable net- 
works, as well as four radio networks and spot radio 
spending on 4000 stations in more than 225 markets. The 
MCP input of advertising expenditures in print media 
equals the sum of newspaper, magazine, and other expendi- 
tures, including more than 140 newspapers in 60 of the top 
U.S. markets, more than 230 consumer magazines, and 
Sunday magazines. The MCP input of advertising expendi- 
tures in outdoor media is the total expenditures in more than 
300 outdoor plant operator markets. Finally, the MCP input 
of sales promotion expenditures is the sum of spending in 
sales promotions, direct mail, co-op spending, coupons, cat- 
alogs, business and farm publications, and Special events. 

Next, we specify three outputs of MCP—-sales level, 
sales growth, and corporate reputation—because marketing 
communication expenditures influence these outcomes 
simultaneously. Note that measuring the sales level requires 
an approach to control the influence of other relevant 
variables and carryover effects of advertising and marketing 
a ee 


Although the main effects of competition may be positive or 
negative, we investigate only the moderating effect. 


FIGURE 2 
MCP Methodology Flow Chart 
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expenditures (i.e., Kim, Bhargava, and Ramaswami 2001; 
Vakratsas and Ambler 1999), as we discuss subsequently. 
The MCP output of sales growth is the three-year com- 
pounded annual growth rate of sales in the primary Stan- 
dard Industrial Classification of the firm (Rao, Agarwal, 
and Dahlhoff 2004). The MCP output of corporate reputa- 
tion is defined as the firm's overall appeal to the: public and 
is based on its historical actions and future prospects 
(McGuire, Schneewels, and Branch 1990; Roberts and 
Dowling 2002). According to Fortune (2005), this variable 


assesses the perceived reputation of the large U.S. firms and 
is based on the research of more than 10,000 financial ana- 
lysts and investors. 

After we specify the inputs and outputs of MCP, we 
introduce the dynamic Malmquist approach to estimate 
MCP. The intuition of this approach is that firms attempt to 
consume the least possible amount of inputs to achieve the 
same level of desired outputs from time t to time t + 1. If a 
firm cannot reduce its inputs (i.e., use fewer resources) 
without hurting its output levels, it is considered productive 
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Over time. Otherwise, it is unproductive and inefficient. 
Thus, productive firms as a whole represent the so-called 
best-practice frontier that dynamically benchmarks each 
firm’s MCP in the transformation of inputs into outputs 
(Fare et al. 1992; Luo 2004). 

To calculate MCP from t to t + 1 for each firm, 
Malmquist (1953) initially developed dynamic models to 
assess the total factor productivity of general economic 
activities over time. Later, Fare and colleagues (Fire et al. 
1992; Fare, Grosskopf, and Lovell 1994) constructed the 
time-series linear programming (or data envelopment analy- 
sis [DEA] -based) Malmquist productivity index.4 As we 
show in the Appendix, this measure of MCP for each firm 
can be calculated using two single-period and two mixed- 
period efficiencies based on DEA programming (Banker, 
Charnes, and Cooper 1984; Charnes, Cooper, and Rhodes 
1978). To the best of our knowledge, the literature has pro- 
vided no time-series applications of the DEA-based 
Malmquist approach to measure and value the various com- 
ponents of MCP (for technical details and numeric exam- 
ples of DEA, see Horsky and Nelson 1996; Luo and Donthu 
2005; Murthi, Srinivasan, and Kalyanaram 1996; see also 
the Appendix). 

The Malmquist approach is well suited for measuring 
MCP and tracking its changes over time because of its 
methodological advantages. First, it can estimate the pro- 
ductivity of marketing communication expenditures “with- 
out a priori information on tradeoffs among inputs and out- 
puts” (Chen and Ali 2004, p. 239). This is advantageous 
because the function of market responses to advertising and 
promotions is conflicting and inconclusive (Strong 1925: 
Vakratsas and Ambler 1999). In addition, some advertising 
expenditures are intended to promote corporate brands, 
whereas others are intended to increase sales revenue, and 
still others aim for a little bit of both. Therefore, without 
prior knowledge about which part of the advertising and 
marketing expenditure produces which part of the outputs, 
the Malmquist approach is very suitable. Second, unlike 
regression techniques that use one output at a time (Caves, 
Christensen, and Diewert 1982), Malmquist models are not 
limited by the number of outputs, which is desirable for 
measuring MCP because marketing communications can 
achieve multiple outcomes (i.e., visible outcomes, such as 
sales, and invisible outcomes, such as reputation) at the 
same time.? Third, the time-series nature of Malmquist 
———— M —— — — c MN 

^Compared with parametric alternatives such as stochastic fron- 
uer, thick frontier, distribution-free analyses, and translog specifi- 
cation, the Malmquist linear programming estimates are more sta- 
ble 1n Monte Carlo simulation studies (e.g., McAllister and 
McMannus 1993) and offer richer managerial implications (e.g., 
Luo 2004; Luo and Donthu 2005; Wheelock and Wilson 1999). 

5Although multivariate analysis methods such as canonical cor- 
relation can deal with multiple outputs and inputs, they are essen- 
tially parametric approaches that assume normal distribution and 
central tendency (1.e., they compare with average performers 
rather than best practices). In addition, canonical correlation can 
estimate only one set of coefficients across all firms and thus pro- 
duce, at best, only one index of MCP for each firm (cf. the 
Malmquist approach offers a more complete understanding of dif- 
ferent facets of MCP). We thank an anonymous reviewer for high- 


lighting this point. 
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modeling offers an advantage for benchmarking MCP 
because advertising and sales promotion expenditures have 
lagged and carryover effects; outcomes may unfold over a 
long period in the marketplace (Fáre et al. 1992). 

The Malmquist approach also fits well with the RBV of 
the firm (e.g., Barney 1986; Penrose 1959). According to 
Penrose (1959), a firm's best practices in the process of 
converting resource inputs to outputs (or MCP in our con- 
text) ultimately determine firm performance, not just the 
levels of those inputs. Hansen, Perry, and Reese (2004, p. 
1280) recently acknowledged that *RBV is fundamentally a 
theory about extraordinary performers or outliers. The sta- 
tistical methods used in applying the theory must account 
for individual firm differences, but should not be based on 
means, which statistically neutralize firm difference." The 
Malmquist approach is just such a method and identifies 
best-practice frontiers and benchmarks MCP at the individ- 
ual level. 

The final step in Stage 1 is to decompose the overall 
MCP into several subcomponents across firm-year observa- 
tions (see the Appendix). Intuitively, the first dimension of 
MCP reflects the change of MCP from time t to t 4 1 in 
terms of managerial efficiency, which we label MCP1 (i.e., 
the effects of moving toward the best-practice frontiers). 
This efficiency represents how much improvement in MCP 
can be achieved if managers enhance their skills of manag- 
ing marketing communication programs compared with the 
best practices. Another dimension is the change of MCP 
from time t to t + 1 in terms of capital efficiency, which we 
label MCP2 (i.e., the effects of moving between the best- 
practice frontiers with different returns-to-scale model 
specifications). This efficiency represents how much 
improvement in MCP can be achieved if managers can pro- 
mote the economies of scale when using marketing commu- 
nication capitals. It also offers a dimension that represents 
the change of MCP from time t to t + 1 in terms of best- 
practice shift efficiency, which we label MCP3 (i.e., the 
effects of moving away from the best-practice frontiers 
from t to t + 1). This efficiency indicates how much 
improvement in MCP can be achieved through factors that 
are unrelated to marketing communication managerial skills 
but are associated with intertemporal, uncontrollable 
variables, such as industry changes, communication innova- 
tions, and environmental variables. Finally, the dimension 
of returns to scale of marketing communications capital 
(1.e., the effects of moving along the best-practice frontiers) 
is a categorical variable that can be increasing, constant, or 
decreasing. We create two dummy variables (MCPd1 and 
MCPd2) for these three kinds of returns to scale. 

Next, we describe the multiple sources of secondary 
data we collected to measure MCP. Then, we present the 
analysis approach in Stage 2, which investigates the value 
of MCP by relating its subdimensions to Tobin's q and 
Stock returns. 


Data and Measures 
We obtained data on the advertising and promotions expen- 
ditures, sales revenue, corporate reputation, Tobin's q, stock 
return, accounting profitability, and firm characteristics 


variables from multiple archived sources. Specifically, we 
collected secondary data from four sources: COMPUSTAT, 
Competitive Media Reporting (CMR), the Fortune reputa- 
tion index, and the Center for Research in Security Prices 
(CRSP). To evaluate the contribution of MCP to firms’ mar- 
ket value, we selected the large publicly traded Fortune 
1000 companies (and the top marketers and advertisers) for 
eight consecutive years (1995-2002). We report the details 
of the cross-sectional time-series databases and the 
variables in Table 1. 

To estimate MCP over time, we need data for the four 
marketing communication expenditure inputs (i.e., in 
broadcast, print, outdoor, and sales promotion) and three 
outputs (sales level, sales growth, and corporate reputation). 
The MCP input variable data come from the CMR archives, 
which offer fairly comprehensive information about the 
breakdown of advertising and promotion expenditures by 
the large firms in top markets (e.g., Anderson, Fornell, and 
Mazvancheryl 2004; Rao, Agarwal, and Dahlhoff 2004). 
Although COMPUSTAT also offers information,on adver- 
tising and sales expenditures, many data points are missing 
because firms are not required to report this type of expen- 
diture (see Joseph and Richardson 2002). In addition, unlike 
the CMR data set, COMPUSTAT does not break down 
advertising expenditure data into different types of media 
spending. 

For MCP output variables, we first used sales level that 
has controlled the carryover effects of advertising (i.e., 
Chauvin and Hirschey 1993; Vakratsas and Ambler 1999) 
and the impact of other relevant factors, such as R&D, firm 
size, and competition. Specifically, we employed the fol- 
lowing functions: 


(1) SALES,~ = SALES, — SALES, predicted Where 
SALES, predicted = & + DgSALES, _ | + by X iADV,. | 
+ by X RD, + bs x RD, -1 + b4 X'SIZE, 
+ bs X SIZE, , 
+ be x Focus of FIRM, 
+ b, x Focus of FIRM, _ ; 
+ bg X competition, + bg X competition, _ | 
+e. 


COMPUSTAT provided data for the book value of total 
assets, sales revenue, R&D intensity, firm size, the focus of 
the firm, industry competition intensity, and other such 
variables. Firm size is the log of the number of employees, 
and focus of the firm is the number of industry segments in 
which the firm operates (Rao, Agarwal, and Dahlhoff 
2004). In line with prior literature in marketing and man- 
agement (i.e., Anderson, Fornell, and Mazvancheryl 2004; 


6Conceptually, it is important to have a "clean" measure of sales 
level when examining the relevant contribution of marketing 
expenses at a given point of time. That is, we need to tease out the 
impact of many other variables and carryover effects of marketing 
communication expenditures. Dutta, Narasimhan, and Rajiv 
(1999) use a similar approach to adjust their output variable when 
evaluating capability efficiency (MCP productivity in our context). 


Fischer and Pollock 2004), we measured industry competi- 
tion intensity using the Herfindahl concentration index, 
which we derived from COMPUSTAT. A low degree of 
concentration indicates that the marketplace is fragmented 
and competition intensity is high. We measure R&D inten- 
sity as the ratio of R&D spending to total assets for each 
firm-year observation (Anderson, Fornell, and Mazvan- 
cheryl 2004), which we also derived from the COMPUS- 
TAT database. 

The second output variable is sales growth, which we 
also derived from the COMPUSTAT data set and calculated 
as the three-year compounded annual sales growth rate 
rather than a simple growth rate, such as sales, versus 
sales, _ ; (Rao, Agarwal, and Dahlhoff 2004). Sales growth 
could be measured with a simple two-year-based growth 
rate (t versus t — 1). However, to be more parsimonious, we 
measured sales growth on the basis of the three-year com- 
pounded rate (t versus t — 1 and t — 2). Because this com- 
pounded rate is essentially a “smoothed” rate of growth, it 
has relatively fewer concerns, such as the possible existence 
of some random dramatic shocks. In general, the com- 
pounded annual sales growth rate (t versus t — 1 and t — 2) is 
preferred to a simple growth rate (t versus t — 1) in both 
marketing and accounting literature (see Rao, Agarwal, and 
Dahlhoff 2004). 

We obtained data for the final MCP output variable, cor- 
porate reputation, from the annual reputation survey of the 
most admired U.S. corporations (Houston and Johnson 
2000; Roberts and Dowling 2002). This measure of per- 
ceived overall reputation of the largest U.S. firms uses an 
1 1-point scale (0—10) to assess eight indicators (innovative- 
ness, employee talent, social responsibility, long-term 
investment value, financial soundness, use of corporate 
assets, quality of management, and quality of products and 
services; see Fortune 2005). Fortune surveys more than 
10,000 executives, directors, and securities analysts every 
year. 

Many prior studies have successfully used this corporate 
reputation score in both marketing and strategy (Houston 
and Johnson 2000; Roberts and Dowling 2002). In particu- 
lar, McGuire, Scheeweis, and Branch (1990, p. 170) note 
that the “Fortune reputation is one of the most comprehen- 
sive and widely circulated surveys of attributes available 
[for measuring reputation]. Both the quality and number of 
respondents are comparable or superior to the 'expert pan- 
els' usually gathered for such purposes. Thus, research 
across disciplines seems to accept this best-available mea- 
sure of corporate reputation. However, because it has been 
noted that the Fortune reputation measure depends on finan- 
cial performance (e.g., Aaker and Jacobson 2001), we also 
address this reverse-causality issue in our analyses, as we 
discuss subsequently, using the approach that Roberts and 
Dowling (2002) recommend.’ 


TTo control the reverse-causality issue between financial perfor- 


mance and corporate reputation (Aaker and Jacobson 2001), we 
adopted the approach that Roberts and Dowling (2002) recom- 
mend. As we discussed in the “Additional Analyses” subsection, 
our results hold after accommodating this reverse-causality con- 
cern. We thank an anonymous reviewer for this point. 
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Measure 


Forward-looking 
performance of 
Tobin’s q 


Forward-looking 
performance of 
stock return 


Backward-looking 
performance of 


return on assets 


Sales level 


Sales growth 

Corporate 
reputation 

Advertising 


broadcast 
expenditure 


Advertising print 
expenditure 


Advertising 
outdoor 
expenditure 

Sales promotion 
expenditure 


Competition 
intensity 


R&D Intensity 
Firm size 


Focus of the firm 


TABLE 1 


Measures and Data Sources 


Operationallzation 


A longltudinal linear-programming-based ratio of multiple 
outputs (sales revenue, market growth, and corporate 
reputation) to multiple inputs (broadcast, print, outdoor, 

and sales promotion expenditures) 


The result of ([share price x number of common stock 
outstanding + liquidating value of the firm’s preferred 
stock + short-term liabilities — short-term assets + book 
value of long-term debt}/book value of total assets) 


The result of (current year's share price x number of 


common stock outstanding + dividends — previous year's 


share price x number of common stock 
outstanding/[previous year's share price x number of 
common stock outstanding]) 


The ratlo of net income before extraordinary Items to 
assets 


Adjusted for advertising and marketing carryover effect, 
R&D contemporary and carryover effect, competition 
Justification effect, and firm-size effect 


The compounded annual sales growth rate over the 
previous three years 


Fortune reputation index (corporate reputation based on 
an annual survey of the most admired U.S. corporations) 


The sum of television and radio expenditures, such as in 
network television spots on ABC, CBS, NBC, Fox, WB, 
UPN, and Pax TV and spot television on 575-plus 
stations in the top 100 U.S. markets; more than 180 
syndicated television programs per month, 37 cable 
networks, and four radio networks and spot radio 
spending from 4000 stations in more than 225 markets 


The sum of newspaper, magazine, and print other 
expenditures including 140-plus newspapers in 60 of the 
United States’ top markets; monitors space in national 
newspapers, Including The Wall Street Journal, USA 
Today, and The New York Times; 230-plus consumer 
magazines and Sunday magazines 


The sum of more than 300 outdoor plant operator 


markets 


The sum of spending in sales promotion, direct mail, co- 
op spending, couponing, catalogs, business and farm 
publications, and special events 


Herfindahl concentration index (derived by using the 
lagged sales for all the companies in the same two-digit 
Standard Industnal Classification code for each firm- 
year observation) 

R&D expenditure/book value of total asset, _ , 

Number of employees 


Number of industry segments the firm operates 
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Data Sources 


Derived from CRSP 
and COMPUSTAT 


Derived from CRSP 
and COMPUSTAT 


Derived from 
COMPUSTAT 


Derived from 
COMPUSTAT 


Derived from 
COMPUSTAT 


Fortune 


CMR 


CMR 


Derived from 
COMPUSTAT 


COMPUSTAT 
COMPUSTAT 
COMPUSTAT 


Perlod 
1995-2002 


1994-2003 


1994-2003 


1995—2002 


1994-2003 


1993-2003 


1995-2002 


1995-2002 


1995-2002 


1995—2002 


1995—2002 


1994-2002 


1994-2002 
1995-2002 


1995-2002 


Because we tried to merge data from these different 
sources and each source has some missing values, our sam- 
ple size was reduced. After searching other sources, such as 
Standard & Poor's industry reports, AdWeek, . Moody's 
report, Compact Disclosure, and company annual reports, 
we obtained complete data on 89 companies. As a result, we 
have 712 data points across the firms from 1995 to 2002. 

In line with the work of Jacobson and colleagues (Aaker 
and Jacobson 1994, 2001; Mizik and Jacobson 2003), we 
derived the forward-looking performance measures of stock 
return using both the COMPUSTAT and the CSRP data- 
bases. Tobin's q, which is comparable across industries 
(Anderson, Fornell, and Mazvancheryl 2004; Dutta, 
Narasimhan, and Rajiv 1999), offers another forward- 
looking measure of firm market value. We followed prior 
marketing studies (Lee and Grewal 2004; Rao, Agarwal, 
and Dahlhoff 2004) to calculate Tobin's q for each firm- 
year observation. In computing the stock return and Tobin's 
q, we also controlled for the influence of return on assets 
(ROA; see Chauvin and Hirschey 1993; Erickson and 
Jacobson 1992), which we measure as the ratio of net 
income before extraordinary items to assets, which we 
derived from COMPUSTAT. 


Analyses and Results 


Results of Measuring MCP 

Because we wanted to benchmark and track the longitudinal 
changes of several different dimensions of MCP over time, 
we estimated 39,872 linear programs (4 efficiency models x 
2 returns to scale x 712 data points x [8 — 1] years) to derive 
the MCP components for the sample. In addition, following 
Füre, Grosskopf, and Lovell's (1994) approach, we ran 
22,784 additional linear programming models to determine 
the returns to scale of marketing expenditure. We summa- 
rize the results of the subdimensions and the overall MCP 
in Table 2. The interpretation of the results is that when 
MCP » 1 (for MCP dimensions and overall MCP), MCP 
improves from t to t + 1; when MCP = 1, no change occurs; 
and when MCP « 1, MCP decreases from t to t + 1. 

Our data suggest that MCP improved over time, as we 
show in Table 2. Overall MCP was stable during 1995— 
1998, but it increased steeply during 1998—2001. For the 
individual dimensions of MCP, we observed an overall 
trend of improved managerial efficiency (MCP1) and dra- 
matic increases (with a peak) in 2000—2001. The capital 
efficiency of marketing communications expenditure 
(MCP2) slightly increased during 1995-1999 but then 
decreased during 1999—2002. In addition, the best-practice 
shift efficiency (MCP3) seemed to increase over time but 
decreased during 1997—1998 and 1999—2000.. Therefore, 
both managerial and best-practice shift efficiency seem to 
be the drivers of the improvement of the firms' overall MCP 
over time in this sample. 

We also break down the overall results of MCP accord- 
ing to the different contextual factors in terms of firm dif- 


$We need only two dummy variables for the three different 
returns to marketing communications expenditures: increasing, 
constant, and decreasing. The decreasing-returns case is the base 
in the dummies. 


ferences in R&D spending and market competition inten- 
sity. As we present in Figure 3, Panel A, firms with high 
R&D intensity tend to have sustained, superior MCP during 
the 1995—2002 period, especially between 2000 and 2001. 
There is a significant difference between the high- and the 
low-R&D groups (with a median split) in their overall MCP 
changes over time (t = 4.82, p < .05). In support of the lit- 
erature that suggests that high R&D intensity is important 
for firm performance (e.g., Dutta, Narasimhan, Rajiv 1999; 
Gupta, Raj, and Wilemon 1987; Rust, Moorman, and Dick- 
son 2002), this finding indicates that it is also positively 
related to an intermediary marketing outcome of MCP. Fur- 
thermore, it supports the recent theoretical argument about 
the need to investigate intermediary measures of marketing 
productivity (Rust et al. 2004). 

For market competition intensity, as we show in Figure 
3, Panel B, the results indicate a significant difference 
between the high- and the low-competition groups (with a 
median split) in their overall MCP changes over time (t — 
5.29, p « .05). Although low competition intensity is associ- 
ated with higher overall MCP for the most part, during 
2000—2002, high market competition is related to improved 
overall MCP. Because Malmquist MCP results are linear- 
programming-based nonparametric results, we report both 
Spearman and Pearson correlation results of the variables in 
Table 3. 


Hesults of Valuing MCP 


Hypotheses testing approach. To test H,-H5, we employ 
a hierarchal nonlinear mixed model to control for unob- 
served heterogeneity and unmeasured nuances in the effects 
of MCP on shareholder value. Prior studies suggest that for 
cross-sectional time-series data, large variance may be due 
to the unobserved heterogeneity in firm-level factors, such 
as marketing managerial expertise, and time-level factors, 
such as changes in consumer learning and industry trends 
over time (Boulding and Staelin 1993; Jacobson 1990). In 
particular, Jacobson (1990, p. 94) warns that “models and 
methods that ignore unobservable factors and disregard the 
consequences of neglecting their effects ... generate insidi- 
ous results." However, because there is no single ideal 
model recommended to control for the unobservable effects 
(e.g., Baltagi 2001; Boulding 1990; Grilliches and Haus- 
man 1986; Murthi, Srinivasan, and Kalyanaram 1996), we 
specify the following nonlinear mixed model to account for 
two-way (firm and year) random effects with the dependent 
variables of Tobin's q:9 


(2) Tobin's qu = p + Bi x MCP1,, + B; x MCP2, 
+ B3 x MCP3, + B4 x MCPdl1; 
+ Bs x MCPd2,, + Bs x CPI, + B; x RD, 
+ Bg x ROA, + XB, Interactions, 
+ Z Dsquac X MCP-squared ,, 
+ 2 Pintsg X MCP-squared x Interactions, 
+ 2 Bcov Covariates,  & +O, + OQ, 





3We also tried to specify the model with cubic terms of MCP 
dimensions but failed to find significant coefficients. 
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FIGURE 3 
MCP Longitudinal Changes 
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TABLE 3 
Parametric and Nonparametric Correlations 


X4 X5 X6 X7 X8 X9 


X1 X2 X3 
MCP best-practice shift (X1) 1.00 .09* —.02 
MCP managerial efficiency (X2) .23** 1.00 .10* 
MCP capital efficiency (X3) —.09* .02 1.00 
MCP returns to scale (X4) 19%* | —14"* | —02 
Tobin's q (X5) .18** .10* 3 
ROA (X6) .06 AI .06 
Stock return (X7) .18** .10* 11" 
Market competition Intensity (X8) .05 .08* —.06 
R&D Intensity (X9) .10* —.07 —.04 
“p< .05. 
**p < .01. 


15™ A ri 03 .18** .03 .09* 
-.12™ .10* 13™ .09* 14° = —01 
—.04 .14** .07 12" —.03 —.05 
1.00 .11* —.04 .10* MET .13** 

.12*™* . 1.00 .06 32** WAIT 1** 
—.03 .04 1.00 .18™ .03 .09* 

.13** .35** .19^* 1.00 .10* .08 

Aor .09* .02 14 — 1.00 .20** 

io .10* .10* .09* .21** . 1.00 


Notes: The entries above the diagonal are Pearson parametnc correlations, and the entries below the diagonal are Spearman nonparametric 


correlatlons. 


where 

MCPI = the managerial efficiency dimension 
of MCP; 

MCP2 - the capital efficiency dimension of 
MCP; 

MCP3 = the best-practice shift efficiency 
dimension of MCP; 

MCPd1 = MCP scale dummy 1 with increasing 

returns to scale; 


MCPd2 = MCP scale dummy 2 with constant 
returns to scale; 
MCP-squared = the quadratic terms of MCP1, MCP2, 
and MCP3; 
CPI = competition intensity; 
RD = R&D intensity; 
ROA = accounting backward-looking perfor- 
mance of ROA; 

Interactions,, = interaction terms for firm i at time t, 
or interactions among R&D inten- 
sity, competition intensity, and 
dimensions of MCP; they include all 
two-way and three-way interactions; 

MCP-squared x 

Interactions, = interaction terms among quadratic 
terms of MCP1, MCP2, and MCP3, 
all above two- and three-way interac- 
tions for firm i at time t; 

Covariates;, = covariates for firm 1 at time t, such as 
ROA, firm size, focus of the firm, 
business type, and market service 
type; 

u = the overall grand intercept; 

B, =the influence of ith explanatory 
variables; 

€ = the disturbance term, normally and 
independently distributed with a con- 
stant variance 02; 

C, =the random disturbance of ith firm, 
constant across periods and normally 
and independently distributed with a 
constant unknown variance O?r; and 
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Q, = the random disturbance of tth year, 
constant across firms and normally 
and independently distributed with a 
constant unknown variance 029. 


We summarize the results in Table 4. The mixed model 
estimates indicate that for the forward-looking performance 
of Tobin's q, firm random effects have a substantial intra- 
class correlation coefficient of 19.6896 (8.30/[28.25 + 
8.30 + 5.62]). Thus, unobserved/omitted firm factors may 
account for as much as 19.6896 of the variance of Tobin's q, 
according to the variance structure distribution of the three 
types of errors in Table 4 (Baltagi 2001). Unobserved time 
random effects are also significant and account for approxi- 
mately 13.3296 of the variance of Tobin's q. These results 
imply that to reduce omitted variable bias, an analysis of the 
cross-sectional time-series panel data should not ignore the 
issue of unobserved heterogeneity and the influence of 
omitted variables (e.g., Arellano 2003; Boulding and Staelin 
1993). 


Hypotheses testing results, The findings provide some 
support for H4: MCP has an inverted U-shaped influence on 
the forward-looking performance. As we show in Table 4, 
marketing communication managerial efficiency positively 
influences forward-looking Tobin's q (MCPI: t = 3.87, p < 
.01), and too much improvement in MCP1 negatively influ- 
ences Tobin's d. The squared term of MCP1 is negative and 
significant (MCP1-squared: t = —3.51, p < .O1), thus sug- 
gesting an inverted U-shaped relationship. Similarly, mar- 
keting communication best-practice shift efficiency bas a 
positive influence on Tobin's q (MCP3: t = 2.86), but too 
great an increase of this efficiency has a negative influence 
(MCP3-squared: t = —2.36, p « .05), as we expected. 

H, predicted that MCP would have a stronger positive 
influence on Tobin's q in firms with higher R&D intensity. 
As we show in Table 4, when coupled with higher R&D 
intensity, positive changes in marketing communication 
managerial efficiency (MCP1 x R&D: t = 2.23, p < .05) and 
best-practice shift efficiency (MCP3 x R&D-related: t — 
1.66, p < .1) lead to higher Tobin's q for the firm. Thus, H» 
is supported for the Tobin's q model, which not only adds 


Looking Looking Forward Looking 

Variables p. (t) p (t) B (t) 
—————————M—————MÓÉÉÉ————— 
MCP Dimensions 

MCP1 (managerial efficlency) 7.67 (3.81***) 8.06 (3.87***) 9.28 (4.03***) 

MCP2 (capital efficiency) 8.65 (1.86*) 12.21 (3.09***) 12.16 (3.15***) 

MCP3 (best-practice shift) 3.73 (1.22) 7.98 (2.86**) 12.73 (3.61***) 

MCPd1 (dummy1) .27 (.98) 5.36 (1.92%) 7.87  (2.52**) 

MCPd2 (dummy2) 13 (.60) 3.55 (1.61) 3.03 (1.31) 

Competition Intensity (CPI) 8.11 (.83) 13.86 (2.57**) 19.31 (2.82**) 

R&D intensity (R&D) 5.62 (1.81*) 7.33  (2.19**) 6.07 (1.85*) 
Interactions 

MCP1 x R&D 6.86 (2.23**) 7.22  (2.36**) 

MCP2 x R&D 12.45 (3.21***) 

MCP3 x R&D 6.76 (2.62**) 3.32 (1.66%) 

MCP1 x CPI 14.19  (3.57***) 10.42 (2.87^) 

MCP2 x CPI 12.11 (2.92**) 13.75  (3.26**) 

MCP3 x CPI 4.88  (3.97***) 

MCP1 x R&D x CPI ' 16.82 (3.84*) 15.20 (3.58**") 

MCP3 x R&D x CPI 18.32 (6.18**) 
Curvilinear Terms 

MCP 1-squared 16.29 (4.15***) —15.37 (-3.51***) —11.76 (-2.21**) 

MCP3-squared —9.28 (-2.836"*) 

MCP1-squared x R&D 19.86 (5.86**") —11.84 (-2.27**) 

MCP1-squared x CPI —9.45 (—1.83") 

MCP2-squared x CPI —18.53 (-3.36***) 

MCP1-squared x R&D x CPI -27.71 (-2.25**) 
HOA — — 1.28 (2.31%) 1.87 (4.15***) 
Firm size 01 (.53) .00 (.22) .00 (.36) 
Markets service types —.08 (-2.52**) —.36 (—1.70*) —.32 (-1.69") 
Business types —02 (-1.69") -.01  (—78) —.03  (—.82) 
Focus of the firm — O03 (-1.23) —00 (—.05) —.04  (—18) 
Og 8.25 (11.32***) 28.25 (35.54***) 26.29 (33.77***) 
Ot 1.08 (2.40**) 8.30 (9.33***) 9.12 (11.18***) 
OQ 2.16 (3.55**) 5.62 (6.71***) 6.42 (7.21***) 
-2 log-likelihood 1726.042 1236.153 1339.145 
Akaike’s information criterion 1732.139 1239.347 1344.207 
Hurvich and Tsai's criterion 1732.203 1239.353 1344.283 
Bozdogan's criterion 1741.116 1246.028 1358.411 
Schwarz's Bayeslan information criterion 1739.187 1243.125 1352.395 


| TABLE 4 
MCP and Shareholder Value 


ROA: Backward 


TobIn's q: Forward 


Stock Return: 


—————————————————ÉÉÓÉÓÉÓÉÓÉÓÉÓÁÉÁMÁ———M—M————ÁÀ 


*p « .10. 
**p < .05. 
***p < .01. 


Notes: MCPd1 = increasing returns (dummy1), MCPd2 = constant returns (dummy2), and the base is decreasing returns to marketing com- 
municatlons expenditures; we do not show nonsignificant Interaction terms. Bayesian Information criterion Imposes a severe penalty for 
more complicated model specifications. In the stock return model, ROA = unanticipated ROA (or time-series residuals), and MCP mea- 


sures are all unanticipated residuals because of the efficient- 


markets rationale from the accounting literature. Because stock return is 


different from Tobin’s q, we used the differences of MCP (or unanticipated MCP) for stock return model and used the levels of MCP for 


the Tobin's q model. 


evidence of the complementary rents of marketing and 
R&D (Dutta, Narasimhan, and Rajiv 1999; Griffin and 
Hauser 1996; Gupta, Raj, and Wilemon 1987; Hayes, 
Wheelwright, and Clark 1988) but also supports the RBV 
that marketing and R&D resources should be configured as 
performance-enhancing bundles. l 

To test H3, that MCP has the strongest influence on 
market value in firms that bave higher R&D intensity and 


function in more competitive markets, we created three-way 
interaction terms among MCP dimensions, R&D, and com- 
petition intensity. As we show in Table 4, two of the three- 
way interactions are positive and significant (MCP1 x 
R&D x CPI: t = 3.84, p < .01; MCP3 x R&D x CPI: t = 
6.18, p < .01), which indicates that a firm’s Tobin’s q is 
greater when the firm excels in MCP and R&D and operates 
in competitive markets over time. This finding suggests that 
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in a competitive market, firms that have higher R&D and 
efficiently invest in their marketing communication pro- 
grams are most likely to obtain sustainable competitive 
advantages dynamically (see Dutta, Narasimhan, and Rajiv 
1999; Srivastava, Shervani, and Fahey 1998). 

Because we hypothesized the impact of MCP as curvi- 
linear relationships in H,, we also include the product terms 
between the moderators (R&D and competition intensity) 
and the quadratic form of the MCP dimensions. As we show 
in Table 4, we find some interesting results. For example, 
because the linear X quadratic interaction terms (MCPI- 
squared x R&D and MCP2-squared x CPI: t = —2.27 and 
—3.36, respectively) are significant and negative, we con- 
clude that overly high levels of MCP harm the firm's 
Tobin's q to a greater degree if the firm has greater R&D 
and/or operates in more competitive markets. This conclu- 
sion 1s also confirmed by the negative three-way linear x 
quadratic interaction term (MCPI-squared x R&D x CPI: 
t = —2.25). Overall, these findings are insightful; when firms 
face tougher competition and seek to capitalize on R&D, 
unbounded improvements in MCP (i.e., relentlessly cutting 
advertising and promotion programs) would be especially 
harmful to future performance. Thus, these findings lend 
further support to H;, H5, and H3. Following the steps that 
Aiken and West (1991) recommend, we illustrate these 
curvilinear and moderated relationships among Tobin's q, 
MCP, R&D, and competition intensity in Figure 4. 

However, in contrast to the strong support of the 
hypotheses in the Tobin's q model, the support in the stock 
return model is less significant. More specifically, in the 
stock return model, we used the unanticipated changes of 


FIGURE 4 
The Curvilinear Influences of MCP on 
Shareholder Value 
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ROA and MCP dimensions (Jacobson and Aaker 1987; 
Mizik and Jacobson 2003), because according to the effi- 
cient markets theory, the current stock price incorporates all 
previous information with profit implications. That is, 
abnormal stock returns in the financial markets react only to 
unanticipated information (e.g., Erickson and Jacobson 
1992; Kothari 2001). In other words, stock prices are essen- 
tially a random (strong or weak) walk. Although criticism 
of such efficient market arguments exists, in general, the lit- 
erature seems to support them (e.g., Chauvin and Hirschey 
1993; Mizik and Jacobson 2003). Therefore, we define 
unanticipated changes of ROA and MCP as their deviations 
of the series from predictions based on prior information (a 
proxy for expectations). We operationalized these devia- 
tions as the residuals from a time-series forecast model 
(e.g., Grilliches 1995; Mizik and Jacobson 2003). As a fur- 
ther clarification, because stock return is different from 
Tobin's q (Erickson and Jacobson 1992), we used the differ- 
ences of MCP (or unanticipated MCP) for stock return 
model and the Jevels of MCP for the Tobin's q model in 
Equation 2. As we report in Table 4, overall, the results pro- 
vide less support for the hypotheses in the stock return 
model. In particular, the results indicate that an unantici- 
pated change in marketing managerial efficiency has a posi- 
tive influence on forward-looking performance of stock 
return (MCPI: t = 4.03, p < .01), but this influence becomes 
negative (MCP1-squared: t = —2.21, p < .01) at too high 
increases in MCPI, as we expected. Thus, the data further 
support H, that MCP1 has an inverted U-shaped influence 
on stock return. However, only one linear x quadratic inter- 
action term in stock return (MCPI-squared x CPI: t = 
—1.83, p « .1) is negative, as we expected, and none of the 
three-way linear x quadratic interaction terms are signifi- 
cant. Thus, these findings show much less ‘support for H, 
and H4 in the stock return model. These less significant 
results in the stock return model (compared to Tobin's q) 
are not totally unexpected; a recent study by Mittal and col- 
leagues (2005) also finds that their marketing variables 
strongly influence Tobin's q but do not affect stock return 
significantly. 

We also examine the influences of MCP on backward- 
looking performance with ROA. As we report in Table 4, 
changes in MCP are less significantly related to ROA; only 
two dimensions are significant statistically, whereas four 
dimensions are related to shareholder value. These limited 
findings regarding the influence of MCP on ROA (MCPI: 
t = 3.81, p < .01; MCP2: t = 1.86, p < .1) and the impact of 
few quadratic terms are consistent with prior literature. 
Empirically, existing studies (e.g., Chauvin and Hirschey 
1993; Erickson and Jacobson 1992) have established a posi- 
tive link between the multiperiod accumulated effective 
advertising expenditures and profitability ROA. Intuitively, 
improving MCP by cutting costs increases ROA because 
cutting operating expenses should consistently enhance 
accounting profitability ratios (Aaker and Carman 1982). 

Note that because the total effects of MCP include (1) 
direct main effects, (2) indirect effects through ROA, and 
(3) moderated effects with R&D and competition intensity 
on Tobin's q and stock return, the total impact may not 
strictly follow an inverted U shape. In particular, although 


total effects (direct, indirect, and moderated impact) of 
MCP have an inverted U shape for the Tobin’s q'model, as 
we expected, this is not the case for the stock return model. 
Nevertheless, because modern marketing theory (Rust et al. 
2004) is more concerned with marketing productivity’s 
implications for the forward-looking shareholder, value and 
less concerned with implications for the backward-looking 
ROA profitability, interpretations of our results should 
focus more on direct main effects of MCP and their moder- 
ated effects with R&D and competition intensity (which 
have an inverted U shape, as hypothesized) than on the indi- 
rect effects through ROA, a backward-looking, short-term 
performance measure. 


Additional Analyses 


We adopted several steps to conduct additional analyses to 
ensure the robustness of our results. Because Malmquist 
modeling is nonparametric and may be sensitive to data 
extreme points and measurement error (Banker, Charnes, 
and Cooper 1984; Fire Grosskopf, and Lovell 1994; Fare et 
al. 2004), we repeat the modeling analyses with different 
combinations of variables (i.e., two outputs and four inputs, 
three outputs and three inputs, and different lags of adver- 
tising and R&D in adjusted sales revenue). The results per- 
taining to MCP across the different specifications are highly 
correlated (i.e., smallest r = .83, p < .01), and the longitudi- 
nal changes of MCP are robust. Furthermore, we checked 
whether our results were robust when using two-year-based 
sales growth (t versus t — 1). In doing so, we reran all the 
39,872 + 22,784 linear programming models with the sim- 
ple annual growth rate (from t — 1 to t). These results of dif- 
ferent dimensions of MCP measures (MCP1, MCP2, 
MCP3) are highly consistent. The correlation results for the 
three pairs of MCP based on the three-year compounded 
sales growth and MCP based on the two-year sales growth 
were r = .92, .92, and .91 (all ps < .01). In addition, as we 
discussed previously, because there is a reverse-causality 
concern between financial performance and Fortune’s repu- 


tation measure, we conducted additional analyses. That is, 
we teased out the reverse causality of corporate reputation 
with the residual approach (Roberts and Dowling 2002). 
Then, we used the estimated “clean” residual corporate 
reputation (without reverse causality) and conducted one 
more round of estimation of all the 39,872 + 22,784 linear 
programming models. We found that the MCP results are 
robust. Across the dimensions, the correlations r for the 
three pairs of MCP based on the original score of corporate 
reputation and MCP based on the residual of corporate 
reputation are .89, .86, and .88 (all ps < .01). Table 5 reports 
a summary of the different sensitivity analyses in MCP 
results. Thus, it seems that our empirical work is reliable, 
and the MCP results are reasonably credible. 

In addition, recall that MCP is a rather complex trans- 
formation of the three outputs that may correlate with MCP 
(i.e., significant correlations include raycpi and sales level) = 
-11, TMCP1 and sales growth) = 14, I(MCP1 and reputation) 7 12, 
I(MCP2 and sales level) = -12, and r(MCP2 and reputation) = - 
Thus, we test whether a more parsimonious model might 


perform just as well. That is, we added the three outputs as 
covariates on top of those controls in Equation 2 and reesti- 
mated this more parsimonious model. The results on the 
impact of MCP still hold. In particular, the effects of MCP 
on Tobin’s q are robust and in the expected inverted U shape 
for both MCP1 and MCP3, though MCP2-squared is still 
nonsignificant as in the base model (p > .05). Moreover, in 
an alternative approach, we tested our hypotheses with a 
hierarchical regression model (see Luo, Rindfleisch, and 
Tse 2007; Slotegraaf, Moorman, and Inman 2003) to 
account for the correlations between MCP and the three 
outputs and to examine the additional explanatory power of 
MCP on influencing Tobin’s q (beyond that of the three out- 
puts). In particular, we first regress MCP against the three 
outputs to obtain residuals that have parceled out the corre- 
lations with the three outputs. We then employ these residu- 
als as MCP measures when reestimating Equation 2. Again, 
the results of this alternative modeling approach fail to 


TABLE 5 
Sensitivity Results of MCP 
Overall Produc- Produc- Produc- Produc- Produc- Produc- Produc- 
Productivity tivity tivity tivity tivity tivity tivity tivity 
Change Change Change Change Change Change Change Change 
1995-2002 1995-1996 1996-1997 1997-1998 1998-1999 1999-2000 2000-2001 2001-2002 

MCP overall 1.282 1.231 1.213 1.141 1.223 1.307 1.652 1.207 
(baseline) (.823) (.699) (.754) (.603) (.604) (.642) (1.479) (.480) 

MCP overall 1.279 1.227 1.221 1.138 1.219 1.310 1.646 1.209 
(different (.908) (.713) (.675) (.642) (.610) (.683) (1.511) (.486) 
sales i 
growth 
measure) 

MCP overall 1.212 1.209 1.224 1.093 1.218 1.295 1.663 1.192 
(different (.910) (.732) (.862) (.727) (.715) (.829) (1.385) (.523) 
corporate 
reputation 
measure) 


Notes: We based MCP baseline results on the three-year com 
different sales growth measure on the two-year simple s 
eure on corporate reputation after accommodating the 
measure. 


pounded sales growth rate and corporate reputation. We based MCP results with 
ales growth rate. We based MCP results with different corporate reputation mea- 
reverse-causality issue between financial performance and Fortune reputation 
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reject our hypotheses of the inverted U shape and moder- 
ated impact of MCP on Tobin’s q. 

Furthermore, employing Hausman’s (1978) tests, we 
compared the random effects model with the fixed effects 
models (firm-specific, year-specific fixed effects; see Arel- 
lano 2003; Boulding 1990; Boulding and Christen 2003). 
The Hausman test essentially tests the null hypothesis that 
the coefficients estimated by the relatively efficient random 
effects model are the same as the coefficients estimated by 
the relatively consistent fixed effects model (e.g., Baltagi 
_2001). A nonsignificant p value of this test would suggest 
that the use of the random effects model is appropriate, and 
a significant p value would indicate that the use of the fixed 
effects model is appropriate. As we report in Table 6, the 
Hausman test result is not significant (ya. = 2.788, p > 
.05), thus favoring the specified random effects model in 
Equation 2. This is consistent with the similar findings in 
prior studies that also favored random effects modeling 
(e.g., Anderson, Fornell, and Mazvancheryl 2004; Mittal et 
al. 2005; Rao, Agarwal, and Dahlhoff 2004). Using the 
Bayesian information criterion, we conducted further likeli- 
hood ratio tests and found that the model with two-way ran- 
dom effects in Equation 2 outperformed (p « .05) the rival 
models with either time or firm random effects. In addition, 
because random effects modeling assumes an exchangeable 
covariance structure, we repeat the analyses with different 
specifications of covariance structure, including unstruc- 
tured, AR(1), AR(1) heterogeneous, Toeplitz heteroge- 
neous, and compound symmetry (Arellano 2003; Baltagi 
2001; Boulding and Staelin 1993). As we show in Table 6, 
we fail to obtain significant differences (largest X24 = 
3.081, p » .05) across these covariance structure 
specifications. 

Finally, in Equation 2, we specified random intercepts 
models (Gonul and Srinivasan 1993) in testing the direct 
and moderated impact of MCP on Tobin's q. However, 
because firms might differ in effectiveness in their deploy- 
ment and exploitation of MCP, we also tried to test the 


robustness by fitting random coefficient models (Gonul and 
Srinivasan 1993; Murthi, Srinivasan, and Kalyanaram 1996; 
Rao, Agarwal, and Dahlhoff 2004). Using likelihood ratio 
tests, we failed to reject the null hypothesis that there is no 
significant variation in firm financial value due to the 
effects of random coefficients. Overall, we believe that the 
results are fairly robust. 


Discussion 


Marketing's credibility has been questioned, and the need to 
link marketing expenditures to financial performance has 
been stressed. However, the impact of MCP on firm market 
value has been unclear. In response to these issues, we 
developed a measure of MCP based on the neoclassical pro- 
duction theory and RBV. Our study tests a conceptual 
framework that accommodates both the curvilinear influ- 
ence of MCP on shareholder value over time and the mod- 
erating roles of competition and R&D intensity. The key 
takeaway points of this study are that (1) firms that are more 
productive in converting advertising and promotion 
resources into marketing outputs may create greater share- 
holder value over time, (2) too great an increase in MCP 
can also be harmful, and (3) these results are especially 
robust for firms with intense R&D expenditures and that 
operate in competitive markets. 


Implications for Marketing Researchers 


Our study answers Rust and colleagues' (2004, p. 83) call 
for new methodology to measure marketing productivity, in 
which they clearly identify a key gap in the literature when 
they state that “few methods currently exist for comprehen- 
sively modeling the chain of marketing productivity all the 
way from tactical actions to financial impact or firm value." 
Our article explores this issue by introducing Malmquist 
modeling to measure MCP. The Malmquist approach has 
direct and handy implications for marketing modelers and 
strategists interested in evaluating the chain of marketing 





TABLE 6 
Random Effects Model Specifications Results 
Test Statistics p Value Conclusion 
Model Specification Comparisons 

Random firm and time effects versus 

fixed firm and time effects X7ant = 2.788 >.05 Random effects model is supported. 
Random firm and time effects versus 

random firm effects only X7 ant = 21.853 «.05 Two-way random effects model Is preferred. 
Random firm and time effects versus 

random time effects only X7 att = 33.096 <.05 Two-way random effects model Is preferred. 

Likelihood Ratio Test Comparing 
Different Covariance Structure In 
Random Effects Model 
Unstructured versus AR(1) x af = 2.675 >.05 Estimates are robust across covariance 
structure specifications. 
AR(1) versus AR(1) heterogeneous X- at = 2.846 >.05 Estimates are robust across covariance 
structure specifications. 

AR(1) heterogeneous versus Toeplitz att = 2-933 >.05 Estimates are robust across covariance 

heterogeneous structure specifications. 
Toeplitz heterogeneous versus 1 ait = 3.081 >.05 Estimates are robust across covarlance 
. compound symmetry structure specifications. 
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productivity and efficiency in novel ways. For example, this 
technique can be applied to benchmark consumer decision- 
making productivity and price promotion productivity with 
time-series consumer panel data (e.g., Dominick’s [a gro- 
cery store] scanner data, Consumer Reports, ACNielsen 
data) and Internet pricing productivity and cross-channel 
efficiency dynamically with online clickstream data (e.g., 
comScore data, firm customer relationship management 
databases). 

In addition, our Malmquist approach extends research 
on how to test the RBV empirically with scientific model- 
ing techniques. Unlike most regression approaches that 
compare against average performers and neutralize firm dif- 
ferences with a central tendency, the Malmquist, technique 
deals with best practices and above-average players. Simi- 
larly, the RBV is a theory about extraordinary performers 
(Hansen, Perry, and Reese 2004) and above-normal firms 
that outperform their industries (Wiggins and Ruefli 2005), 
a point with which Malmquist measurement modeling fits 
neatly. 

The existing literature seems to be enthusiastic about 
improving MCP, but it ignores its potential dark: side (with 
one notable exception being the work of Anderson, Fornell, 
and Rust [1997]). We suggest that an unrestricted increase 
of MCP may be harmful and cause negative market returns. 
Achieving overly high productivity may not be the most 
optimal way to improve firm market value, because doing 
so may hurt service quality and customer satisfaction, 
which are critical for firm future performance (Anderson, 
Fornell, and Rust 1997). The results of the negative qua- 
dratic influence of MCP may help explain the “lost” credi- 
bility of marketing; unbridled increases in MCP generate 
suboptimal market value for the firm over time, and only a 
well-balanced increase in MCP will produce higher Tobin’s 
q and stock returns. 

A burgeoning stream of research has begun to explore 
the linear, monotonic relationship between shareholder 
value and its predictive marketing variables (e.g., Anderson, 
Fornell, and Mazvancheryl 2004; Dutta, Narasimhan, and 
Rajiv 1999; Gruca and Rego 2005; Lee and Grewal 2004). 
Our findings of the nonlinear relationships between MCP 
and firm shareholder value extend this line of research. The 
financial literature similarly recommends such nonlinear 
models as the proper way to model stock price changes and 
stock returns (e.g., Hiemstra and Jones 1994). Therefore, 
we encourage marketing researchers to look beyond simple, 
linear relationships in determining firm market value. 

The supported impact of interactions between MCP and 
R&D intensity adds more evidence to Dutta, Narasimhan, 
and Rajiv’s (1999) argument about the complementary rents 
of marketing and R&D capabilities. Our finding of a posi- 
tive influence in the three-way interaction among MCP, 
R&D, and competition intensity on Tobin’s q also suggests 
a hidden point; namely, the synergy between marketing and 
R&D may be even greater when firms are motivated by 
market competition for Schumpeter-type entrepreneurial 
rents (Schumpeter 1934; Wiggins and Ruefli 2005). Other 
possible contextual moderators should be considered in 
future efforts to evaluate marketing productivity. In addi- 
tion, it would be appealing for research to test our frame- 


work in a broader sense by including the influences of other 
perception-based factors, such as pricing strategies and 
market orientation. We were not able to do so empirically 
because of the limited availability of archival data. Thus, we 
call for further efforts with both primary survey data and 
secondary archival data to extend this line of research. 


Implications for Marketing Practitioners 


Where is marketing’s credibility? Contrary to the common 
criticisms about advertising accountability and the decline 
of advertising and promotion productivity, our results indi- 
cate a positive influence of MCP on shareholder value. 
These findings help justify advertising and sales promotion 
spending, promote the creditability of the marketing func- 
tion, and establish marketing as a distinct capability within 
the firm. To put the results into perspective, we calculate 
that for a typical firm in our sample, which has an average 
market value of approximately $43 billion, a 1% improve- 
ment in MCP (managerial efficiency) will add approxi- 
mately $45 million more returns in the following year. For a 
high-tech firm such as Dell, with a market value of around 
$80 billion, a 1% increase in MCP translates into a substan- 
tial financial return: approximately $85 million. 

Our Malmquist approach to MCP offers marketers a 
valuable tool for benchmarking and evaluating the results of 
advertising and promotion expenditures dynamically. To 
sell marketing and advertising programs to financial offi- 
cers, marketers need to quantify the marketing budget and 
demonstrate marketing’s contribution to shareholder value. 
In this context, our Malmquist-based scientific technique 
helps solve the thorny situations corporate marketers are 
currently facing: 


For many years, corporate marketers have walked into 
budget meetings like neighborhood junkies. They couldn't 
always justify how well they spend or what difference it 
all made. They just wanted more money. Old tricks simply 
don't work, there's hardly a marketing executive today 
who isn't demanding a more scientific approach to help 
defend marketing strategies in front of the chief financial 
officer. Marketers want to know the actual value of each 
dollar.... The push is from top ranks. [Chief executive 
officers, chief financial officers,] and even board directors 
have relentlessly cut costs in every corner of their compa- 
nies.... The bean counters know that marketing matters. 
But they're hazy about how much or what kind. (Busi- 
nessWeek 2004, p. 112) 


Although improving MCP largely promotes firm perfor- 
mance, blindly pushing for high MCP can be harmful for 
firm market value. The nonlinear findings of negative 
returns on MCP suggest that managers should resist the 
temptation to improve productivity simply by relentlessly 
cutting corners. Given a trade-off between productivity and 
customer satisfaction (Anderson, Fornell, and Rust 1997), 
an extremely lean marketing budget is not necessarily opti- 
mal. In addition, unbounded improvements in MCP, includ- 
ing cutting advertising and promotion programs, would be 
especially harmful for future performance if the firm faces 
tougher competition and seeks to capitalize on its R&D 
capabilities. Thus, these findings advise managers to take a 
cautious view of improving the productivity of their adver- 
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tising and promotion expenditures. In conclusion, our study 
may help management achieve a more complete under- 
standing of MCP, benchmark it against best practices over 
time, and examine its shareholder value in more depth and 
sophistication so that the marketing productivity crisis can 
be reversed. 


Appendix 
Malmquist Linear Programming for 
MCP 


According to Malmquist (1953), a productivity change 
measure can be calculated by comparing the inputs of a 
firm at times t and t + 1 according to the maximum factor, 
by which the input in time t could be decreased such that 
the firm would continue to produce the same level of out- 
puts at time t + 1. Caves, Christensen, and Diewert (1982) 
extend this idea and develop the total factor productivity 
change from t to t + 1; they label it "the Malmquist produc- 
tivity measure." Combining Caves, Christensen, and Diew- 
ert's productivity index and Farrell's (1957) Pareto-based 
efficiency concept, Fáre and colleagues (Füre et al. 1992; 
Fare, Grosskopf, and Lovell 1994; Fare et al. 2004) con- 
struct the linear-programming- (DEA-) based Malmquist 
productivity change from t to t + 1. This measure of MCP 
for each firm can be calculated with a series of linear pro- 
gramming models. The composite score of MCP for firm,, 
or M,, requires two single-period and two mixed-period 
efficiencies for the DEA. programming (Banker, Charnes, 
and Cooper 1984; Charnes, Cooper, and Rhodes 1978), as 
follows: 


(Al) M, -Piai 1 y5* D Dot !(x5* 5 yo* oP 
D$ (x5; Yo) D$* !(x$. Yo) 

where Di(x5, y$), D&* 1x5 * 1, yh+4, D&*!(x5, yb), and 

Dt(x5 * 1, y5* 1) are relative efficiencies estimated by four 

linear programming models. 

After some mathematical manipulations, we can decom- 
pose the Malmquist measurement (M,) of MCP into several 
components (e.g., Caves, Christensen, and Diewert 1982; 
Fare, Grosskopf, and Lovell 1994). As we show subse- 
quently, the first component of MCP1 is the change of pro- 
ductivity from time t to t + 1 that is due to managerial eff- 
ciency (or the effects of moving toward the best-practice 
frontiers). This efficiency represents how much improve- 
ment in MCP can be achieved if executives enhance the 
skills of managing their marketing communication com- 
pared with the best-practices. The second component, 
MCP2, is change in productivity from time t to t + 1 that is 
due to capital efficiency (or the effects of moving between 
the best-practice frontiers with different returns-to-scale 
model specifications). This efficiency represents how much 
improvement in MCP can be achieved if executives pro- 
mote the economies of scale when using the marketing 
communication capitals. The third dimension of MCP3 is 
the change of productivity from time t to t + 1 that is due to 
best-practice shift efficiency (or the effects of moving away 
from the best-practice frontiers from t to t + 1). This effi- 
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ciency represents how much improvement in MCP can be 
achieved through factors that are not related to marketing 
communication managerial skills but rather to uncontrol- 
lable factors such as industry communication creativity 
innovation and environment. Finally, the returns to scale of 
marketing communications capital (or the effects of moving 
along the best-practice frontiers) is a categorical variable 
that can increase, be constant, or decrease in returns to scale 
(Caves, Christensen, and Diewert 1982; Fare, Grosskopf, 
and Lovell 1994). The equation is as follows: 


Dore 1h" 1, yht!) 
(A2) M,(t t+ 1)= NEn 
Dot (xL,yt) 


MCP] 


CRS VRS 
_ [DSBS 65 ypt DYRS Gt tos n 
CRS VRS 
Det (x, bYDo: (x5, y5) 
-—«-—————— d—————————————— 
MCP2 


1/2 
DERS(xt+l,ygtt) DEE $, xb) 
DERS ttl, ytti) DEES (x$, y$) 


MCP3 


For the sake of brevity, we illustrate the details of only 
one linear programming model for D$ + 1(x$, y5). The other 
linear programming models are similar, with only some 
changes in the superscript of either t or t + 1. The term . 
D$ * !(x5, yt) assesses the amount by which marketing com- 
munication inputs at time t can be reduced but produce the 
same amount of output levels at time t when firm M, is 
compared with the best-practice frontier at time t + I. 

D$ * (x5, yb) = min 8o, 
subject to (1) multiple outputs constraints: 


n 
S ASAL! *! 2 SAL, 
j-1 


n 
A SAG! +! 2 SAGE, and 
j=l 


YA REP! + ! 2 REP}; 
j=l 
(2) multiple inputs constraints: 


n 
V A ADB! + 1 «0 ADB, 
j=l 


n 
S AADP! + 1 <0 ADPS, 
pel 


Y XADO! +! «0 ADOS, and 
jl 
n 
M A SAPt+1 «0 SAPS 
je4 


and (3) returns-to-scale constraints for variable returns-to- 
scale models: 


YA — ], and 
j=l 


,20,f=1,2,...j=03 


where À 2 0; j = 1, 2;7~.., n = sample size; r = 1, 2, ..., $ 
outputs; and i = 1, 2, ...., m inputs. There are four market- 
ing expenditure inputs (firms’ expenditure in. broadcast 
[ADB], print [ADP], outdoor [ADO], and sales promotion 
[SAP]) and three marketing outputs (sales level [SAL], 
sales growth rate [SLG], and corporate reputation [REP]). 
Adjusted sales capture the advertising carryover effect and 
the influence of other variables, such as R&D. The parame- 
ter theta essentially means the degree to which the inputs 
can be reduced without hurting the levels of outputs (e.g., 
Horsky and Nelson 1996; Luo 2004; Murthi, Srinivasan, 


and Kalyanaram 1996). Thus, the optimal minimized theta 
represents the best scenario in which a firm can achieve the 
highest possible efficiencies in marketing communication 
overall productivity benchmarked against the best-practice 
frontier. In addition, Di(xt, yt) measures the amount by 
which marketing communication inputs at time t can be 
reduced while producing the same amount of output levels 
at time t compared with best practices at time t; 
Dt * 1(x& +1, yt +1) measures the amount by which market- 
ing communication inputs at time t -- 1 can be reduced 
while producing the same amount of output levels at time 
t + 1 compared with best practices at time t + 1; and 
D&(xi + 1, y$ +1) measures the amount by which marketing 
communication inputs at time t -- 1 can be reduced while 
producing the same amount of output levels at time t + 1 
compared with best practices at time t. 
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Erica Mina Okada 


Upgrades and New Purchases 


In many industries, firms introduce successive generations of new and enhanced product models to remain 
competitive. Product enhancements target first-time purchasers and consumers who already own an existing 
product and are considering upgrading to a new and enhanced product. Upgrade decisions differ from new 
purchase decisions because they are hindered by the psychological costs associated with the purchase price spent 
on the existing product. This research examines how the enhanced product can be positioned relative to the existing 
product to mitigate the psychological costs and facilitate upgrade purchases. A series of studies demonstrate that 
consumers with existing products are more likely to upgrade when the enhanced product is generally dissimilar to 
the existing product. Upgraders have stronger preferences than first-time purchasers for enhanced products with 
new features rather than for improvements on existing features, and they prefer enhancements that focus on a few 


key features rather than a general enhancement of all features. 


of a firm’s success (Urban and Hauser 1993). In 

many industries, the market size typically does not 
expand to accommodate the proliferation of increasingly 
sophisticated models, so the commercial success of each 
generation of product enhancements depends to a large 
extent on current owners of the existing model who upgrade 
to the new model before the useful life of the existing 
model expires. When Sony introduced the PlayStation 2 
(PS2) in March 2000, the highly popular original PlaySta- 
tion that it launched only four years earlier had an installed 
base of 30 million in the United States. By the end of 2003, 
PS2 had an installed based of almost 30 million (Peterson 
2004). Given the total size of the U.S. market, PS2 sales 
must have come not only from new buyers in the market but 
also significantly from owners of the original PlayStation 
who upgraded to PS2. 

Product enhancements target both new purchasers and 
"upgraders"—that is, consumers who already own a prod- 
uct and upgrade to a new, enhanced product—but upgrade 
decisions are different from new purchase decisions. The 
upgrader already has an existing product, so the marginal 
benefit of the enhanced product will likely not be as great as 
for a new buyer who does not have an existing product. 
Another economics-based explanation is that the upgrader 
may be more knowledgeable about the product than a new 
purchaser, and his or her preferences may have evolved dif- 
ferently over usage. There are also the psychological factors 
based on mental accounting (see Thaler 1980, 1999). Okada 
(2001) demonstrates how upgrade decisions are hindered by 
the psychological costs associated with taking an existing 
product out of commission while it is still functional. This 


C product enhancement is a key component 
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is a psychological cost for upgraders but not for first-time 
purchasers. 

This research builds on existing literature on mental 
accounting (Thaler 1980, 1999) and sunk costs (Arkes 
1996; Arkes and Blumer 1985; Heath 1995; Heath and Fen- 
nema 1996; Heath and Soll 1996; Okada 2001) to explore 
how those psychological costs can be mitigated in ways that 
are within a marketer's control. The psychological costs 
become a lesser impediment to upgrading when the con- 
sumer perceives the enhanced product as dissimilar to the 
existing product. Therefore, between two enhanced prod- 
ucts that may be equally attractive to a new buyer with no 
prior purchase, the upgrader will find the one that is more 
dissimilar to the existing product to be relatively more 
attractive. Even after accounting for any differences in 
knowledge and preferences (see Joshi and Sharma 2004) 
between upgraders and new purchasers, the differences still 
maintain. To appeal to upgraders, then, marketers should 
position their enhanced product as being dissimilar to the 
existing product. 

Marketers can enhance a product by improving existing 
product attributes and by adding new attributes. On the 
basis of research on attribute alignability (Markman and 
Gentner 1993; Markman and Medin 1995), I define the for- 
mer as "alignable enhancements" and the latter as “non- 
alignable enhancements." Research on perceived similarity 
(Tversky 1977) and product comparability (Johnson 1984) 
suggests that alignable enhancements would make the new 
product more similar to the existing product, and non- 
alignable enhancements would make the new product more 
dissimilar. Compared with a first-time purchase, non- 
alignable enhancements would be relatively more desirable 
in an upgrade than proportional alignable enhancements. 
Furthermore, enhancements that focus on a select few fea- 
tures would be relatively more desirable in an upgrade than 
more general enhancements across all features. 


Theoretical Development 
In an upgrade, a consumer who already owns a product that 
is still functional purchases a new enhanced product, which 
makes the existing product redundant (Okada 2001). 
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Although consumers who purchase an enhanced product 
may not necessarily dispose of the existing product, this 
conceptualization of “upgrade” is consistent with the collo- 
quial usage of the word. A traveler who upgrades opts for 
an enhanced class of travel instead of traveling coach. A 
user who upgrades to a new computer may keep the existing 
one as a backup but will likely use the new, enhanced com- 
puter primarily. Thus, I contrast upgrade decisions to new 
purchase decisions. 


Upgrading to Enhanced Products That Are 
Dissimilar Versus Similar 


People keep a mental account of the costs and benefits of 
transactions over time (Gourville and Soman 1998; Heath 
and Fennema 1996; Thaler 1980, 1985, 1999). When a con- 
sumer purchases a product, he or she opens a mental 
account and posts the purchase price as a negative entry into 
the account, but the consumer feels no pain at the time of 
payment, because he or she psychologically links the costs 
and benefits of the transaction (Hirst, Joyce, and Schade- 
wald 1994; Kahneman and Tversky 1984). As the consumer 
uses the product, the benefits from consumption are posted 
in the same account as positive entries. As the cumulative 
enjoyment from consumption increases, the consumer gets 
his or her money’s worth from the purchase. The account is 
closed when he or she finishes consuming the product. 

If an upgrader purchases an enhanced product, he or she 
will no longer use the existing product, which triggers the 
closing of that mental account. There is a psychological 
cost associated with closing the existing account in the red, 
that is, before the consumer has gotten his or her money’s 
worth from the existing product. That is what hinders 
people from upgrading, even when the marginal cost- 
benefit comparison favors it (Okada 2001). ` 

In mental accounting terms, what differentiates upgrade 
decisions from new purchase decisions is the “write-off” of 
the existing product. The purchase price spent on the exist- 
ing product is a sunk cost (see Arkes 1996; Arkes and 
Blumer 1985), which normatively should be ignored in 
prospective decisions, but the motivation to recoup the sunk 
cost by continuing to use the existing product hinders con- 
sumers’ upgrade decisions (Okada 2001). I propose that the 
' sunk cost in the existing product is more likely. to be disas- 
sociated from the upgrade decision and, therefore, to 
become less of an impediment when the enhanced product 
is dissimilar to the existing product. 

Categorization is the basis for mental accounting 
(Thaler 1985). People tend to mentally account for similar 
products together and for dissimilar products separately 
(Henderson and Peterson 1992), and prospective purchase 
decisions are influenced more by prior purchase decisions 
in similar categories (Heath and Soll 1996). For example, a 
dinner out at a restaurant and a theater ticket are more alike 
than a pair of slacks and a theater ticket. Thus, a consumer 
who may not buy a theater ticket if he or she has already 
gone out to a dinner earlier in the week—because that 
would be overspending in the entertainment category—may 
still buy the same theater ticket after having bought a pair of 
slacks that are comparable in price to the dinner out. The 


prospective theater purchase is affected more by a prior 
restaurant dinner purchase because the two are alike. The 
prospective clothing purchase is affected less because it is 
dissimilar to the restaurant dinner. 

Likewise, in an upgrade, the decision to purchase an 
enhanced product should be affected less by the existing 
product the more it is dissimilar to it. When the enhanced 
product is more dissimilar to the existing product, the 
prospective write-off of the sunk cost in the existing product 
should become more disassociated from the consideration 
of the enhanced product, and it should become less of an 
impediment to the upgrade decision. Thus, upgraders 
should have relatively stronger preferences than new pur- 
chasers for enhanced products that are more dissimilar to 
their existing products. 

Hı: When a person has paid for and already owns an existing 

product, an enhanced product that is dissimilar to the 


existing one should be relatively more preferable than a 
proportionally enhanced product that is similar. 


In the current context, “proportional” means the extent 
to which a new purchaser would consider two enhanced 
products comparable in their overall desirability, if it were 
not for the existing product. For example, a person may nor- 
mally deem a six-day ski vacation to Aspen and a six-day 
cruise to Acapulco to be equivalently desirable. However, if 
he or she first paid a nonrefundable deposit on a two-day 
ski trip to the Poconos and was later presented with the 
Opportunity to go on either of the more “enhanced” vaca- 
tions instead, he or she may be more likely to choose the 
more dissimilar Acapulco cruise than the more similar 
Aspen ski trip. Presumably, the psychological cost of writ- 
ing off the nonrefundable deposit would be more likely to 
be disassociated from the decision to upgrade to the cruise 
than to a better ski trip. The current theory centers on the 
link between the (dis)similarity of the enhanced product 
and the extent to which the sunk cost in the existing product 
is considered in, and thus impedes, the upgrade decision. 


Positioning New Products as Nonalignable Versus 
Allgnable Enhancements 


If dissimilar enhanced products are more desirable to 
upgraders, the products should be positioned as such to tar- 
get them. Two items will be perceived as more similar if 
they share more attributes in common and more dissimilar 
if they are characterized by attributes that are unique to only 
one (Johnson 1984; Tversky 1977). Attributes that two (or 
more) products share in common are alignable attributes, 
and those that are possessed only by one are nonalignable 
attributes (Markman and Medin 1995). Products with 
alignable attributes tend to belong to similar product cate- 
gories (Markman and Gentner 1993), but (dis)similarity is a 
relative measure. Even in the same general product cate- 
gory, some item pairs may be more similar than other pairs. 
A Mazda Miata roadster and BMW Z3 roadster are more 
simular than a Mazda Miata and a Lexus RX330 SUV. The 
former pair has relatively more alignable attributes between 
them than the latter pair. 

In terms of a multiattribute model that explains con- 
sumers’ preferences and choices as a function of products’ 
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attribute levels and importance (see Bettman, Capon, and 
Lutz 1975; Green, Carroll, and Goldberg 1981; Kahn and 
Meyer 1991; Shocker and Srinivasan 1979), an enhanced 
product, relative to an existing product, is characterized by 
a combination of higher levels of existing attributes and the 
addition of new attributes. 

The term “alignable enhancements” refers to the 
improvement of existing attributes, and “nonalignable 
enhancements” refers to the addition of new attributes. For 
example, Panasonic introduced the Versio model of the 
mobile phone in 2000; it weighed 85 grams and had no 
browser capabilities. The new X70 model weighs 10% less, 
which is an alignable enhancement; has browsing capabili- 
ties, which is a nonalignable enhancement; and is an overall 
enhancement of the Versio. Perceived similarity between 
pairs of items increases with the number of features that the 
two share in common and decreases with the number of fea- 
tures that are distinctive to just one (Tversky 1977). In light 
of research on similarity (Tversky 1977) and product com- 
parability (Johnson 1984), nonalignable enhancements 
should make the enhanced product more dissimilar to the 
existing product than should proportional alignable 
enhancements. 

H, proposed that consumers should find it easier to 
upgrade to enhanced products that are dissimilar to the 
existing product than to products that are similar to it. If 
nonalignable enhancements make the new product more 
dissimilar, compared with new purchasers, upgraders 
should have relatively stronger preferences for nonalignable 
enhancements than for proportional alignable enhancements. 

H>: When a person has paid for and already owns an existing 

product, an enhanced product that is characterized by 
additional attributes should become relatively more desir- 


able than a proportionally enhanced product that is char- 
acterized by improvements to existing attributes. 


In a previous article (Okada 2001), I examined upgrades 
characterized by alignable enhancements and found that the 
mental cost of writing off the existing product directly 
impeded the consumer's upgrade decision. The current 
research complements my previous work by differentiating 
between nonalignable and alignable enhancements and by 
showing how this differentiation alters the effect. The men- 
tal cost of upgrading should be mitigated when the 
enhancements are nonalignable. 

Adding features makes the enhanced product more dis- 
similar to the existing one, but it may not always be feasible 
or practical. This research compares consumers' percep- 
tions of proportional product enhancements. However, 
when consumers are unfamiliar with the added features, 
they may not value the enhancements as much (Zhang and 
Fitzsimons 1999; Zhang and Markman 1998). Furthermore, 
if consumers perceive added features as trivial, the enhance- 
ments may backfire and actually diminish the value of the 
product (Mukherjee and Hoyer 2001; Simonson, Carmon, 
and O'Curry 1994). An enhanced product can still be posi- 
tioned as dissimilar from the existing product without 
adding new features and based on just the existing attributes 
by focusing the enhancements on a select few attributes. 
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Based on a multiattribute model, it is possible to achieve 
proportional levels of product enhancement by higher 
increases in the levels of a select few attributes or by lesser 
increases in the levels of all attributes. I define the former 
approach as "focused enhancements” and the latter as “gen- 
eral enhancements.” Focused enhancements tend to make 
the new product more dissimilar to the existing product. 

When product-market structure is represented spatially, 
product attributes define the dimensions of space. Con- 
sumers perceive two products that are closer together as 
more similar (Best 1976; Ghose 1998; Glazer and 
Nakamoto 1991). Figure 1 represents a simplified, two- 
dimensional product space defined by two equally impor- 
tant attributes (Attributes 1 and 2) and an existing product 
situated at O. The axes are attribute ratings. The existing 
product can be enhanced proportionally by improving the 
rating of Attribute 1 to Eg, or by improving the rating of 
Attribute 2 to Em. Anywhere along the segment that con- 
nects Ep, to Ep; represents the same proportional level of 
enhancement. It can likewise be achieved by enhancing 
both Attributes 1 and 2 to Eg. A general enhancement of 
both attributes, such as Eg, results in a closer distance from 
the existing product O and relatively greater similarity to O. 
When the enhancements are focused on either one or the 
other of the attributes, such as Ep, or Ep), a greater distance 
from O results and, thus, a relatively greater dissimilarity to 
O. 

The principle generalizes to products that are character- 
ized by more than two attributes. Enhancements that are 
focused on a select few attributes result in greater distances 
from and more dissimilarity to the existing product than do 
general enhancements on all attributes. 


Ha: When a person has paid for and already owns an existing 
product, an enhanced product that is characterized by 
focused enhancements 1n a select few attributes should be 
more desirable than a proportionally enhanced product 
that is characterized by more general enhancements across 
all attributes. 


FIGURE 1 
Enhancements to Maximize Distance 


Attribute 2 





Attribute 1 


In this research, I explore the addition of attributes and 
the improvement of existing attributes as the primary ways 
to enhance products, but I do not explore overall enhance- 
ments in which existing attributes are taken away. When 
any significant part of the existing product is deleted in an 
overall enhancement, the enhanced product would be an 
additional purchase rather than an upgrade. For example, 
when the microwave oven was introduced, it may have been 
an overall enhancement compared with the conventional 
oven, but many of the functionalities of a conventional oven 
were deleted. Furthermore, most people who owned con- 
ventional ovens did not upgrade from conventional to 
microwave ovens and discontinue using their conventional 
ovens. Rather, they typically bought the microwave ovens in 
addition to the conventional ovens. An upgrade ‘effectively 
terminates the consumer’s usage of the existing product. A 
series of four studies examine how the sunk cost in an exist- 
ing product can be mitigated to facilitate upgrade decisions. 


Study 1 


People are less willing to upgrade to an enhanced alterna- 
tive when there is a sunk cost in the existing alternative. 
Study 1 was designed to gain insight into whether the dis- 
similarity of the enhanced product mitigates the negative 
effect of the sunk cost and, in turn, facilitates upgrade deci- 
sions. Study 1 addresses the question, Is it easier to upgrade 
to enhanced alternatives that are dissimilar to the existing 
alternative than to proportionally desirable enhanced prod- 
ucts that are similar? 


Method 


Ninety-four undergraduate students participated in a paper- 
and-pencil study. Data were collected over two sessions, six 
weeks apart. There were a total of four items in this study: 
two moderate and two enhanced. The two moderate items 
were described as a ticket to a Seattle Mariners playoff 
game in Safeco Field in balcony seats behind left field and a 
ticket to a rock concert by their favorite band'in the Key 
Arena with end seats to the left of the stage in the top row. 
The two enhanced items were a ticket to a Seattle Mariners 
playoff game in Safeco Field in front-row seats behind the 
first-base line and a ticket to a rock concert by their favorite 
band in the Key Arena with front-row balcony seats and a 
backstage pass. Participants were randomly assigned to one 
of four different combinations of one moderate item and 
one enhanced item. 

In one session, participants were asked to indicate their 
willingness to pay (WTP) for their randomly assigned pair 
of one moderate item and one enhanced item. Participants’ 
WTP in this session measured the desirability of the alter- 
natives in the absence of an existing product that would be 
made redundant by the purchase. 

Six weeks later, participants were given scenarios in 
which they made plans to go to a moderate ‚event, after 
which an opportunity arose to go to a more desirable, 
enhanced event. They were asked how much they would be 
wiling to pay to go to the enhanced event. Participants 
were presented with the same combination of moderate and 
enhanced events in these scenarios as in the first session. 


The difference in the second session was that the moderate 
alternative was presented as the existing alternative, and the 
enhanced alternative was made available as a new alterna- 
tive after the existing plans were made. For half of the par- 
ticipants, the existing alternative was the moderate baseball 
ticket, and for the other half, it was the moderate concert 
ticket. In all cases, the enhanced event occurred at the same 
time as the existing event, so participants could go to either 
one or the other event but not to both. Furthermore, the 
ticket for the existing alternative was nonrefundable and 
nontransferable; therefore, purchasing the new enhanced 
alternative would render the existing ticket worthless. This 
created an upgrade situation. For half of the participants, 
the enhanced alternative was similar to the existing alterna- 
tive; that is, an enhanced baseball ticket became available 
when there was an existing moderate baseball ticket, or an 
enhanced concert ticket became available when there was 
an existing moderate concert ticket. For the other half, the 
enhanced alternative was dissimilar to the existing alterna- 
tive; that is, an enhanced concert ticket became available 
when there was an existing moderate baseball ticket, or an 
enhanced baseball ticket became available when there was 
an existing moderate concert ticket.! 

There were two levels of sunk cost to manipulate how 
much participants had paid for the existing ticket. Half of 
the participants had paid $15 for their existing moderate 
alternatives. The other half first paid $15 but then got an 
unexpected refund of the purchase price for winning a 
drawing, so they ended up with no money sunk into the 
ticket (see Soman and Cheema 2001). If participants were 
to infer the desirability of the moderate alternative on the 
basis of whether they had chosen to pay for it, the inferred 
value should be the same in both conditions. Their WTP for 
the enhanced event was the dependent measure. 

The study was a 2 (type of existing alternative: moder- 
ate baseball or moderate concert) x 2 (similar or dissimilar 
enhanced product) x 2 (sunk cost or no sunk cost) design. 
Participants’ WTP in the first session measured their valua- 
tion of the enhanced items in situations that more closely 
resembled new purchases. In the second session, partici- 
pants indicated their WTP for the same products but in an 
upgrade context. 


Results and Discusslon 


The data collected in the first session served as a manipula- 
tion check of the stimuli for the existing and enhanced alter- 
natives and confirmed that the average WTP for the moder- 
ate baseball and moderate concert tickets (M — $24.11 and 
$27.64, respectively) was lower than the average WTP for 
the enhanced baseball and enhanced concert tickets (M = 
$51.35 and $50.98, respectively). On average, the enhanced 
baseball and enhanced concert tickets were comparably 





1A manipulation check of the dissimilar and similar conditions 
was performed on 25 separate participants. All 25 participants 
indicated that the enhanced baseball ticket was more similar to the 
moderate baseball ticket than to the enhanced concert ticket. All 
25 participants also indicated that the enhanced concert ticket was 
more similar to the moderate concert ticket than to the enhanced 
baseball ticket. 
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desirable to each other and were more desirable than the 
existing moderate baseball and moderate concert ticket 
alternatives, which were also comparably desirable to each 
other. This suggests that the scenarios were fair representa- 
tions of upgrade situations in which people who already 
owned a product had an opportunity to acquire an enhanced 
alternative that would render the preexisting alternative 
redundant. I also used these data to calculate the marginal 
benefit of the enhanced product compared with the moder- 
ate product (the difference in the WTP of the former less the 
latter). Overall, the average marginal benefit of the 
enhanced product over the moderate was $25.30. 

The data collected in the second session suggest that 
participants were willing to pay $22.41 for the enhanced 
event when there was a preexisting moderate event for 
which they already had tickets. The results appear in Table 
1. 

When the enhanced product was similar to the existing 
product, participants were willing to pay significantly less 
(t= 3.4, p < .01) for the enhanced alternative when they had 
sunk costs in the existing product (M = $7.70) than when 
there was no sunk cost (M = $30.40). This is consistent with 
the literature on sunk costs. However, when the enhanced 
product was dissimilar to the existing product, there was no 
difference (t = .35, p = .72) between the WTP for the 
enhanced product when there were sunk costs (M = $23.64) 
and the WTP for the enhanced product when there was no 
sunk cost (M = $27.08). 

To manipulate the level of sunk cost and equalize any 
inferred desirability of the moderate alternative on the basis 
of whether participants had paid for it, the no-sunk-cost 
condition was presented as a situation in which respondents 
had first chosen to pay $15 but were unexpectedly refunded. 
This inevitably created a condition of a windfall gain in the 
no-sunk-cost condition. The literature suggests that people 
treat windfall gains differently from money they earn or 
already have (O'Curry and Strahilevitz 2001), but this does 
not offer an alternative explanation for the current findings, 
because it was in the sunk-cost group, in which there was 
no windfall gain, that the differences in WTP between the 
dissimilar and the similar enhancement conditions were 
significant. 

An analysis of covariance (Fs gg = 32.57, p « .001) of 
the WTP for the enhanced product, on the (dis)similarity of 
the enhanced alternative to the existing alternative, on 
whether there was a sunk cost in the existing alternative, on 
the interaction between the two aforementioned variables, 
on whether the existing alternative was a moderate baseball 
or concert ticket, and on the marginal benefit of the 


enhanced alternative compared with the existing alternative 
as a covariate, demonstrated that WTP for the enhanced 
alternative was significantly lower (F, gg = 4.16, p = .04) 
when there was a sunk cost in the existing alternative. This 
is consistent with the findings in previous research (Okada 
2001); that is, presumably, the sunk cost in the existing 
product impedes upgrade decisions. Furthermore, the nega- 
tive effect of the sunk cost on WTP for the enhanced alter- 
native was significantly mitigated (F gg = 8.38, p < .01) 
when the enhanced alternative was dissimilar to the existing 
product, in support of H,. The covariate was also signifi- 
cant; intuitively, respondents who perceived a higher mar- 
ginal benefit (F; gg = 139.86, p < .001) in the enhanced 
product were willing to pay a higher price. In terms of the 
baseball and concert events being well matched in desirabil- 
ity, there was no difference (Fj, gg = .80, p = .37) whether 
the existing alternative was the moderate baseball or concert 
ticket. 

When people already own an existing alternative, and 
an opportunity arises to upgrade to a superior alternative, 
they are willing to pay more for the new, enhanced alterna- 
tive if it is dissimilar to the existing alternative. If the new 
alternative is for the same event but in better seating, the 
WTP is significantly lower, presumably because the sunk 
cost in the existing alternative hinders the upgrade decision. 
This is consistent with previous research on the sunk-cost 
effect (Arkes 1996; Arkes and Blumer 1985; Heath 1995) 
and, more specifically, on product upgrade decisions 
(Okada 2001). Conversely, if the new alternative is dissimi- 
lar to the existing alternative, WTP is significantly higher 
and comparable to the marginal benefit, presumably 
because the sunk cost in the existing alternative becomes 
less relevant in considering the transaction. When the new 
alternative is dissimilar, it becomes easier to disassociate 
the write-off of the sunk cost, and the upgrade decision 
becomes essentially like a new purchase. 


Study 2 


The results of Study 1 support H, and demonstrate that 
people are willing to pay more to upgrade when the 
enhanced alternative is dissimilar to the existing alternative. 
The theoretical explanation is based on the differential 
effects of the sunk cost; that is, the write-off of the sunk 
cost becomes more disassociated with the upgrade decision 
when the enhanced alternative is more dissimilar to the 
existing product, which in turn facilitates upgrade deci- 
sions. Study 2 was designed to gain insight into the process 
and to measure the extent to which sunk costs influence 


TABLE 1 
Mean WTP for Similar Versus Dissimilar Upgrades In Conditions of Sunk Cost and No Sunk Cost 


Dissimilar Enhanced Product 


Baseball Concert 
No sunk cost $26.15 $28.18 
Sunk cost $25.00 $22.27 
Total $25.43 


96 / Journal of Marketing, October 2006 


Similar Enhanced Product 


Baseball Concert Total 
$31.15 $29.58 $28.78 
$ 7.58 $ 7.82 $15.49 

$19.52 $22.41 


upgrade decisions differentially when the enhanced product 
is dissimilar versus similar. 


Method 


One hundred forty-seven undergraduate students partici- 
pated in this study, in which upgrade scenarios were pre- 
sented using the same product stimuli as those in Study 1. 
Half of the participants had purchased moderate baseball 
tickets, and the other half had purchased moderate concert 
tickets as the original plan. Then an enhanced alternative 
became available after the existing plans were: made. As 
before, both the enhanced and the moderate alternatives 
occurred on the same day, so participants could go to one or 
the other event but not to both. Furthermore, the ticket for 
the existing alternative was nonrefundable and nontransfer- 
able. The moderate and enhanced alternatives were 
described in the same way as in Study 1. 

Whereas in Study 1, the sunk cost was manipulated, in 
this study, the sunk cost was $15 for all participants. The 
objective was to measure the differential effect of the same 
level of sunk cost on the upgrade decision as a function of 
the (dis)similarity of the enhanced product to the existing 
one. 
As in Study 1, for half of the participants, the enhanced 
alternative was similar to the existing one; that is, they 
upgraded from the moderate baseball ticket to the enhanced 
baseball ticket or from the moderate concert ticket to the 
enhanced concert ticket. For the other half, the enhanced 
alternative was dissimilar; that is, they upgraded from the 
moderate baseball ticket to the enhanced concert ticket or 
from the moderate concert ticket to the enhanced baseball 
ticket. 

In contrast to Study 1, which measured WTP, Study 2 
offered the enhanced product for $20, and the dependent 
measure was whether participants chose to upgrade. This 
price was only $5 more than the $15 they paid for the mod- 
erate alternative, but the scenario was that a student agency 
at school was offering the enhanced tickets at a special stu- 
dent price, and the offer price did not directly reflect the 
market price. Participants also indicated the extent to which 
they would consider the $15 they had spent on the existing 
alternative in their decisions using a seven-point scale (0 = 
“the $15 would not be a consideration at all,” and 6 = “the 
$15 would be an extremely significant consideration”). The 
prediction is that the $15 would be a lesser consideration 
when upgrading to a dissimilar enhanced product, which in 
turn should facilitate the upgrade decision. 


Results and Discussion 


Overall, 87 of the 147 participants (59.2%) indicated that 
they would upgrade to the enhanced alternative. As in Study 
1, whether the existing ticket was the moderate baseball or 
the moderate concert ticket made no difference,(Z = .14) in 
the percentage of those who upgraded, so I aggregated the 
data across the two existing alternative conditions for the 
analysis and compared upgrade scenarios in which the 
enhanced alternative was dissimilar versus similar to the 
existing alternative. 


Of the 77 participants in the dissimilar upgrade condi- 
tion, 54 (70.1%) indicated that they would upgrade to the 
enhanced alternative for $20. This was significantly greater 
(Z = 2.83) than the 33 of 70 (47.1%) in the similar upgrade 
condition who indicated that they would upgrade. Further- 
more, the dissimilar upgrade group (M = 2.1) considered 
the sunk cost in the existing ticket to a significantly lesser 
extent (t = —5.77, p « .001) than the similar upgrade group 
(M = 3.6). To measure the role of sunk cost, I asked partici- 
pants directly about the extent to which they considered the 
$15; this method may arguably make the issue more salient. 
However, the sunk-cost measure was taken after the 
upgrade decision. Furthermore, the data indicate a greater 
extent of consideration of sunk cost in the similar than in 
the dissimilar group, but there is no theoretical reason that 
any collateral increase in salience of sunk cost should be 
different between the two groups. 

I conducted a mediation analysis (Baron and Kenny 
1986) to test the theory that when the enhanced product is 
dissimilar to the existing product, the sunk cost in the exist- 
ing product will be considered to a lesser extent, which in 
turn will increase the likelihood of upgrading, as illustrated 
in Figure 2. First, the dissimilarity of the enhanced product 
significantly decreased the extent to which people consid- 
ered sunk costs in their decisions (t = —5.77, p < .001; B = 
—1.57). Second, by a logistic regression, dissimilarity also 
significantly increased the likelihood of upgrading (p < 
.01). The Wald statistic was 7.85, and the effect on the odds 
ratio (exp[B]) was 2.63. Numbers greater than 1 indicate a 
positive effect on the odds ratio. Third, when the likelihood 
of upgrading was regressed against both the dissimilarity of 
the enhanced product and the consideration of sunk costs, 
again by logistic regression, only the sunk cost was signifi- 
cant (p « .001), and dissimilarity became not significant 
(p = .62). The Wald statistics were 34.19 and .25, respec- 
tively, and the respective effects on the odds ratio (exp[B]) 
were .39 and .80. Numbers less than 1 indicate a negative 
effect on the odds ratio, and this showed that upgrading 
became less likely when the sunk cost was considered to a 
greater extent. 

One limitation that the design may impose is that 
because the price of the enhanced alternative was only $5 
more than what participants had paid for the existing mod- 
erate alternative, they may have made inferences about the 
quality of the enhanced product, which would affect the 
likelihood of upgrading. The same results could be obtained 
if such negative-quality inferences are made to a greater 
extent for similar enhanced products than for dissimilar 
ones. However, participants were given descriptions of the 
moderate and enhanced alternatives in the same way as in 
Study 1; they were not just given the price. Furthermore, the 
enhanced alternatives were made available through a stu- 
dent agency at school at a special student price, and the 
offered price did not directly reflect the market price. Con- 
sistent with the proposed theory, the mediation analysis in 
Study 2 showed that participants considered the sunk cost in 
the existing product to a lesser extent when the enhanced 
product was dissimilar, which in turn facilitated the upgrade 
decision. 
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Study 3 


According to theories of similarity (Tversky 1979) and 
product comparability (Johnson 1984), adding new features 
should make the new product more dissimilar to the original 
product than proportionally improving existing features. 
Study 3 examines whether people are more likely to 
upgrade to a new product that is characterized by non- 
alignable enhancements or to upgrade to a new product that 
is characterized by proportional alignable enhancements. 

Another objective of Study 3 is to determine whether 
the findings of Study 1 are more broadly generalizable. 
Because the concept of upgrading is relevant for durable- 
type products as well, Study 3 examined mobile pbones. In 
addition, in Study 3, I collected data from consumers who 
actually purchased mobile phones, so the existing product 
in Study 3 is actual, not hypothetical. 


Method 


The 179 participants of this study were recruited at a local 
shopping center. The product stimulus was a Sony Ericsson 
T68i mobile phone. Participants were shown an actual 
model of the phone and were given a brief written descrip- 
tion of the phone. Then, they completed a brief paper-and- 
pencil questionnaire and indicated how much they would be 
willing to pay for the new Sony Ericsson mobile phone. To 
make purchase scenarios viable and realistic and to control 
for knowledge and experience resulting from product usage, 
only the people who currently owned mobile phones were 
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recruited. People participated on a volunteer basis, so the 
surveys were kept as succinct as possible. 

The survey first asked how much they would be willing 
to pay for the new phone under one of two conditions: the 
upgrade or the new purchase condition. In the new purchase 
condition, participants were asked to imagine that they lost 
their existing phones. This created a situation in which there 
would be no existing phone to become obsolete as a result 
of the new purchase, and thus the phone would resemble a 
new purchase. The upgrade condition made no qualifica- 
tions on participants' existing phones, and they were simply 
asked how much they would be willing to pay for the new 
phone. Because participants all had mobile phones, their 
existing phones would presumably become redundant if 
they purchased the new phone. In both the new purchase 
and the upgrade conditions, participants were asked to 
assume that there were no additional costs, such as termina- 
tion or activation fees, and that the monthly charges would 
remain equivalent for an equivalent level of usage. 

In the second part of the survey, participants compared 
the new phone with the mobile phones they actually owned 
in terms of the alignable and nonalignable attributes. The 
survey briefly defined alignable features as those that the 
two phones shared in common and nonalignable features as 
those that were unique to one phone. Participants indicated 
on a seven-point scale (—3 = “strongly disagree,” and +3 = 
“strongly agree”) whether the new phone was superior to 
their own phones in the alignable features and nonalignable 


features, respectively. The order between the two questions 
was randomized. In essence, Study 3 measured responses to 
hypothetical situations but from consumers considering 
their actual purchase history. 


Results and Discussion 


Of the 179 people who participated in the survey, 90 were 
in the new purchase condition, and 89 were in the upgrade 
condition. Respondents were willing to pay $65. 12 on aver- 
age for the new phone, and WTP was higher (t = 2.31, p = 
.02) for the new purchase group (M = $75.98) than for the 
upgrade group (M = $54.15), which is intuitive because the 
people in the new purchase condition effectively had no 
working phone, and thus the new phone’s marginal benefit 
would presumably be greater. 

On average, participants perceived the new phone as 
superior in its alignable features (M = .55; t = 4.74, p < .01) 
and its nonalignable features (M = 1.07; t = 7.13, p < .01) 
compared with the phones that they owned. Therefore, the 
new phone was a reasonably enhanced phone. In terms of 
the random assignment of the participants to the new pur- 
chase and upgrade conditions, there were no differences 
between the two groups in the perceived levels of the new 
phone's alignable enhancements (M = .62 versus 48; t = 
.60, p = .55) or nonalignable enhancements (M = 1.11 ver- 
sus 1.03; t = .26, p = .80). 

H, predicted that in terms of upgrades, nanilignaile 
enhancements would have a greater positive impact on pur- 
chase decisions than proportional alignable enhancements. 
Therefore, nonalignable enhancements should have a rela- 
tively greater positive effect on WTP in the upgrade condi- 
tion, and the perceived levels of alignable enhancements 
should have a relatively greater positive effect on WTP in 
the new purchase condition. 

I ran a regression to analyze the differential effects of 
alignable versus nonalignable enhancements in the upgrade 
versus new purchase conditions. The results appear in Table 
2. First, I mean-centered the alignable and nonalignable 
enhancement measures by converting them to Z-scores, 
arriving at the alignable and nonalignable Z-scores. Second, 
to control for the multicollinearity between the alignable 
and the nonalignable measures (p — .61), I regressed the 
nonalignable Z-scores against the alignable Z-scores and 
calculated the residual of the nonalignable Z-scores after 
subtracting the component that is predicted by alignable Z- 


scores to arrive at the residual nonalignable Z-scores. The 
main regression analysis used these calculated variables. 

The dependent variable was the WTP for the enhanced 
phone, and the explanatory variables were the alignable Z- 
score, the residual nonalignable Z-score, whether the phone 
was considered a new purchase or an upgrade, the interac- 
tion between the alignable Z-score and whether it was a 
new purchase or an upgrade, and the interaction between 
the residual nonalignable Z-score and whether it was a new 
purchase or an upgrade. The model had an R-square of .24. 
People were willing to pay less when the new phone was an 
upgrade to replace an existing phone than when it was 
essentially a new purchase (t = —2.50, p = .01). This is intui- 
tive because the marginal benefit would be greater for those 
who do not have functional phones. People were willing to 
pay more for the new phone when they perceived it as supe- 
rior to their current phones on the alignable attributes (t — 
3.52, p = .001). Because there was a relatively high correla- 
tion between the alignable and the nonalignable enbance- 
ment measures, this significant effect captured the increased 
WTP due to nonalignable enhancements as well. So the 
residual nonalignable Z-score did not have a significant 
effect on WTP (t = —1.22, p = .22). However, in the upgrade 
condition, people were willing to pay more for nonalignable 
enhancements (t = 4.20, p « .001) but only marginally more 
for alignable enhancements (t = —1.95, p = .05). Even by a 
conservative test that measured the level of nonalignable 
enhancements only as a residual after subtracting the com- 
ponent that correlated with alignable enhancements, the 
analysis still demonstrated a differentially positive effect of 
nonalignable enhancements on WTP in upgrading. 

Thus, Study 3 suggests that in upgrading, nonalignable 
enhancements are more desirable than proportional 
alignable enhancements. Presumably, the addition of new 
attributes makes the enhanced product more dissimilar, and 
the sunk cost in the existing product becomes less salient. 


Study 4 


Study 4 was designed to test the prediction that focused 
enhancements of a select few of the existing attributes 
would be more attractive to upgraders than proportional 
general enhancements of all existing attributes. 


TABLE 2 
Regression of WTP for New Mobile Phone 


Variables p SE t p VIF 
I 
Constant™ 75.04 5.96 12.59 < .001 — 

Alignable Z-score** 19.07 5.42 3.52 = .001 1.64 
Residual nonalignable Z-score —12.09 9.91 —1.22 = .22 3.43 
Upgrade or new purchase* —21.1 8.45 —2.50 = .01 1.00 
Upgrade x alignable Z-score —16.98 8.70 —1.95 = .05 1.64 
Upgrade x residual nonalignable Z-score** 49.44 11.78 4.20 < .001 3.43 


*Significant at « = .05. 
“Significant at a = .01 
Notes: VIF = variance inflation factor 
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Method 


Data were collected at a dental clinic from patients who 
came in for their routine, semiannual checkups. They par- 
ticipated by completing short anonymous surveys on a vol- 
unteer basis. Patients who were scheduled for more invasive 
dental procedures were not included in the study. On arriv- 
ing at the clinic for their routine checkups, patients were 
asked by the receptionist if they would be willing to fill out 
a short survey while they waited. The patients who volun- 
teered to participate completed their surveys and returned 
them anonymously into a box. 

The survey presented scenarios in which participants 
were going on a vacation over an upcoming long weekend 
and had made arrangements to stay at a moderately rated 
bed-and-breakfast. Then, a few days later, they found two 
other alternatives for accommodation, both more desirable 
than the one at which they made reservations and both 
priced at $80. They were asked to indicate which of the two 
more desirable accommodations they would choose or to 
indicate “neither” if they wanted to stay at the moderate 
bed-and-breakfast they found first. This was the dependent 
measure. 

The moderate lodging and the two more desirable hotels 
were all rated on five different dimensions and were given 
one overall rating. The rating was on a five-point scale, 
anchored by “fair” (1) and “excellent” (5). Those compari- 
sons appear in Table 3. The surveys presented the moderate 
lodging information first on the front page. The two more 
desirable alternatives with higher ratings were presented on 
the next page side by side. With one enhanced hotel, the 
enhancements were focused on fewer features, and with the 
other, the enhancements were more general. 

There were two different levels of commitment to the 
moderate bed-and-breakfast. In the sunk-cost condition, 
participants paid a nonrefundable $25 deposit, which would 
“go to waste" if they chose to stay elsewhere. In the no- 
sunk-cost condition, they made a deposit, but the deposit 
was fully refundable. The prediction is that in the sunk-cost 
condition, which more closely resembles an upgrade deci- 
sion, the focused enhancements would be relatively more 
attractive, and in the no-sunk cost condition, which more 
closely resembles a new purchase, the general enhance- 
ments would be relatively more attractive. 

A separate manipulation check on a group of 150 par- 
ticipants measured the relative desirability of the two 
enhanced alternatives and their relative similarity to the 


moderate alternative. In a side-by-side choice between the 
two enhanced alternatives, there was no significant differ- 
ence (Z = 1.30, p = .10) between the percentages of those 
who chose the focused enhancement as being more desir- 
able (n = 83, 55.3%) and those who chose the general 
enhancement as being more desirable (n = 67, 44.7%). This 
suggested that the two enhanced alternatives were compara- 
bly desirable and helped rule out an alternative sunk-cost- 
based explanation, namely, that the prospect of having to 
write off sunk costs creates negative emotions, which in 
turn compromises the level of scrutiny in the decision mak- 
ing, and the choice ends up different and less optimal than if 
there was no sunk cost and there were no negative emo- 
tions. However, the results suggest that there was no clearly 
“optimal” choice from which participants would deviate 
with a lesser level of scrutiny due to any negative emotion. 

In the second part of the manipulation check, the mod- 
erate alternative was introduced as a comparison, and par- 
ticipants were asked which of the two enhanced alternatives 
was more similar to the moderate alternative. Significantly, 
more people (Z = 7.02, p < .001) perceived the general 
enhancement (n = 118, 78.7%) than the focused enhance- 
ment (n = 32, 21.3%) as more similar to the moderate alter- 
native. This confirmed the manipulation in the main study 
and was consistent with the premise for the hypothesis that 
the enhanced product would be perceived as more dissimi- 
lar to the moderate product with focused enhancements and 
more similar with general enhancements. 


Results and Discussion 


One hundred sixty-eight people participated in the main 
study over a four-week period. Overall, 109 (64.9%) chose 
the more generally enhanced alternative, and the other 59 
(35.1%) chose the more desirable, focused-enhancement 
alternative. No one chose “neither.” This indicated an over- 
all preference for the generally enhanced alternative (Z = 
3.86, p < .001). Although these results did not replicate the 
manipulation check results, which suggested no significant 
difference in desirability between the two alternatives, pref- 
erences were measured in a different context in the main 
study. In the manipulation check, just the two enhanced 
alternatives were presented and compared, without the 
moderate alternative. In the main study, the moderate prod- 
uct was presented first, and the two enhanced products were 
presented next, so they were presumably each evaluated 
relative to the moderate alternative. On the basis of decreas- 








TABLE 3 
A Comparison of the Moderate and Enhanced Alternatives in Study 3 

Moderate General Focused 

Alternative Enhancement Enhancement 
Location and access 2 3 5 
Comfort and cleanliness of room 2 3 5 
Friendliness of staff 2 3 2 
Quality of food 2 3 2 
Amenities 2 3 2 
Overall rating Moderate Good Good 

2 3 3 
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ing marginal sensitivity in gains (Kahneman and Tversky 
1979), smaller incremental gains across more attributes, 
which characterize the generally enhanced alternative, 
would have a greater positive impact on value than larger 
incremental gains across fewer attributes, which character- 
ize the focused enhancement. 

Eighty-four participants were in the no-sunk-cost group, 
and 84 participants were in the sunk-cost group. The results 
appear in Figure 3. In the no-sunk-cost group, 70 (83.3%) 
participants chose the more generally enhanced alternative, 
showing a strong preference (Z = 6.11, p < .001) for the 
general enhancement. In the sunk-cost group, the direction 
reversed, and 45 (53.6%) participants chose the more 
focused enhancement. The absolute preference for one 
alternative over the other may be driven by differences in 
the importance of the attributes. However, the objective of 
Study 3 was to determine whether there are any differences 
between upgraders and new purchasers in their preferences 
for focused versus general enhancements. Thus, I compared 
the preference for the focused enhancement with the prefer- 
ence for the general enhancement between the two groups. 
Consistent with H3, the relative preference for the focused 
enhancement was greater in the sunk-cost group than in the 
no-sunk-cost group (Z = 5.14, p < .001). 

The results of this study suggest that different types of 
product enhancements appeal to upgraders versus new pur- 
chasers. When enhancing an existing product on the basis 
of the existing attributes, focusing the enhancements on a 
few key attributes tends to be relatively more attractive to 
upgraders than to new purchasers. Presumably, focused 
enhancements result in products that are perceived as more 
dissimilar to the original product than proportional general 
enhancements, which makes it easier for upgraders to disas- 
sociate the write-off of the original product from the 
upgrade purchase decision. 


FIGURE 3 
Focused Versus General Enhancements With and 
Without Sunk Costs 
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General Discussion 


It is helpful for marketers to understand how upgrade deci- 
sions differ from new purchases because many new prod- 
ucts they introduce to the market target both new buyers 
and upgraders. Previous research has demonstrated that 
upgrade decisions are hindered by the mental costs associ- 
ated with putting a still-functioning product out of commis- 
sion, and consumers may forgo upgrades even when the 
marginal cost-benefit comparison is favorable. The current 
research suggests that depending on how the new product is 
positioned relative to the existing one, the sunk-cost effect 
can be mitigated, and the upgrade decision can be facili- 
tated. Consumers have an easier time ignoring the sunk 
costs when upgrading to new products that are dissimilar to 
the existing product. Marketers can position their new prod- 
ucts to be dissimilar by adding new features rather than 
improving existing ones and by focusing the enhancements 
on a few key attributes. 

This research examined product enhancements by 
adding new attributes, but product enhancements can also 
combine both the addition of some new features and the 
deletion of some existing ones. Adding two new attributes 
and deleting one may make the enhanced product more dis- 
similar to the existing product than just adding one new 
attribute, assuming equally important attributes. According 
to the dissimilarity argument proposed here, the former may 
be relatively more desirable to the upgrader. However, loss 
aversion predicts that upgraders would respond unfavorably 
to deleted attributes. This research did not examine the 
interaction between adding new attributes and deleting 
existing ones. This is a direction for a future extension. 

I considered upgrades in one-time consumable services 
(events, hotel accommodations) and durable-type products 
(phones). However, I did not compare one-time consumable 
products with repeat-usage products, and I did not consider 
any differential negative impact of sunk costs in upgrading. 
The prediction would be that the sunk cost may be a greater 
impediment for upgrading in durable-type products because 
of the repeat nature of the expected usage. With durable- 
type products, preferences for different types of upgrades 
may also change with the passage of time. For example, the 
desirability of new attributes may strengthen as consumers 
satiate with the existing attributes over time. Alternatively, 
it may weaken over time as consumers adapt to the existing 
attributes. These are also areas for further research. 

The basic concept that consumers forgo purchasing a 
superior alternative to save previous expenditures from 
“going to waste” can also apply to products offered with 
loyalty programs. Suppose that only connecting flights on 
an existing airline were available out of a traveler’s home- 
town airport. If a new airline enters the market and offers 
direct nonstop flight service, it would become the enhanced 
alternative. However, if travelers have accrued points with 
the existing airline that remain unused, these points, which 
they “paid for” by flying the connecting flights, would have 
to be "written off" if the travelers switched to the enhanced 
service. In addition to any economic reason for continuing 
to fly connecting flights, the mental cost of upgrading may 
also hinder frequent flyers from upgrading to the more con- 
venient service. 
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Having spent money on an existing product that is still 
functional, upgraders tend to prefer different types of 
enhanced products compared with new purchasers. It is not 
just the differential knowledge and experience from owner- 
ship and usage that drive this pattern. Simply refunding the 
money spent (Studies 1 and 4) or rendering the existing 


product not functional (Study 3) changed people’s prefer- 
ences for the enhanced product. There is a mental cost to 
upgrading, and this research explores how the positioning 
of the enhanced product relative to the existing one can help 
mitigate that cost and facilitate upgrade decisions. 


NN SS 
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Influence Tactics for Effective 
Adaptive Selling 


The adaptive selling literature identifies effective salespeople as those who match their influence tactics to suit the 
characteristics of buyers. However, prior research is largely silent on the specific influence tactics that salespeople 
use and the effectiveness of these tactics across different types of buyers. The authors propose a theoretical model 
that uses Kelman’s (1961) underlying influence processes of internalization, compliance, and identification to 
identify the seller influence tactics that salespeople use and to assess which of these tactics will resonate with three 
types of buyers: task-oriented buyers, Interaction-oriented buyers, and self-oriented buyers. The authors test their 
model with data from 193 bidirectionally matched buyer-seller dyads. The results strongly support the theoretical 
model and suggest that buyers are more complex than originally presumed. However, salespeople seem to 
recogníze this complexity and use the combination of influence tactics prescribed by theory for persuading these 


types of buyers. 


are that (1) influence tactics are the mechanism 

through which salespeople persuade buyers in inter- 
actions (e.g., Brown 1990; Spiro and Perreault 1979) and 
(2) adapting sales tactics to different buyers is important for 
superior performance (e.g., Spiro and Weitz 1990; Sujan 
1986; Weitz 1981). Notably, research at the intersection of 
these two notions is limited. As a consequence, little guid- 
ance is provided to salespeople regarding what influence 
tactics they should use with different types of buyers. This 
study sheds light on this important issue by proposing a 
theoretically grounded framework for investigating the 
effectiveness of sales influence tactics across groups of buy- 
ers who are dissimilar in their characteristics. ' 

Frazier and Summers (1984) introduced the notion of 
influence strategies to the marketing literature. These influ- 
ence strategies, or influence tactics, were developed for 
interorganizational, firm-to-firm channels contexts. Frazier 
and Summers (1984, p. 44) recognize the context for which 
their tactics were developed and specifically note that their 
tactics provide "researchers with an extended perspective 
on the influence process within channels of distribution. Of 
particular interest are boundary personnel, whose responsi- 
bilities lie in the areas of customer service and territorial 
management, as opposed to sales personnel." Whether Fra- 
zier and Summers's (1984) tactics are equally applicable in 
personal selling contexts remains in question. Thus, an 
objective of this study is to extend the work of these schol- 
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ars and others (e.g., Boyle et al. 1992; Payan and McFar- 
land 2005; Venkatesh, Kohli, and Zaltman 1995) and iden- 
tify tactics that are especially pertinent in sales settings. We 
refer to these as "seller influence tactics" or, more simply, 
SITs. 

Although identification of SITs is important, the adap- 
tive selling literature suggests that it is insufficient unless it 
can be pinpointed which tactics should be used with differ- 
ent buyers. Sheth (1976) differentiates buyers on the basis 
of their “task orientation,” “self orientation,” and “‘interac- 
tion orientation.”! Williams and Spiro (1985) validated this 
classification in sales settings; however, the influence tac- 
tics that will resonate with these three buyer orientations is 
unknown. Accordingly, a primary objective of the current 
study is to develop a theoretical framework for identifying 
the influence tactics that “work” with buyers of each of 
these three orientations. In addition to this theory develop- 
ment and its empirical testing, our study extends Sheth’s 
work by identifying clusters of buyers on the basis of their 
orientations, and it examines the combinations of SITs that 
are used within each cluster.2 

Our theoretical model argues that the three influence 
mechanisms that Kelman (1961) proposes—“internaliza- 
tion,” "compliance," and “identification”—serve as the 
bridge between influence tactics and buyer orientations. We 
argue that influence tactics operate through one of these 
three underlying influence mechanisms. That is, some influ- 
ence tactics operate through the mechanism of internaliza- 
tion, and others operate through compliance or identifica- 
tion. Furthermore, we argue that buyers with task, 


interaction, and self orientations respond differentially to 


lSheth (1976) categorizes buyers in terms of task-oriented, 
interaction-oriented, and self-oriented communication styles. On 
the basis of the insightful comment of a reviewer, we refer to these 
communication styles as “orientations.” 

7We thank a reviewer for suggesting that we cluster buyers on 
the basis of their orientation scores and identify the combinations 
of influence tactics that are effective for each cluster. 
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these three influence mechanisms. Thus, Kelman’s under- 
lying processes provide the key for unlocking the specific 
influence mechanisms that buyers will respond to and for 
identifying the influence tactics that operate through these 
mechanisms. 

We use matched buyer-seller dyads to test the hypothe- 
ses. The use of dyadic data is consistent with the recom- 
mendations of several researchers, who state that influence 
tactics must be examined from the perspective of both par- 
ties, not just one (e.g., Achrol, Reve, and Stern 1983; Kim 
2000). In addition, our study design reduces the chances of 
capitalizing on common method biases by having sales- 
people report their perceptions of a specific buyer’s orienta- 
tion and having that buyer report his or her perception of 
the SITs used by the same salesperson as well as the effec- 
tiveness of these SITs (Netemeyer et al. 1997; Ross, Ander- 
son, and Weitz 1997). 

We organize the article as follows: We first discuss the 
identification of the influence tactics in sales settings. Next, 
we link Kelman’s (1961) underlying influence processes to 
these influence tactics. We then develop the hypotheses. We 
describe the data collection and analytic procedures, present 
our results, and then conclude. 


Influence Tactics 


Influence tactics have been examined at the interorganiza- 
tional, firm-to-firm level of analysis in the marketing chan- 
nels literature and at the intraorganizational, person-to- 
person level of analysis in the organizational behavior and 
buying-center literature. The personal selling context falls 
into a third category that can be described as an interorgani- 
zational, person-to-person relationship in which the unit of 
analysis is the individual relationship. Consequently, the 
influence strategies identified in the channels, organiza- 
tional behavior, and buying-center literature need to be 
assessed for their applicability in sales settings. 

Frazier and Summers (1984) define influence strategies 
as compliance-gaining tactics that channel members use to 
achieve desired actions from channel partners. Influence 
strategies “involve the alternative means of communication 
available to a firm’s personnel in their influence attempts” 
(Frazier and Rody 1991, p. 52). Frazier and Summers 
(1984) propose six tactics: information exchange, recom- 
mendations, requests, threats, promises, and legalistic pleas 
(for brief definitions of the influence strategies from the 
marketing channels literature, see Table 1). These six tactics 
have been used extensively in the channels literature (e.g., 
Boyle and Dwyer 1995; Frazier and Summers 1986; Kim 
2000). The same six tactics have also been examined in 
intrafirm contexts (e.g., Venkatesh, Kohli, and Zaltman 
1995). 

The literature identifies additional tactics to those that 
Frazier and Summers (1986) discuss that are likely to be 
especially pertinent in personal selling. These tactics focus 
on emotional utilities, which sales researchers have long 
acknowledged as having an important role in influencing 
buyers (e.g., Brown, Cron, and Slocum 1997; Dixon, Spiro, 
and Jamil 2001; Evans 1963; Webster 1968; Weitz 1981). 
For example, several studies emphasize the importance of 
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TABLE 1 

Marketing Channels Influence Strategy Definitions 
Influence 
Strategy Traditional Channels Definition 
Information The source discusses general Issues 

exchange and procedures to try to alter the 

target's general perceptions without 
stating a request. 

Recommenda- The source predicts that the target will 


tions be more profitable if the target follows 
the source’s suggestions. 


Requests The source simply states the actions tt 


would like the target to take. 


Threats The source threatens the target with a 


future penalty if the target does not 
comply with a request. 


Promises The source promises the target a 


reward if the target complies with a 
request. 


Legalistic pleas The source cites a legallstic, 


contractual, or informal agreement that 
requires the target to perform a certain 
action. 


Notes: We drew these definitions from the work of Payan and 
McFarland (2005). We describe the definitions for the SITs In 
the text. 


ingratiation in sales contexts (e.g., Spiro and Perreault 
1979; Strutton, Pelton, and Tanner 1996). Ingratiation 
focuses on influencing buyers by getting them to like the 
salesperson. Likewise, Yukl and Tracey (1992) note the 
importance of appealing to people's emotions and ideals 
through the use of inspirational appeals. Ingratiation and 
inspirational appeals are focused primarily on satisfying the 
psychological needs of the target. Salespeople's use of 
ingratiation and inspirational appeals is consistent with lit- 
erature that notes the importance of attractiveness (e.g., 
Ahearne, Gruen, and Jarvis 1999), similarity (e.g., Crosby, 
Evans, and Cowles 1990), and liking (e.g., Doney and Can- 
non 1997) in professional selling. 


Identifying and Defining SITs 


To pinpoint which of the eight influence tactics identified 
previously are appropriate in personal selling, we first con- 
ducted ten in-depth personal interviews with professional 
buyers and industrial salespeople. The purpose of these 
interviews was to allow participants to describe commonly 
used influence tactics, determine whether other influence 
tactics should be included in the SITs framework, help with 
item generation, and determine whether any of the scales 
we identified from prior research could be adapted to the 
personal selling context. In these interviews, we asked par- 
ticipants open-ended questions (e.g., “What type of tactics 
do professional salespeople use to influence buyers?") and 
specific questions about whether salespeople use a specific 
tactic, such as promises. We also sought feedback on mea- 
surement items from prior research. 


We drew two conclusions from these interviews. First, 
four of the six influence tactics that Frazier and Summers 
(1984) define in the marketing channels literature'formed a 
solid basis in the formation of SITs. Interviewees noted 
using information exchange, recommendations, threats, and 
promises; they regarded two tactics as either implicit 
(requests) or simply not used (legalistic pleas). Second, 
buyers and salespeople identified ingratiation and inspira- 
tional appeals as common in personal selling. 

On the basis of these findings and further input from 
domain experts, we conducted a pretest with 17 executive 
MBA students involved in the purchasing function in their 
organizations. The purpose of this pretest was to confirm 
which tactics we should include and to assist with item 
purification. We provided participants with scales for the six 
influence tactics identified in the channels literature 
(including requests and legalistic pleas) and for ingratiation 
and inspirational appeals. Participants confirmed that the 
items for requests were confusing in the personal selling 
context because requests are implicit in sales interactions. 
They also confirmed that salespeople seldom, if ever, use 
legalistic pleas. Finally, participants confirmed that ingratia- 
tion and inspiration appeals are common sales tactics. Next, 
we define each of the six SITs. 


Information exchange. Information exchange involves 
the communication of information, including asking ques- 
tions, without making any specific recommendations to 
influence positively the buyer’s general perceptions about 
the benefits of the salesperson’s offerings. “The source 
firm’s boundary personnel use discussions of general busi- 
ness issues ... to alter the target's general perceptions;... 
however, no specific target action is requested" (Frazier and 
Summers 1984, p. 45). 


Recommendations. Recommendations are arguments 
used to convince a customer that products or services pur- 
chased from the salesperson would be beneficial to the cus- 
tomer's firm (Fitzsimons and Lehmann 2004; Frazier and 
summers 1984; Rackham 1988). Venkatesh, ‘Kohli, and 
Zaltman (1995, p. 72) define recommendations as follows: 
"[T]he source [of influence] suggests that following a spe- 
cific course of action is likely to be beneficial.” 


Threats. Threats are implied or stated negative sanctions 
that the salesperson asserts will be applied to the buyer's 
firm if the buyer does not comply with the seller's request 
(Boyle and Dwyer 1995; Boyle et al. 1992). "The source 
communicates to the target that it will apply negative sanc- 
tions should the target fail to” comply with the source's 
request (Frazier and Summers 1984, p. 46). 


Promises. With promises, the seller “pledges to provide 
the target with a specific reward, contingent on the target’s 
compliance with the source’s” request (Frazier and Sum- 
mers 1984, p. 46). Promises are pledges of future rewards 
for the buyer’s firm. Venkatesh, Kohli, and Zaltman (1995, 
p. 73) state that “the deprivation of rewards may be consid- 
ered to be equivalent to the imposition of sanctions.” Thus, 
the influence tactic of promises is similar to that of threats, 
though it may be more tolerable than threats because the 
nonmediation of rewards leaves the buyer at the status quo, 


whereas the mediation of sanctions leaves the buyer worse 
off (Venkatesh, Kohli, and Zaltman 1995). 


Ingratiation. Ingratiation involves the use of “behaviors 
that are designed to enhance one’s interpersonal attractive- 
ness” and improve rapport with the target of influence 
(Kumar and Beyerlein 1991, p. 619). These actions consist 
primarily of praising a customer for his or her achievements 
(other enhancement) and expressing attitude similarity 
(Kipnis and Schmidt 1988; Kipnis, Schmidt, and Wilkinson 
1980). 


Inspirational appeal. An inspirational appeal is “a 
request or proposal that arouses enthusiasm by appealing to 
a target’s [higher-order] values, ideals, and aspirations” 
(Yukl and Tracey 1992, p. 640). Inspirational appeals focus 
on arousing a positive affective response in the target, 
which serves to motivate the target toward a desired 
response. Forgas (1995) argues that positive affective 
arousal leads to more positive expectancies and, thus, to 
higher levels of compliance. Inspirational appeals may 
attempt to motivate people to go beyond their personal 
interests for the greater good by appealing to higher-order 
psychological needs (Yukl and Tracey 1992). 


Linking Influence Processes to 
SITs 


Kelman (1958, 1961) and others (e.g., Bearden, Netemeyer, 
and Teel 1989) describe three underlying processes of inter- 
personal influence: internalization, compliance, and identi- 
fication. Each of the six SITs operates through one of these 
mechanisms of interpersonal influence. Specifically, we 
propose that information exchange and recommendations 
operate through the process of internalization, threats and 
promises operate through the mechanism of compliance, 
and ingratiation and inspirational appeals operate through 
the processes of identification. 

Information exchange and recommendations tactics 
attempt to alter a buyer's "perceptions regarding the inher- 
ent desirability of the intended behavior’ (Frazier and Sum- 
mers 1984, p. 44). We propose that these tactics operate 
through the influence process of internalization, which 
occurs when the target of influence (i.e., the buyer) does 
what the source of influence (i.e., the salesperson) requests 
because the target believes that the action is appropriate and 
is in his or her own best interest (Kelman 1961). A target 
accepts the induced behavior because it is in agreement 
with his or her value system or because the content of the 
behavior is intrinsically rewarding (Smith and Barclay 
1997). In sales settings, internalization occurs when buyers 
accept influence because they find the content of the influ- 
ence to represent a useful solution or because it is congruent 
with their own orientations (e.g., approach to problem 
solving). 

In contrast, threats and promises tactics focus on chang- 
ing the motivation of the customer to comply on the basis of 
either positive or negative source-controlled consequences 
(Frazier and Summers 1984; Keith, Jackson, and Crosby 
1990; Venkatesh, Kohli, and Zaltman 1995). We propose 
that these tactics operate through the process of compliance, 
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which occurs when people accept influence (or change their 
beliefs) because they hope to obtain a favorable reaction or 
reward from another person (Kelman 1961). The recipient 
may be interested in attaining certain rewards or in avoiding 
certain “punishments” that the influencing agent controls 
(Brown, Lusch, and Nicholson 1995; Gundlach and Cadotte 
1994). For example, buyers may be receptive to “compli- 
ance” because they hope to obtain preferential delivery of a 
product or desirable credit terms. The recipient of the influ- 
ence likely does not adopt the induced behavior because he 
or she believes in its content but rather because it is instru- 
mental in goal accomplishment. 

Ingratiation and inspirational appeals aim to satisfy a 
target’s psychological needs by being attractive, by main- 
taining a fulfilling relationship with the target, and through 
eliciting positive emotional responses. We propose that 
these tactics operate through the process of identification, 
which occurs when a target is receptive to influence 
because he or she finds it attractive to maintain a satisfying 
or self-defining relationship with the influencer (Kelman 
1961). Identification is based on satisfying psychological 
needs. A person can try to establish or maintain a relation- 
ship with another party through identification in different 
ways. For example, he or she can “identify” with another by 
echoing what the other person expresses, endorsing that 
person’s views, or even behaving like that person, thus 
forming a satisfying or self-defining relationship with that 
person. In summary, ingratiation and inspirational appeals 
tactics operate through the process of identification because 
they seek to elicit enthusiasm, liking, and other positive 
emotive feelings toward the salesperson, the salesperson’s 
firm, and/or his or her offerings. 


Hypotheses 


As we noted previously, Sheth (1976) suggests that buyers 
can be distinguished on the basis of their task orientation, 
self orientation, and interaction orientation. These buyer 
orientations are rooted in the literature on leadership (Bass 
1960). Sales researchers examining customers’ social styles 
propose similar conceptualizations (Comstock and Higgins 
1997; Williams and Spiro 1985). For example, Blake and 
Mouton (1980) propose that customers may have two orien- 
tations: concern with the sale and concern about the other 
person. These two orientations parallel task and interaction 
buyer orientations. Furthermore, customers who are low on 
both of these orientations may be categorized as having a 
self orientation (Williams and Spiro 1985). 

Buyer orientations have also been referred to as com- 
munication styles in the literature because they represent a 
person’s overall communication pattern and goal expecta- 
tions from social interactions (Williams and Spiro 1985). 
Sheth (1976, p. 382) defines these orientations as “repre- 
senting the format, ritual or mannerism which a buyer” 
adopts in his or her interactions with salespeople. A buyer 
orientation is rooted in personality variables, socialization 
processes, personal lifestyles, and situational factors (Sheth 
1976), suggesting that though a buyer’s orientation is not a 
personality trait, it can be relatively stable. This has two 
important implications. First, because buyer orientations are 
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somewhat stable, it is possible to identify a buyer's orienta- 
tion and communication preferences. Second, the adaptive 
selling literature suggests that sellers who use the influence 
tactics that are most appropriate for a particular buyer orien- 
tation are likely to reap greater benefits than those who do 
not. 


SITs’ Effectiveness and Buyer Task Orlentation 


Buyers with a task orientation are goal oriented and pur- 
poseful (Sheth 1976). They want to accomplish the task at 
hand as efficiently as possible, and they are intolerant of 
any activity that is inefficient and non-goal oriented or that 
deviates from the current task on which they are focusing 
(Sheth 1976). Williams and Spiro (1985) suggest that 
people with a task orientation tend to be mechanistic in 
their approach to others and wish to make the best decision 
possible for their organization by being focused on the task. 
People with a task orientation are more focused on the task 
and less focused on relationship building than people with 
an interaction orientation. Similarly, those with a task orien- 
tation are more focused on achieving organizational objec- 
tives than those with a self orientation (Miles, Arnold, and 
Nash 1990). 

Salespeople who rely on SITs that operate through the 
process of internalization are likely to achieve greater mani- 
fest influence with buyers with a high task orientation. This 
is because such buyers need to be convinced of the merit of 
an idea or action and observe how it meets organizational 
objectives. They need to believe that the solutions being 
proposed are appropriate and congruent with their value 
system. Information exchange and recommendations, which 
operate through the process of internalization, are likely to 
appeal to buyers with a high task orientation because these 
tactics attempt to convince the target of the inherent attrac- 
tiveness of the intended action by presenting facts and logi- 
cal arguments (Frazier and Summers 1984). Indeed, prior 
research suggests that buyers with a task orientation 
respond more favorably to influence attempts that are high 
in knowledge and information than to attempts that are low 
in knowledge and information (Williams and Spiro 1985). 

More specifically, the higher the information exchange, 
the more it educates buyers about the salesperson's products 
and services, general business issues, and operating proce- 
dures. Furthermore, when information exchange is high, it 
enhances buyers' knowledge and enables them to evaluate 
different solutions. Buyers with a high task orientation will 
value such information because it helps them achieve orga- 
nizational goals. Similarly, recommendations are likely to 
appeal to such buyers. Influence tactics laden with recom- 
mendations provide specific suggestions and arguments to 
the buyer for making purchase decisions (Frazier and Sum- 
mers 1984). They even suggest the behaviors that the target 
should follow to accomplish organizational objectives. 
Thus, recommendations help buyers with a high task orien- 
tation compare and contrast different solutions efficiently, 
enabling them to make decisions more expeditiously. 


H,: The higher a buyer's task orientation, the stronger is the 
positive effect of salesperson (a) information exchange 
and (b) recommendations on buyer manifest influence. 


~~ 


SITs’ Effectiveness and Buyer Self Orientation 


Buyers with a self orientation are concerned almost entirely 
with their own welfare. For them, the sales interaction is 
“merely the theater in which ... needs can be: satisfied” 
(Bass 1960, p. 149). They view the interaction from a self- 
ish point of view (Sheth 1976). They are preoccupied with 
themselves during interactions with others and are less 
empathetic toward other people (Miles, Arnold, and Nash 
1990). Success in meeting personal needs outweighs the 
needs for task effectiveness and for accomplishing organi- 
zational objectives. Neither interaction nor task effective- 
ness is rewarding to the self-oriented individual, unless such 
effectiveness contributes to his or her self-esteem, status, or 
other personal gains (Bass 1960). 

Because buyers with a self orientation are keen on pro- 
moting their own agenda and interests, they are likely to be 
most responsive to the influence process of compliance. 
Influence processes based on compliance aim to obtain a 
change in behavior not by altering perceptions about the 
desirability of the intended action but through the use of 
rewards or sanctions. Such processes are effective in obtain- 
ing behavioral change when targets are either reluctant to 
act or will not act unless they personally benefit from taking 
an action (Kelman 1961). Threats and promises, which 
operate through the process of compliance, are likely to be 
most effective with buyers with a high self orientation 
because they provide inducements for a necessary behavior. 

When buyers go too far in preserving their own self- 
interest to the detriment of the buyer-seller relationship or 
to the detriment of the buyer's own organization, the use of 
threats (e.g., withdrawing preferential delivery dates or 
applying stringent credit terms) may be the only effective 
means to motivate them to act. In addition, because buyers 
with a high self orientation are primarily interested in their 
own benefits and are highly concerned with extrinsic 
rewards (Williams and Spiro 1985), promises are likely to 
appeal to these buyers because it rewards them with incen- 
tives and concessions (Kelman 1958). In addition, because 
such buyers often display a reluctance to act (Bass 1960), 
promises and threats are a powerful means for overcoming 
this reluctance. Collectively, these arguments indicate that 
buyers with a self orientation are likely to be responsive to 
the influence tactics of threats and promises. _ 

We also anticipate that ingratiation influence tactics will 
be effective. Because buyers with a self orientation are self- 
absorbed, they like to receive attention and brag about their 
accomplishments. At the same time, they are motivated in 
interactions to gain esteem (Stotland and Zander 1958). 
They tend to display dependent and succorant behavior; in 
other words, they need validation and support from others 
(Bass 1967). A key aspect of ingratiation tactics is other 
enhancements. Ingratiation serves the dual role of providing 
buyers the validation they seek and supporting their belief 
of self-importance. 

Hz: The higher a buyer's self orientation, the stronger is the 

positive effect of salesperson (a) threats, (b) promises, and 
(c) 1ngratiation on buyer manifest influence. 


SITs’ Effectiveness and Buyer Interaction 
Orlentation 


Buyers with an interaction orientation believe that socializ- 
ing is a critical aspect of the interaction process. They are 
interested in forming friendships and fostering interpersonal 
relationships (Bass 1960). Such buyers put more emphasis 
on relational elements than on the specifics of the transac- 
tion. They are “often compulsive in establishing a personal 
relationship” with the salesperson and only then "[get] 
involved in the specific content of interaction[s]" (Sheth 
1976, p. 385). Developing personally satisfying relation- 
ships is paramount to such buyers. Given the focus on 
social interactions and relationships, buyers with an interac- 
tion orientation are likely to be most responsive to the influ- 
ence process of identification. Indeed, Kelman’s (1961) 
theory suggests that people are more likely to be influenced 
through the identification process when they are concerned 
with social anchorage. 

Ingratiation and inspirational appeals, which rely on 
identification as an influence process, are likely to result in 
greater manifest influence among buyers with a high inter- 
action orientation. Importantly, ingratiation influence tac- 
tics are aimed at building a rapport with buyers (Kumar and 
Beyerlein 1991). Salespeople who rely on ingratiation tac- 
tics behave affably, discuss shared interests and hobbies, 
and give praise and compliments (Kipnis and Schmidt 
1988). These types of behaviors help forge interpersonal 
relationships, which are consistent with the goals of a buyer 
with an interaction orientation. Conversely, salespeople 
who hardly rely on ingratiation tend to focus less on build- 
ing relationships or on appealing to the buyer’s positive 
image. These salespeople are likely to have lower levels of 
manifest influence with such buyers. 

Buyers with an interaction orientation prefer sales tac- 
tics that emphasize emotional utilities, which indicates that 
such buyers look beyond product features and product 
benefits and make decisions based on emotions (Williams, 
Spiro, and Fine 1990). This concept is consistent with 
recent consumer research that indicates that some people 
use a process in decision making known as the “how-do-I- 
feel-about-it" heuristic (Adaval 2001; Pham et al. 2001). 
Inspirational appeals rely on attaching emotional relevance 
to a product, independent of the product's performance or 
functional utility. Finally, inspirational appeals are likely to 
resonate with buyers with an interaction orientation because 
such appeals seek to elicit excitement and other positive 
emotions in regard to the salesperson and/or his or her 
offerings. 


H4: The higher a buyer's interaction orientation, the stronger 
is the positive effect of salesperson (a) ingratiation and (b) 
inspirational appeals on buyer manifest influence. 


SITs Usage Based on Buyer Orientations 


Prior studies indicate that, in general, salespeople are aware 
of the type of sales interactions buyers prefer (Comstock 
and Higgins 1997) and use this knowledge when interacting 
with buyers (Spitzberg and Cupach 1984). Research sug- 
gests that the choice of influence tactics is likely to depend 
on the characteristics of the target of influence (Erez, Rim, 
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and Keider 1986). In support of this line of reasoning, 
empirical evidence suggests that people use influence tac- 
tics that they believe are most effective in gaining compli- 
ance from others (Leigh and McGraw 1989; Venkatesh, 
Kohli, and Zaltman 1985). Collectively, these arguments 
suggest that salespeople will vary the frequency with which 
they use SITs in an attempt to conform to buyers’ preferred 
orientations. 

Consistent with the preceding logic, because buyers 
with an interaction orientation are likely to respond to influ- 
ence attempts that rely on internalization, it follows that 
salespeople will use information exchange and recommen- 
dations with such buyers. Similarly, when dealing with buy- 
ers with a self orientation, salespeople will use influence 
attempts that rely on obtaining compliance. That is, they 
will use threats and promises. In addition, they will use 
ingratiation because such buyers also respond to influence 
attempts that enhance their self-image. Finally, salespeople 
will use ingratiation and inspirational appeals when inter- 
facing with buyers who are high on interaction orientation 
because attempts that rely on identification as an influence 
mechanism resonate with such buyers. 


H4: The higher the task orientation of a buyer, the greater is a 
salesperson's use of (a) 1nformation exchange and (b) rec- 
ommendations; the higher the self orientation of a buyer, 
the greater is a salesperson's use of (c) threats, (d) 
promises, and (e) ingratiation; and the higher the interac- 
tion orientation of a buyer, the greater is the salesperson's 
use of (f) ingratiation and (g) inspirational appeals. 


Method 


Common Method Blases and Dyadic Study 
Design 


Our methodology and sample design are consistent with 
Podsakoff and colleagues’ (2003) recommendations for 
reducing common methods biases. They recommend (1) 
obtaining measures of the predictor and criterion variables 
from difference sources; (2) offering anonymity/confiden- 
tiality to reduce the chances of responses that are socially 
desirable, lenient, acquiescent, and consistent with how par- 
ticipants think researchers want them to respond; and (3) 
informing the respondents that there are no correct or incor- 
rect answers and that they should respond as honestly as 
possible to reduce evaluation apprehension. In our study, we 
used bidirectionally matched dyads such that the key 
variables are non-self-reported. That is, we had buyers 
report on the influence tactics that salespeople use, and we 
had salespeople report on buyer orientations. In addition, 
we informed the respondents that their responses were con- 
fidential and that there were no right or wrong answers to 
the items in the survey. 


Final Pretest of SITs Measures 


We conducted a final pretest on the SITs measures using a 
convenience sample of 93 salespeople working for a Fortune 
500 company that manufactures farm equipment. Before 
administering the pretest questionnaire, we asked two execu- 
tives from the participating corporation to review the ques- 
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tionnaire and confirm that all instructions and questions 
were clear and unambiguous. We conducted exploratory fac- 
tor analysis on the data to investigate the unidimensionality 
and factor structure of the sales tactics constructs (e.g., Chal- 
lagalla and Shervani 1996; Singh and Rhoads 1991). Using a 
cutoff eigenvalue of 1.0, we found a six-factor solution, con- 
sistent with our expectations. The reliability of each measure 
exceeded the .70 threshold for acceptable reliability, which 
suggests that the measures were internally consistent (Nun- 
nally 1978). At this stage, we determined that the measures 
were ready for the final study. 


Dyadic Sample and Data Collection Procedure 


The unit of analysis in this study is the individual dyadic 
relationship between buyers and sellers. We chose an indus- 
trial retail setting in which a dealer salesperson sells to end 
users. The dealers in the study were resellers of agricultural 
equipment for a Fortune 500 company. These dealers repre- 
sent the manufacturer’s products exclusively. The dealers 
operate in a competitive environment in which customers 
can choose from multiple brands. Most customers are 
small-business operators. The customer’s purchase decision 
is most often made by an individual rather than by a 
committee. 

We first approached the vice president of dealer opera- 
tions at the agricultural equipment manufacturer. This com- 
pany divided its market into 100 regions and had multiple 
dealers in each region. Our sample frame consisted of 400 
salespeople, one from each of 400 dealerships (4 dealer- 
ships per region). Thus, the maximum number of indepen- 
dent, bidirectionally matched dyads that could be achieved 
was 400. Following the procedure we describe subse- 
quently, we collected 193 independent, bidirectionally 
matched, usable dyads, thus yielding a 48% achieved 
dyadic response rate. 

We used a two-phase process to obtain this dyadic data. 
In the first phase, we obtained a customer list for each deal- 
ership from the dealers’ parent organization. This list 
included the names of three customers per dealership. 
These customers were chosen at random from each dealer- 
ship’s customer database. We eliminated 12 because of 
inaccurate addresses, missing or incorrect customer names, 
and duplicate entries. To ensure accurate responses, we 
guaranteed respondents confidentiality and informed them 
that there were no correct or incorrect responses. We asked 
customers to write the name of the salesperson they had pri- 
mary contact with at the dealership on their questionnaire 
and to respond to all questions with that salesperson in 
mind. After three waves of mailings, we received 459 
responses, for a response rate of 39%. In some cases, we 
received multiple responses for each dealership. The 459 
customers who responded to the survey mentioned a total of 
290 unique dealer salespeople. 

In the second phase, we mailed questionnaires to the 
290 salespeople that customers in the first phase identified. 
To ensure that we obtained matched dyads, we provided 
each salesperson with the name of the customer who identi- 
fied him or her and asked the salesperson to respond to all 
questions with that specific customer in mind (we did not 
reveal customer responses). When two or more customers 


identified the same salesperson as their primary contact, we 
selected one customer response on the basis of an assigned 
number chosen at random, using a random-number genera- 
tor. This ensured that each salesperson responded to just 
one questionnaire in which a specific customer was identi- 
fied. This also ensured that each dyad was independent in 
the final analysis. 

We eliminated seven salespeople from the sample frame 
because of incorrect addresses and turnover. This left an 
effective sample of 283 salespeople. We received 199 
responses, for a 70% response rate within this sample. Of 
these, we eliminated 6 responses because of missing data, 
for a total of 193 full-data salesperson responses and, con- 
sequently, 193 individually matched independent pairs of 
buyer-seller dyads, each from a different dealership. We 
tested nonresponse bias with the procedure that Armstrong 
and Overton (1977) outline, using a median split between 
early and late responders for each sample. There were no 
significant differences between early and late responders in 
either sample. 


Measures 


The items and scale anchors for the measures appear in the 
Appendix. The summary statistics for all variables used in 
hypotheses testing appear in Table 2. Because of the likely 
response bias in salespeople’s self-reported influence tac- 
tics, buyers responded to the influence tactics items. Fol- 
lowing the procedures we discussed in detail previously, we 
modified Schriesheim and Hinkin’s (1990), Venkatesh, 
Kohli, and Zaltman’s (1995), and Yukl and Tracey's (1992) 
scales for context and unit of analysis. We used: Kohli and 
Zaltman’s (1988) manifest influence scale to measure the 
effectiveness of each influence tactic. Again, we modified 
the items for context. To eliminate the possibility of social 
desirability bias, buyers responded on this scale. 

We used William and Spiro’s (1985) multi-item scales 
for task orientation, self orientation, and interaction orienta- 
tion. Salespeople were the respondents for these measures 
because their perceptions of the buyer’s orientation drive 
their selection and use of SITs. 


` 


Analytic Procedures and Results 


Construct Valldity 


We analyzed the SITs, buyer orientations, and manifest 
influence measures simultaneously in a single confirmatory 
factor analysis using LISREL 8.54 (Jóreskog and Sórbom 
1996). Fit exceeded the standard cutoff criteria: %2 = 505, 
d.f. = 419; root mean square error of approximation = .033; 
comparative fit index — .99; and standardized root mean 
square residual — .045. The composite trait reliabilities of 
each factor exceeded the recommended cutoff criteria of .70 
(except for buyer interaction orientation, which was .68), 
and the variance extracted for each factor exceeded the .50 
cutoff criteria (Fornell and Larcker 1981; Hair et al. 1998). 
Together, this provides strong evidence that the measures 
are internally consistent. All indicators loaded significantly 
on their corresponding latent factor, indicating convergent 
validity. Using the procedure that Fornell and Larcker 
(1981) recommend, we obtained discriminant validity for 
all pairs of measures. 


Testing the Effectiveness of SITs 


H-H, predicted that the effectiveness of each SIT would 
vary with each buyer orientation. To test these moderator 
effects, we split the sample along the median of the moder- 
ator for each of the three orientation variables to obtain 
"high" and "low" subgroups (Johnston 1984; Kohli 1989). 
For each of the three orientations, we then regressed mani- 
fest influence on the six SITs using cases from both sub- 
groups (restricted run). Next, we ran a second regression in 
which we allowed the regression coefficients of the six 
influence tactics to take different values in the high and low 
subgroups. By including all six influence tactics, we were 
able to examine not only whether the hypothesized relation- 
ships were in the expected direction but also whether the 
nonhypothesized relationships were nonsignificant. 

We performed a Chow test (see Johnston 1984) to 
assess whether the difference in the sum of squared residu- 
als in the restricted and unrestricted run was significant. The 
Chow test results indicate that task orientation (F(s 189) = 


| TABLE 2 
Correlation Matrix and Summary Statistics 





Measure 1 2 3 


1. Buyer task orlentation (.87) 
2. Buyer Interaction orientation .31 (.67) 


3. Buyer self orientatlon —.39 —.20 (.89) 
4. Information exchange .13 .17 —.15 
5. Recommendations .04 15 | —07 
6. Threats —.26 7.24 .20 
7. Promises —.32 —.19 .27 
8. Ingratiation —.04 .18 .05 
9. Inspirational appeals .03 ,.26 —.05 
10. Manifest influence —.10 .10 .00 
M 4.00 3.52 1.90 
SD 72 .90 .88 
Composite reliability .87 .68 .89 
Variance extracted .68 .52 72 


4 5 6 7 8 9 10 
(.87) 

70 . (90) 

—21  -—16 (96) 

02 12 40 (82) 

53 51 06 22  (.78) 

49 58 —13 16 66 (.80) 

54 53  —04 20 47 42 (92) 
3.74 3.31 118 155 331 356 3.40 
1.05 1.19 59 81 1.03 1.04 98 

87 90 97 83 77 81 92 

69 75 88 62 53 58 70 


Notes: Cronbach’s alphas appear on the diagonal of the correlation matrix. Correlations 2 +.14 are significant at p « .05 


Influence Tactics for Effective Adaptive Selling / 109 


2.93, p < .01; adjusted R2 = .44), self orientation (Fre, 189) = 
2.20, p < .05; adjusted R2 = .47), and interaction orientation 
(Fee, 188) = 2.68, p< .01; adjusted R2 = 46) moderate the 
relationship between the six influence tactics and manifest 
influence. This finding lends overall support to the notion 
that buyer orientations matter when influence tactics usage 
is considered. The significant standardized regression coef- 
ficients from these analyses appear in Table 3. 

As Hyp predicted, recommendations have a positive 
relationship to manifest influence when buyer task orienta- 
tion is high (b = .54, p « .01), but recommendations are 
unrelated to manifest influence when buyers have a low task 
orientation. Contrary to H;,, information exchange is posi- 
tively related to manifest influence both when buyer task 
orientation is low (b = .78, p « .001) and when it is high 
(b = .44, p < .05), suggesting a main effect for this tactic. 
Thus, H; receives partial support. Notably, promises, ingra- 
tiation, and inspirational appeals are unrelated to manifest 
influence when buyer task orientation is high, whereas 
threats have a negative effect (b 2 —39, p « .05). Overall, 
these findings indicate reasonable support for our theoreti- 
cal model. 

As H» and H», predicted, promises (b = .21, p < .05) 
and ingratiation (b = .74, p « .001) are positively related to 
manifest influence for buyers with a high self orientation, 
but they are unrelated to it for buyers with a low self orien- 
tation. Threats are unrelated to manifest influence for 
high-self orientation buyers, as H}, predicted, but threats 
are negatively related to it for low-self orientation buyers 
(b = —.30, p < .05). The other influence tactics, except for 
recommendations, are unrelated to manifest influence when 


buyer self orientation is high. In general, these findings sup- 
port H» and are consistent with our theoretical model. 

Finally, H4 receives strong support; both ingratiation 
(b = .46, p < .01) and inspirational appeals (b = .39, p < .01) 
are positively related to manifest influence when buyer inter- 
action orientation is high. Notably, inspirational appeals are 
negatively related to manifest influence (b 2 —43, p « .05) 
for buyers who are low in interaction orientation. We also 
find strong support for our theoretical model because none 
of the other influence tactics are related to manifest influ- 
ence when buyer interaction orientation is high. 


Buyer Orlentation Clusters and SITs Usage 


We used a two-stage analytical procedure to test Hy. In the 
first step, rather than assume that buyers exhibit only one 
orientation, we clustered the buyers using their scores on 
the three buyer orientations as input.? We employed hierar- 
chical clustering using Wards method to cluster the buyers 
initially. We uncovered a three-cluster solution using the 
procedures that Milligan and Cooper (1985) recommend for 
determining the number of clusters. Next, as Punj and Stew- 
art (1983) recommend, we used the cluster seeds from the 
hierarchical cluster analysis as starting values in a k-means 
cluster analysis. We eliminated four outliers on the basis of 
their distance from the cluster seeds. 

Figure 1 depicts the plot of the cluster means. The 
results of the cluster analysis are interesting and suggest 
that buyers are more complex than originally presumed. 


3Again, we thank a knowledgeable reviewer for recommending 
that we cluster the buyers on the basis of their orientations. 


TABLE 3 
Regression Coefficlents Across High and Low Levels of the Buyer Orlentations 
(Dependent Variable: Manifest influence) 





Moderator Leve! 





Moderator Varlable Independent Variables Hypothesis High Low Adjusted R? Chow Test 
Buyer Task Orlentation Information exchange Hia .A4* io" 44 Fis, 189) = 2.93** 
Recommendations Hib b4** — 

Threats —.39* — 
Promises — ur 
Ingratiation — .d4* 
Inspirational appeals — — 
Buyer Self Orlentation Information exchange — V i aci 47 Fie, 189) = 2.20* 
Recommendations ie" 450" 
Threats Hog — 30* 
Promises Hop .21* — 
Ingratiation Hac ET — 
Inspirational appeals — — 
Buyer Interaction Orlentatlon — Information exchange — 1.05*** .46 Fe, 188) = 2.68" 
Recommendations — — 
Threats — — 
Promises — — 
Ingratiation Haa .46** — 
Inspirational appeals Hag 39"  —.43* 
*p « .05. 
**D « .01. 
***p < .001. 


Notes: Dash indicates coefficient is not significant at the .05 level 
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Only one of the clusters aligns on a single orientation; the 
remaining clusters align on multiple orientations. About 
one-third of the buyers fell into each of the clusters, which 
we named “task-focus buyers,” “engagement-focus buy- 
ers,'4^ and “self-focus-with-balance buyers" (see Table 4). 
Task-focus buyers have a much higher task orientation 
score (4.2) than interaction (2.7) and self (1.4) orientation 
Scores. Engagement-focus buyers are high on both task 


^We thank one of the reviewers for alluding to this term. 


FIGURE 1 
Plot of Mean Scores for Buyer Orlentation 
Clusters 





Self Orientatlon 


Interaction 
Orlentation 


Task Orientation 


—9— Task-focus buyers cluster 
—*— Engagement-focus buyers cluster 


—&— Self-focus-with-balance buyers cluster 
- :X- Group mean 





(4.3) and interaction (4.4) orientations but are low on self 
Orientation (1.4). Self-focus-with-balance buyers have 
higher scores on self orientation (2.6) than the other two 
clusters (i.e., task-focus buyers = 1.4 and engagement-focus 
buyers = 1.4). However, no orientation seems to be high 
within this third cluster (i.e., task orientation = 3.6, interac- 
tion orientation = 3.4, and self orientation = 2.6). 

In the second step of the analysis, we used the three 
buyer clusters as the multicategory variable in a multivari- 
ate analysis of variance (MANOVA); the six SITs served as 
the independent variables. Table 4 depicts the mean scores 
of the SITs from the MANOVA. Wilks’ lambda is .82 (F = 
3.13, p < .001), Hotelling's T is .21 (F = 3.17, p < .001), and 
Roy's largest root is .17 (F = 5.20, p < .001) for the 
MANOVA, indicating that the mean values for the SITs 
vary significantly across groups. As Table 4 depicts, the uni- 
variate one-way analysis of variance (ANOVA) results indi- 
cate that the means for all SITs, except for that of recom- 
mendations, are significantly different across the clusters. 

Helmert contrasts indicate that information exchange is 
in the significantly higher range for both the task-focus and 
the engagement-focus buyers and is in the lower range for 
the self-focus-with-balance buyers, which is consistent with 
Ha44. There are no differences in the use of recommenda- 
tions across the clusters, which is not supportive of Ha. 
The use of threats and promises is significantly higher for 
the self-focus-with-balance cluster than for the other two 
clusters, which is consistent with H4, and H44. The use of 
ingratiation and inspirational appeals is significantly higher 
for the engagement-focus cluster than for the other two 
clusters, which is consistent with H4r and H4, but is incon- 
sistent with H4.. Overall, these findings indicate that sales- 
people use the influence tactics recommended by theory in 
their interactions with buyers. 

Table 4 also points to the specific combination of SITs 
used in each cluster. For example, when dealing with 
engagement-focus buyers, salespeople use information 
exchange, recommendations, ingratiation, and inspirational 
appeals at a higher level, and the use of the latter two tactics 
is significantly higher than in the other clusters. Likewise, 
with self-focus-with-balance buyers, it might be expected 


! TABLE 4 
Means for Buyer Orlentation Clusters (MANOVA) 


Buyer Orientation Clustersa 


Task-Focus Engagement- 
Buyers Focus Buyers 
Information 
exchange 3.8 4.0 
Recommendations 3.4 3.5 
Threats 1.2 1.0 
Promises 1.5 1.2 
Ingratiation 3.1 3.6 
Inspirational appeals 3.4 3.8 
Number of 
Observations 59 63 


ANOVA Results 

Self-Focus-with- 

Balance Buyers F Statistic p Value 
3.5 3.40 « .05 
3.2 .89 n.s 
1.3 4.32 < .05 
1.8 7.44 < .001 
3.2 3.28 « .05 
3.4 3.42 « .05 


67 


aMean values in bold are significantly higher than the other mean values along the same row, based on the Helmert comparison procedure 


(p « .05). 


bThese are the results from a one-way ANOVA with the clusters as the grouping variable. 
Notes: The abbreviation “n.s.” indicates that the ANOVA is not significant at the 05 level 
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that all influence tactics are used at moderate levels. This is 
indeed the case, except for threats and promises, which do 
not seem to be used much across the board. Nevertheless, 
the use of threats and promises in this cluster is higher than 
in the other clusters. Thus, overall, salespeople appear to 
use the combinations of tactics that would be expected on 
the basis of theory. The overall pattern of results from the 
MANOVA leads to two important conclusions: First, sales- 
people use combinations of tactics, and second, in general, 
the tactics they use follow the theoretical prescriptions of 
our model. 


Post Hoc Analysis 


To validate that the SITs effectiveness results (i.e., H,—H3) 
we obtained previously for each buyer orientation factor 
also hold true for the buyer clusters, we conducted a multi- 
group analysis using LISREL 8.54. Table 5 depicts these 
results. To assess whether the differences among groups are 
significant, we compared this model with a null model in 
which the gamma paths were fixed across each group. The 
chi-square difference (Ay? = 55.89, d.f. = 12) between these 
two models is significant at p « .0001, indicating that the 
null model should be rejected and that group differences are 
significant. 

As we discussed previously, we find general support for 
H,—H3, which suggests that matching sales tactics to buyer 
orientations positively affects manifest influence. We find 
additional support for this notion, even when we examine 
buyer clusters (see Table 5). Indeed, seven of the nine paths 
are significant and in the expected direction. For example, 
consistent with H,, we find that information exchange and 
recommendations resonate with clusters that include buyers 
with a task orientation (i.e., task-focus buyers and 
engagement-focus buyers). The two exceptions were 
inspirational appeals and threats, which had no impact on 
manifest influence for engagement-focus buyers and self- 
focus-with-balance buyers, respectively. In addition, 
unexpectedly, information exchange had an impact on the 
manifest influence of self-focus-with-balance buyers. 
Overall, for the hypothesized paths, with the exception of 
inspirational appeals, the results from the buyer clusters 
analysis are consistent with the Chow test results we present 
in Table 3. 


Discussion 


This study advances our knowledge of the use of influence 
tactics in buyer-seller interactions. It identifies the specific 
influence tactics that salespeople use with buyers and the 
relative effectiveness of these tactics across different buy- 
ers. Importantly, it demonstrates that salespeople match 
their use of SITs to different buyer orientations in a theo- 
retically prescribed manner and, by doing so, that sales- 
people are able to influence positively the buyers’ decision 
making. 


Contributions to Theory 


The importance of adaptive selling is widely recognized in 
the sales literature. This literature suggests that rather than 
use the same tactics, salespeople should adapt their tactics 
and that these adaptations should “suit” the buyers with 
whom they are dealing (Szymanski 1988; Weitz, Sujan, and 
Sujan 1986). After all, merely changing tactics is insuffi- 
cient if the adaptations do not appeal to buyers. Notably, 
prior research is largely focused on the degree to which 
salespeople change their tactics and not on whether the 
adaptations are suitable. Our study makes a contribution to 
the literature by developing a theoretically grounded frame- 
work for understanding the suitability of each SIT for dif- 
ferent types of buyers. Our theoretical model uses Kelman's 
(1961) influence processes to identify the influence tactics 
that will resonate with buyers on the basis of their orienta- 
tions. By using theory to pinpoint the appropriate tactics for 
different buyers, we add to the stream of research on adap- 
tive selling. 

This study also contributes to the influence strategy lit- 
erature by identifying the influence strategies, or tactics, 
that are used in sales contexts. Frazier and Summers (1984) 
developed what many researchers consider the most widely 
used taxonomy of influence strategies in marketing. How- 
ever, they developed their strategies for a different context 
and for a different level of analysis; principally, these strate- 
gies are aimed at achieving influence within existing firm- 
to-firm-level relationships. In contrast, the personal selling 
context often deals with individual-level relationships. In 
this context, we find that it is necessary to add two tactics to 
Frazier and Summers's taxonomy, namely, ingratiation and 


TABLE 5 
Multigroup LISREL: The Impact of SITs on Mantfest Influence by Buyer Cluster 
(Standardized Gamma Path Coefficients) 





Path from Path to 
Information exchange Manifest Influence 
Recommendations Manifest Influence 
Threats Manifest Influence 
Promises Manifest influence 
Ingratiation Manifest influence 
Inspirational appeals Manifest Influence 
*p < .01. 

**p < .001. 


Notes: x?e3) = 2921 43 for base model. 
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Buyer Orientation Clusters 


Task-Focus Engagement- Self-Focus-with- 
Buyers Focus Buyers Balance Buyers 
,1'* .28** .45* 
20° .21* .11 
—.02 .09 .03 
.10 .09 .29"* 
.17 .32* .37* 
—.11 .03 —.12 


inspirational appeals. In addition, we find that Frazier and 
Summers’s requests and legalistic pleas strategies are either 
implicit or uncommon in the interpersonal sales context. 
This updated taxonomy provides sales researchers with an 
appropriate context-specific set of influence tactics (i.e., 
SITs) for examining the salesperson influence process. 

This study extends the work on buyer orientations (e.g., 
Sheth 1976; Williams and Spiro 1985) and helps identify 
clusters of buyers as they exist in industrial buying. Two 
interesting conclusions can be drawn from our study on this 
topic. First, buyers are more complex than previously 
believed. Our study uncovered three clusters df buyers. 
Although a dominant task orientation defines the first clus- 
ter of buyers, the other two clusters are not as straightfor- 
ward. The second cluster of buyers, engagement-focus buy- 
ers, are high on both task and interaction orientation. 
Intriguingly, the third cluster of buyers, self-focus-with- 
balance buyers, is defined by moderate levels of all three 
orientations but with higher self orientation scores than the 
other two clusters. Second, the theory at the factor level 
(1.e., orientation level) appears to be robust. This means that 
salespeople can influence buyers as long as they use the tac- 
tics that theory recommends for each of the orientations a 
buyer reflects. Overall, our study deepens our understand- 
ing of how buyer orientations might cluster and the combi- 
nations of tactics to use with tbese different clusters of 
buyers. 


Managerlal Implications 


Mangers are interested in knowing whether there are uni- 
versally effective tactics and whether there are any tactics 
that should be avoided. This study provides interesting 
insights into this question by integrating the findings of sev- 
eral different analyses and perspectives on buyer orienta- 
tions. If the traditional view is taken that buyers have a 
single dominant orientation, it would appear that no tactic is 
universally effective across all buyers. Solely on the basis of 
traditional analytic results, the following advice would be 
given: For buyers with a task orientation, sellers should use 
information exchange and recommendations; for buyers 
with an interaction orientation, sellers should rely on ingra- 
tiation and inspirational appeals; and for buyers .with a self 
orientation, sellers should depend on promises and ingratia- 
tion. These results strongly support the key premise of the 
adaptive selling literature (e.g., Weitz, Sujan, and Sujan 
1986) and suggest that a buyer's orientation should drive 
the use of influence tactics. 

However, our additional analyses suggest that, more 
often than not, buyers align along multiple orientations. 
Indeed, only one-third of the buyers align along a single 
dominant orientation (i.e., task orientation); the. other two- 
thirds have multiple orientations. This implies that if an ori- 
entation is ubiquitous among buyers, it is possible to 
uncover universally effective tactics. Notably, we find that 
buyers in all three clusters have moderate to high levels of 
task orientation. Perhaps buyers who have solely an interac- 
tion orientation or solely a self orientation may not be able 
to survive in an organization if they cannot fulfill task obli- 
gations. Given the omnipresence of task orientation, it is not 
surprising that information exchange resonates with all buy- 


ers. Recommendations come close to having universal 
appeal, with two of three buyer clusters responding to this 
tactic. Overall, these findings indicate that the quest for 
unearthing universally effective tactics is not a futile one. 

This study also provides insights into tactics that should 
be avoided. Threats fall into this category. Although theory 
suggests that threats may be effective with buyers who have 
a self orientation, we do not find that to be the case. How- 
ever, two points are noteworthy to put this finding into per- 
spective. First, salespeople may have limited opportunities 
to use threats because few buyers exhibit a high level of self 
orientation. Second, the overall use of threats by sales- 
people in this sample is low, thus raising the possibility that 
the nonsignificant finding is because of range restriction. A 
reason for the lack of reliance on threats may have to do 
with our context. Unlike channels contexts (e.g., Frazier 
and Summers 1986), in industrial retail settings, dealers and 
manufacturers often have little power over customers. Any 
form of threats (e.g., tighter credit terms, delayed delivery) 
is likely to misfire because customers can turn to other sup- 
pliers. Nevertheless, we advocate future studies to retain 
this tactic because in other selling contexts, customer 
options may be more limited, and threats may be a viable 
tactic when dealing with buyers who have a self orientation 
(McFarland 2003). 

Our findings emphasize the importance of the two tac- 
tics that we add to Frazier and Summers's (1984) taxonomy 
and also call into question the comprehensiveness of many 
sales training programs. We find that ingratiation plays a 
crucial role in affecting manifest influence of buyers with 
an interaction orientation and self orientation, whereas 
inspirational appeals seem to work with buyers with an 
interaction orientation. With two-thirds of the buyers falling 
into clusters that exhibit two orientations (1.e., engagement- 
focus and self-focus-with-balance clusters), these tactics 
can be used frequently to persuade buyers. Notably, how- 
ever, several of the best-selling sales training programs in 
the United States underscore a selling approach that is par- 
ticularly suited for buyers with a task orientation. The focus 
of these training programs is on asking a sequence of ques- 
tions to develop fully the “implications and need payoff” 
(as in SPIN selling [Rackham 1988]) or in "analyzing cur- 
rent position" and "thinking through alternative positions" 
(as in Miller, Heiman, and Tuleja's [1985] program strategic 
selling). Although this 1s correct given the universal benefit 
of using information exchange across all clusters of buyers, 
influence tactics with an affective basis should find a 
greater voice in the prominent sales training programs. 

Overall, when buyers have several orientations, sales- 
people might be better off using a combination of tactics 
rather than only one or two tactics. The encouraging news 
for sales managers is that salespeople's choice of influence 
tactics appears to be largely governed by buyers' orienta- 
tions, even when they align along multiple orientations. 
This suggests that if salespeople can identify buyer cues, 
they are likely to respond to them. 


Limitations and Directions for Further Research 


We note the importance of contextual differences. Although 
the salespeople who participated in this study are geograph- 
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ically dispersed across the United States and work at inde- 
pendently owned dealerships, the salesperson data are nev- 
ertheless from a single industry. Thus, there is a need for 
further research that examines a variety of industries and 
firms to expand the generalizability of the empirical results. 
For example, an examination of industries in which buyers 
are highly dependent on sellers would advance the under- 
standing of the appropriateness and effectiveness of the 
threats tactic under these circumstances. In the same spirit, 
it would be important to assess whether the relative impor- 
tance of the different influence tactics is relatively stable or 
varies across cultures. A strength of the current study is that 
though the focus is on professional buyer—seller relation- 
ships in a business-to-business context, the salespeople sell 
both products and services, often in a retail setting. 

An important contribution of this study to the influence 
tactics literature (e.g., DelVecchio et al. 2003; Goff, Bel- 
lenger, and Stojack 1994; Scheer and Stern 1992; Spiro and 
Perreault 1979; Yukl and Tracey 1992) is the validation of 
the SITs framework. Although we were ultimately able to 


use existing influence tactics in the formation of this frame- 
Work, we were careful to follow traditional measurement 
development standards, including a goal of including the 
most commonly used tactics in the framework and eliminat- 
ing those that are not consistent with the personal selling 
context. For researchers who want to examine the sales- 
person influence process within the buyer—seller exchange, 
SITs provide a good starting point. Inspirational appeals 
have received the least attention in the literature compared 
with the other five SITs we identified. Furthermore, almost 
all the literature that has focused on this tactic has not pub- 
lished full item-level data. Thus, four of the five items we 
included in this study for this measure are new. We recog- 
nize that our scale items for this construct will require 
modifications in future studies. Researchers should develop 
items that are more specific to the emotions being targeted.5 


——————————————Q— 
?We are grateful to a knowledgeable reviewer for pointing out 
the necessary modifications to the scale 1tems. 


APPENDIX 
Scale Items 


Buyer Responding In Regard to a Specific Salesperson 


Se ———M—M—MÓÉÓ eme e Te ee 
Source SITs Scale Items 
Based on Information Exchange 


Venkatesh, Kohl, The salesperson ... 
and Zaltman 1. Presented information related to your vanous purchase options. 
(1995) .. 2. Asked about your long-term purchasing goals. 
3. Ensured you recelved all product and sales materials relevant to your purchasing decislons.8. d 
4. Asked if there were any problems or needs he or she could help address.« d 
5. Talked about the possible applications of his or her products or services.d 
Adapted from Recommendations 
Venkatesh, Kohll, The salesperson ... 
and Zaltman 1. Made it clear that by following hls or her recommendatlons, your business would benefit. 
(1995) 2. Outlined the evidence for expecting success from his or her suggestion(s).¢ 
3. Provided a clear picture of the positive impact on your business a recommended course of 
action would have. 
4. Made a logical argument supporting his or her specific suggestions.s d 
5. Made it explicit, when making a suggestion, that it was intended for the good of your firm. 
Adapted from Threats 
Venkatesh, Kohli, The salesperson ... 
and Zaltman 1. Stated that your firm would stop recelving preferential treatment if his or her proposals were 
(1995) Ignored. 
2. Advised that he or she would stop doing business with your company If his or her requests 
were not followed. 
3. Threatened to become uncooperative if your firm falled to agree to his or her demand(s). 
4. Indicated he or she could make “things difficult” for your business if his or her requests were not 
met. 
Adapted from Promises 
Venkatesh, Kohli, The salesperson ... 
and Zaltman 1. Offered to give special attention to your company If you would give him or her new business.¢ 
(1995) 2 Made promises to give something back for complying with his or her request (e.g., discounts, 


quicker delivery). 

3. Offered additional benefits for your business after you had been inltially reluctant to agree to his 
or her terms.c 

4. Offered to provide Incentives to your business for agreeing to hls or her purchase request(s). 

5. Offered a speciflc deal for your business to change your position on certain issues.c 
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APPENDIX 


Continued 
ae E 
Buyer Responding In Regard to a Speciflc Salesperson 


Source SITs Scale Items 
Adapted from Ingratlation 
Schrieshelm and The salesperson ... 
Hinkin (1990) 1. Acted in a friendly manner prior to asking for what he or she wanted. 


2. Sympathized with you about the added problems that his or her request caused.a 
3. Made you feel good about yourself before making his or her sales pitch. 

4. Complimented and praised your achievernents.d 

5. Discussed shared Interests and/or hobbies prior to discussing sales issues.d 


Based on Yukl and Inspirational Appeals 
Tracey (1992) The salesperson ... 
1. Tried to get you excited about what he or she was selllng.4 
2. Argued you had an exciting opportunity to help your company/business.®: d 
3. Described the use of his or her products or services with enthusiasm and conviction. 
4. Appealed to your values and Ideals when asking for your business. 
5. Made a sales pitch which tried to appeal to your emotions. d 





Source Outcome Varlable Scale Items 

Adapted from Kohli Manifest Influences 
and Zaltman 1. How much weight did you give to the salesperson's opinions before buying?& 
(1988) 2. To what extent did the salesperson's involvement influence your choices? 


3. How much Impact did the salesperson have on your purchase decisions? 

4. To what extent did you go along with the salespersons suggestions? 

5. How much weight did you give the salesperson's statements In making you purchase 
decisions?d 

6. To what extent did your decisions reflect the salespersons Influence? 

7. To what extent did the salesperson influence the criteria used for making purchase decisions?8 





Salesperson Responding In Regard to Matched Buyer 





Source ' Buyer Orientation Scale Items 
Adapted from Buyer Task Orientatlonf 
Williams and Spiro 1. This customer likes to focus on the task at hand. 
(1985) 2. This customer likes to make our sales interactions as efflcient as possible. 


3. This customer is highly goal oriented. 
4. This customer is purposeful in accomplishing his/her purchasing goals. 


Adapted from Williams Buyer Self Orientation 
and Spiro (1985) 1. This customer Is more Interested in him/herself than what | have to say. 
2. This customer usually dominates conversatlons.c 
3. This customer tries to impress me with him/herself. 
4. This customer is more focused on talking about him/herself than my products. 


Adapted from Buyer Interaction Orientation! 
Williams and Spiro 1. This customer Is easy to talk to. 
(1985) 2. Thls customer likes to socialize during sales interactions. 


3. This customer likes to talk to people.c 
4. This customer is interested In me as a person, not just a salesperson. 





aWe dropped this ttem after the confirmatory factor analysis. 

bWe found thls Item to have a low factor loading and did not include it In the confirmatory factor analysis. 

cWe dropped this tem because of face validity concerns of the reviewers. 

dWe added this item to the onginal scale. 

eWe measured these Items on a five-point Likert-type scale, anchored by "very little” and “a great deal.” 

fWe measured these Items on a five-point Likert-type scale, anchored by "strongly disagree" and "strongly agree.” 

Notes: We measured all items on a five-point Likert-type scale, anchored by "never" and “always,” except when otherwise noted. 
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Pierre Chandon & Brian Wansink 


How Biased Household Inventory 
Estimates Distort Shopping and 
Storage Decisions 


The authors develop a model of how consumers estimate the level of product inventory in their households. Two 
laboratory experiments and two field studies involving 29 product categories show that (1) consumers anchor their 
estimates on their average inventory and fail to adjust sufficiently; (2) adjustments follow an inelastic psychophysical 
power function, leading to overestimations of low levels of inventory and underestimations of high levels; and (3) 
adjustments are more elastic and, thus, more accurate when inventory is salient. Contrary to the assumptions of 
practitioners and academic modelers, these inventory estimates, not actual inventory levels, drive subsequent 
purchase incidence. Simulation results further show that biased estimates increase overstocking and spoilage 
among stockout-averse consumers but increase stockouts and unmet demand among overstocking-averse 
consumers. By predicting the magnitude, not just the direction, of estimation biases, the model and the results offer 
new insights into accelerating the consumption of healthy foods and improving the targeting of stockpiling-inducing 


sales promotions. 


“I buy lots of things and then go back to the house and see 
the fridge is full of all the stuff I've just bought” 

—Prince William, Heir of the Bntish Throne, 

(BBC News 2003) 


iased inventory estimates can lead to overstocking 
B- stockouts in a household. Overstocking con- 

tributes to households losing an estimated 14% of 
their meat, grain, fruit, and vegetable purchases (Jones et al. 
2003). Household-level stockouts contribute to unmet 
demand, which represents a wasted opportunity for con- 
sumers and for retailers and manufacturers. For example, 
the success of the "Got milk?" advertising campaign is pri- 
marily attributed to its reduction of the frequency of milk 
stockouts in consumers' refrigerators (Manning 1999). If 
biased inventory estimations lead to overstocking and 
stockouts, it is an important issue for the welfare of budget- 
pressured and time-pressured consumers. It is also an 
important issue for managers and researchers because these 
biases can influence storage, consumption, and repurchase 
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decisions and therefore affect promotional elasticity (Meyer 
and Assungao 1990). Furthermore, if consumers are biased 
in their estimations, this would imply that the interpretation 
and policy recommendations of purchase quantity and tim- 
ing models may be biased because these models typically 
assume that consumers know how much of the product is in 
their pantries (Ailawadi and Neslin 1998). 

A recent surge of interest in estimation biases under- 
scores the importance of understanding consumers' inven- 
tory estimation biases. Researchers have shown that refer- 
ence levels, stimulus size, and salience influence 
consumers’ estimations of product volume in a container 
(Krider, Raghubir, and Krishna 2001; Raghubir and Green- 
leaf 2006; Raghubir and Krishna 1999), numerosity 
(Krishna and Raghubir 1997; Pelham, Sumarta, and 
Myaskovsky 1994), and estimations of purchase and con- 
sumption frequency (Lee, Hu, and Toh 2000; Menon, 
Raghubir, and Schwarz 1995). Still, however, no research 
has examined the following important questions regarding 
consumers’ inventory estimations that are of interest to con- 
sumers, marketers, and researchers: (1) How do people esti- 
mate how much product they have in inventory? (2) Are 
their estimations accurate, or are they systematically biased 
by the reference inventory, inventory size, and inventory 
salience? (3) How do biased inventory estimates distort 
storage and purchase decisions? and (4) What is the rela- 
tionship between estimation biases and key category 
characteristics? 

To answer these questions, we build on psychophysics 
research on magnitude estimation and develop a model of 
how consumers estimate inventory. This model incorporates 
reference, size, and salience effects. Using the model, we 
show in a simulation that biased inventory estimates 
increase overstocking and spoilage among stockout-averse 
consumers but increase stockouts and unmet demand 
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among overstocking-averse consumers. We directly test the 
model’s predictions in two laboratory experiments, which 
include manipulations of internal and external reference 
levels, of the actual size of the inventory, and of the inven- 
tory’s salience. Two field studies involving 29 product cate- 
gories further demonstrate the robustness of the model pre- 
dictions. They also show that estimated inventory levels 
predict repurchase decisions better than actual inventory 
levels do and that inventory estimation biases are related to 
three key category characteristics: the degree of impulse 
buying, the ease of stockpiling, and the average promo- 
tional elasticity. In the general discussion, we show how our 
model and the results provide a parsimonious explanation 
for other, previously unexplained estimation biases. We also 
offer new insights for accelerating the consumption of 
healthy foods and improving the targeting of sales 
promotions | 


A Model of Consumers’ Inventory 
Estimations | 


In this section, we develop a model of how consumers esti- 
mate the quantity of a product they have in inventory. The 
key features of the model are that (1) consumers anchor 
their estimations on internal or external reference levels and 
insufficiently adjust for the actual inventory level, (2) 
adjustments are inelastic (they increase more slowly than 
the true deviation from the reference level, and thus their 
quality worsens as the deviation increases), and (3) adjust- 
ments are more elastic (and, thus, more accurate) when 
inventory is perceptually salient. We first show how each 
prediction can be derived from psychophysical research on 
magnitude estimations and then show how these predictions 
can be integrated in a simple power model. 


Reference Effects | 


; 

Inventory estimations involve either judgments of numeros- 
ity (e.g., "How many eggs do I have?") or judgments of vol- 
ume (e.g., "How many ounces of coffee do I haye?"). Many 
studies have shown that consumers anchor numerosity and 
volume estimations on salient internal or external reference 
levels and fail to adjust sufficiently for deviations from the 
reference level. For example, Krishna and Raghubir (1997) 
show that consumers' estimation of the number of dots in a 
line is higher when the dots are in multiple clusters (high- 
reference anchor condition) than when they are all together 
in one uninterrupted line (low-reference anchor condition). 
Raghubir and Krishna (1999) show that volumé estimations 
are anchored on the elongation of the container (see also 
Wansink and Van Ittersum 2003). Krider, Raghubir, and 
Krishna (2001) show that consumers anchor area estima- 
tions on the most salient dimension, in which salience is 
context dependent. For example, they show thal the orienta- 
tion of a square influences whether the diagonal or a side is 
used as an anchor when estimating its surface (see also Pel- 
ham, Sumarta, and Myaskovsky 1994). 

In the context of inventory estimations, we expect that 
the default anchor is the average category inventory level 
for each consumer. This is a reasonable assumption because 
in the absence of other information on actual inventory, the 


average inventory level is the best estimator of actual inven- 
tory if inventory follows a normal or uniform distribution. 
However, consistent with Krider, Raghubir, and Krishna's 
(2001) results, we expect that consumers will use external 
anchors if they are made contextually salient (e.g., by ask- 
ing consumers to judge explicitly whether an inventory 
level is above or below some number). In summary, we 
expect that consumers anchor their inventory estimations on 
their average inventory, except when external reference lev- 
els are made salient (in which case these reference levels 
serve as anchors). 


Size Effects 


Recent research on anchoring effects has shown that when 
people have internally selected a reference as an anchor, 
they insufficiently adjust for the difference between the ref- 
erence and the actual value of the magnitude to be esti- 
mated (Epley and Gilovich 2001). Epley and Gilovich 
(2004) show that people estimate the number of days it 
takes Mars or Neptune to orbit the sun by using the number 
of days it takes the Earth as an anchor (365 days). As a 
result, they adjust more for Neptune (mean estimated 
answer is 3447 days) than for Mars (mean estimated answer 
is 574 days) because they know that Neptune is further 
away from the sun than Mars, but they still fall short of the 
truth (which is 60,225 days for Neptune and 869 days for 
Mars). 

Our model contributes to the literature on anchoring and 
adjustment by further predicting that the size of the adjust- 
ment is inelastic to the actual deviation from the anchor 
because estimations follow a compressive power function 
of actual magnitude (i.e., EST = a x (ACT), where EST is 
the estimated magnitude, ACT is the actual magnitude, and 
b < 1). In other words, the percentage change in estimations 
is lower than the percentage change in actual size (Stevens 
1986). As a result, adjustments become less effective as the 
deviation between the reference level and the actual inven- 
tory level increases. 

There is considerable evidence that magnitude estima- 
tions follow a compressive power function of actual magni- 
tudes. For example, Teghtsoonian (1965) finds that the 
exponent of the power function is approximately .7 when 
estimating the size of three-dimensional objects. Frayman 
and Dawson (1981) examine exponents of power functions 
for different shapes (cubes, spheres, octahedrons, cylinders, 
tetrahedrons) and find that they are all approximately .6. 
For perceived numerosity judgments, Krueger (1982) finds 
a power exponent between .80 and .82. Overall, there is 
strong support in the literature for our prediction not only 
that consumers fail to adjust sufficiently for the deviation 
from the reference level but also that such adjustments are 
inelastic and therefore increase at a lower rate than the true 
difference between the reference level and the actual inven- 
tory level. 


Sallence Effects 


The power exponent, which measures the elasticity of esti- 
mations to actual changes in the magnitude of the stimuli, is 
influenced by the perceptual salience (e.g., visual promi- 
nence) of the different dimensions of the stimulus. Krider, 
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Raghubir, and Krishna (2001) find that the power exponent 
of area estimations for two-dimensional objects is greater 
when the salience of secondary dimensions (those that are 
not used as anchors) is increased. For example, because 
people anchor area estimations of circles on the length of 
their horizontal diameters, area estimations and willingness 
to pay for round pizzas are more sensitive to the actual size 
of the circle when the vertical diameter is made salient. 
Building on these findings, we predict that the elasticity 
of adjustment improves with the perceptual salience of the 
product in inventory (and, thus, that the power exponent is 
greater when inventory is salient than when it is not). Fol- 
lowing Krider, Raghubir, and Krishna (2001), we define 
salience as the ability of the product’s inventory to attract 
attention. For example, the salience of a product’s inventory 
increases if the product is stored in a visible place or if it is 
purchased or consumed frequently, because these factors 
increase attention to the actual inventory level. When inven- 
tory is salient, consumers are more likely to know whether 
the reference level they anchor on is wrong and should be 
adjusted. In the extreme case, consumers may have encoded 
salient inventory so well that they do not need to rely on the 
reference level at all. When inventory is not salient, how- 
ever, consumers may not even know whether actual inven- 
tory is above or below the reference level and therefore will 
rely mostly on the reference anchor. Thus, we expect that 
inventory estimations are adjusted slightly when inventory 
is not salient and are adjusted more strongly when inventory 
is salient. In other words, we expect that inventory estima- 
tions are more sensitive to actual inventory and, thus, more 
accurate when inventory is salient than when it is not. 


Modeling Biases in Inventory 
Estimations and Their Effects on 
Storage and Shopping Decisions 


A Model of Household Inventory Estimation 


We develop a simple mathematical model of consumers’ 
inventory estimations that allows us to test the effects of 
inventory reference, size, and salience simultaneously. This 
model applies to situations in which consumers cannot sim- 
ply look and count their inventory and, therefore, must esti- 
mate it from memory (e.g., when they are in a store). In 
essence, we estimate a series of power models in different 
conditions (e.g., when salience is low versus high) and use 
the model parameters to test for the hypothesized effects. 
The general form of these power models is as follows: 


(1) EST = a x (ACT), 


where EST is estimated inventory, ACT is actual inventory, 
and a and b are parameters estimated with regression. We 
use power models because they are consistent with psy- 
chophysical research on magnitude estimation and because 
they have four desirable properties. 

The first desirable property is that the model intercept 
captures systematic differences between estimated and 
actual inventory, regardless of inventory level. Changes in 
the intercepts shift the power curve up or down but leave the 
shape of the curve unchanged. Therefore, we can test our 
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prediction that internal or external anchors shift estimations 
toward the reference level by examining the value of the 
intercept a across anchor conditions. 

Second, the power exponent b measures the elasticity of 
estimations and influences the shape of the power curve. If 
b = 1, estimations increase at the same rate as actual inven- 
tory. If b « 1, the power function is compressive, and esti- 
mations are inelastic (i.e., the percentage change in estima- 
tions is lower than the percentage change in actual 
inventory, or in other words, estimations increase at a 
slower rate than actual inventory). For example, if b — .5 
and actual inventory increases by 50% (i.e., by a factor of 
1.5), estimations increase by a factor of (1.5)5 = 1.225 (1.e., 
22.596). If b » 1, the power function is expansive, and esti- 
mations are elastic (they increase at a faster rate than actual 
inventory). Therefore, we can test our prediction that inven- 
tory estimations are inelastic by verifying that b « 1. 

Third, we can use the power model to compute the 
crossover inventory level at which estimations are accurate 
(ACT* = aV -b)). Inventory levels below ACT* tend to be 
overestimated, and inventory levels above ACT* tend to be 
underestimated. We can then compare ACT* with the 
hypothesized reference level (e.g., a person's average per- 
ceived inventory). 

Fourth, the power function treats positive and negative 
deviations from the crossover level similarly (i.e., the power 
exponent is the same, regardless of inventory level; there is 
no asymmetry between "gains" and “losses”). When the 
relevant logarithmic measures of accuracy are used (e.g., 
the log ratio of estimated divided by actual inventory), the 
magnitude of the bias is a linear function of actual inventory 
(measured in logs), and its slope is the same above and 
below the crossover level. Note, however, that the symmetry 
in the power model leads to asymmetry when more conven- 
tional measures of accuracy are used, such as absolute error 
(AE; |estimated — actual] or percentage absolute error 
(PAE; Jestimated — actual|/actual). A positive deviation from 
the crossover inventory level always leads to a smaller PAE 
(but to a larger AE) than a similar-size negative deviation 
from the crossover inventory level. For example, if EST — 
3 x ACTS, ACT* = 9 units. When inventory is above the 
crossover level by 6 units (ACT = 15 units), the AE and 
PAE are 3.38 units and 23%, respectively. When inventory 
is below the crossover level by 6 units (ACT = 3 units), the 
AE and PAE are 2.2 units and 73%, respectively. 

To test the hypothesized reference and salience effects, 
we estimate a different power model for each reference and 
salience condition. We illustrate how salience effects can be 
incorporated, but the approach is the same when we exam- 
ine the effects of different inventory reference levels: 


(2) EST = a x (ACT), when salience is low, and 
EST = a’ x (ACTY",, when salience is high. 
To facilitate the statistical estimation of these power 
models, we linearized them as follows:! 


lThe linearization prevents the model from being estimated 
when either estimated magnitude or actual magnitude is zero To 
overcome this limitation, 1t 1s possible to estimate the following 
nonlinear model with the Levenberg-Marquardt least square algo- 


(3) In(BST) = In(a) + b X In(ACT), when salience is low, and 
In(EST) = In(2^) + b’ x In(ACT), when salience is high. 


To test whether a is statistically different from a’ and 
whether b is statistically different from b’, we estimate the 


following moderated regression: 
(4) In(EST) =a+BxSAL+6xIn(ACT) , 
+ Y X In(ACT) x SAL + €, 


where SAL (salience) is a binary variable coded as —4 when 
salience is low and as 4 when salience is high; € is the error 
term; and a, B, 5, and y are estimated with ordinary least 
squares (OLS). These parameters are interpreted as follows: 
œ measures the average intercept across the two salience 
conditions (e* = [a x a/]4 is the geometric mean of the two 
intercepts); B measures the main effect of salience on the 
intercepts (eP = a'/a); 8 measures the average power expo- 
nent across the two salience conditions (5 = [b + b’]/2); and 
y measures the effects of salience on the power exponent, 
that is, the interaction between salience and actual inventory 
(y = b’ — b). If salience improves the accuracy of estimations 
by reducing the degree of compression, we would expect y 
to be positive and statistically different from 0. 


Simulating the Effects of Blased Inventory 
Estimations on Storage and Shopping Decisions 


Because no one expects that consumers have perfect inven- 
tory knowledge, the important issue is to determine the 
extent to which these biases distort consumers' storage and 
purchase decisions. To address this issue, we simulate the 
weekly category purchase decisions of consumers for a per- 
ishable product with stochastic household-level demand and 
examine the conditions under which biased inventory esti- 
mations lead to overstocking and, thus, to spoilage or stock- 
outs and, ultimately, to unmet demand. 

Consider a shopper who must decide whether to buy a 
dozen eggs in each weekly shopping trip but does not know 
what the exact demand will be over the next days because 
the three household members have an independent 50% 
chance of wanting an egg for breakfast. On any given day, 
demand can be 0 (p = 1/8), 1 (p = 3/8), 2 (p = 3/8), or 3 (p = 
1/8) eggs. For the sake of simplicity, we consider two dif- 
ferent types of households. The first segment consists of 
overstocking-averse households that discard their eggs 
quickly (e.g., after 9 days because they consume tbeir eggs 
soft boiled and are afraid of bacteria). The optimal purchas- 
ing rule for overstocking-averse households is to buy a 
dozen eggs if their inventory on the day of their weekly 
shopping trip falls below 6 eggs. The second segment con- 
sists of stockout-averse households that keep their eggs for 
a relatively long time (e.g., 14 days, because they are con- 
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rithm: EST = a x (BSAL) x (ACT)G x (84D, where a = (a x a, 
B — a'/a, 6 = (b x b^, and y = b'/b. The interpretations of the œ, B, 
6, and 'y parameters is the same as in the linearized model. In all 
analyses, both methods provided simular estimates, and therefore 
we use the simpler linearized model, except when incorporating 
zeroes is desirable because of a low number of observations. 


suming them hard-boiled). The optimal purchasing rule for 
stockout-averse households is to buy a dozen eggs if their 
inventory is below 12 eggs. 

Our model of inventory estimation allows us to simulate 
how biased inventory estimates influence the magnitude of 
spoilage and unmet demand for households that are averse 
either to stockouts (with a discard threshold of 14 days and 
a purchasing threshold of 12 eggs) or to overstocking (dis- 
card threshold = 9 days, purchasing threshold = 6 eggs). 
Building on the field study results for eggs (see Field Study 
2), we model inventory estimations for eggs as a compres- 
sive power function with parameters a= b = 1 (i.e. EST = 
ACT) when estimations are unbiased or with parameters a = 
3 and b = .5 (i.e., EST = 3 x [ACT] 5) when estimations are 
biased. 

For stockout-averse consumers, because of the under- 
estimation of high inventory levels, when their estimate is 
12 eggs, the actual inventory is 16 eggs (because 3 Xx 
[16]5 = 12). As a result, stockout-averse consumers con- 
tinue to repurchase a dozen egg even if they have as many 
as 16 eggs in inventory, whereas the optimal purchase 
threshold for them is 12 eggs. Figure 1 illustrates the effects 
of estimation biases for these households ın terms of 
spoilage and unmet demand over the first eight weeks of the 
year. Over the course of one year, Table 1 shows that esti- 
mation biases cause stockout-averse consumers to purchase 
more (+7%, or 36 eggs), to increase the amount of spoilage 
strongly (+225%, or 27 eggs), and to reduce unmet demand 
only slightly (-33%, or 2 eggs). 

For overstocking-averse consumers, because of the 
overestimation of low inventory levels, when their estimate 
is 6 eggs, the actual inventory is 4 eggs (because 3 x [4]5 = 
6). As a result, overstocking-averse consumers forgo repur- 
chasing a dozen eggs even when they have as few as 4 eggs 
in inventory, whereas the optimal purchase threshold for 
them is 6 eggs. Figure 2 illustrates the effects of estimation 
biases for these households in terms of spoilage and unmet 
demand over the first eight weeks of the year. Over the 
course of one year, Table 1 shows that estimation biases 
cause overstocking-averse consumers to’ buy less (-9%, or 
48 eggs), to increase unmet demand strongly (+118%, or 52 
eggs), and to reduce spoilage slightly (—2396, or 12 eggs). 

Overall, the simulation shows that biased inventory esti- 
mations can significantly increase both spoilage and unmet 
demand, depending on whether consumers are averse to 
stockouts or to overstocking. The underestimation of high 
inventory levels causes stockout-averse households to 
repurchase when their inventory is too high, thus leading to 
a significant increase in spoilage. Conversely, the overesti- 
mation of low inventory levels prevents overstocking-averse 
consumers from repurchasing when their inventory is too 
low, thus causing a significant increase in unmet demand. In 
addition, Table 1 shows that biased inventory estimates 
increase the variance of inventory and increase shopping 
and storage inefficiencies (the sum of units spoiled because 
of overstocking and those not consumed because of stock- 
outs) for all households. In the “General Discussion” sec- 
tion, we show how these effects provide marketers with new 
opportunities for increasing consumption and for improving 
the targeting of sales promotions. 
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FIGURE 1 
How Biased Inventory Estimates Influence Daily Inventory, Spoilage, and Unmet Demand for 
Stockout-Averse Households (Purchase Threshold = 12 Units) 


A: With Unbiased Inventory Estimates 
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B: With Blased Inventory Estimates 
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TABLE 1 
Simulation Results: How Blased Inventory Estimates Influence Purchase Quantity, Spollage, Unmet 
Demand, and Dally Inventory 





Stockout-Averse Households 
(Purchase Threshold = 12 Eggs) 


Unblased Blased 

Inventory (Actual PT = (Actual PT = 
Estimations 12 Eggs) 16 Eggs) 
Demand 539 539 
Purchase quantity 552 588 
Spoilage 12 39 
Unmet demand 6 usd 4 
Spoilage plus unmet 

demand 18 43 
Daily inventory (M) 11.1 14.0 
Dally inventory (SD) 4.8 5.4 


Overstocking-Averse Households 
(Purchase Threshold = 6 Eggs) 


Unblased Blased 
(Actual PT = (Actual PT z 
Difference 6 Eggs) 4 Eggs) Difference 
0 539 539 0 

+7% 552 504 —9% 
+225% 53 41 —2396 
—33% 44 96 +118% 
+139% 97 137 +41% 
+27% 7.9 6.6 —1696 
+13% 4.1 43 +4% 


a 

Notes: Spoilage is the annual number of eggs discarded. The discard threshold Is 14 days for stockout-averse consumers and 9 days for over- 
stocking-averse consumers. Unmet demand ıs the number of eggs that were not consumed because of stockouts. The simulation 
assumes that consumers purchase a dozen egg if thelr Inventory estimate on the day of thelr weekly shopping trip is below the pur- 
chase threshold (PT). Because of the underestimation of high inventory levels, the actual purchase threshold for biased stockout-averse 
households is too high (16 versus 12 eggs). Because of the overestimation of low Inventory levels, the actual purchase threshold for 
blased overstocking-averse households Is too low (4 versus 6 eggs). 


Experiment 1: How Inventory Size 
and External Anchors Influence 
Inventory Estimations: 


Procedure 


The objective of Experiment 1 is to test how external 
anchors and inventory size influence inventory estimations 
(we test the effects of internal anchors and of inventory 
salience in Experiment 2). To achieve this objective, Exper- 
iment 1 used a mixed design with one within-subject factor 
with four levels (one, three, seven, or nine units in inven- 
tory), one between-subject factor with three levels (no 
external anchor, low external anchor, or high external 
" anchor), and two replications (two different products) for 
each level of inventory. 

Participants first examined a color picture of a pantry 
containing 8 target products and 13 other products in differ- 
ent quantities. The pantry contained one unit of 2 target 
products, three units of 2 other target products, seven units 
of 2 other target products, and nine units of 2 other target 
products. Participants were later asked to recall the inven- 
tory level of the 8 target products (Coca-Cola cans, Life- 
savers candy, Smucker's jam, Campbell's soup, Charmin 
toilet tissue, Crest toothpaste, Carr's crackers, and Heinz 
tomato sauce). We rotated the products across the four 
inventory-level conditions following a Latin-square design. 
We selected the inventory levels of the target products on 
the basis of a prestudy of 21 consumers, who indicated that 
this range (one to nine) would straddle the average inven- 
tory of these products for almost al] respondents'(the typical 
average inventory for these products is three or four units). 

The participants were 216 undergraduate students who 
were awarded extra credit participation points for a course 
they were attending. We told them that the aim of the study 
was to measure their liking of different types of teas. Con- 
sistent with this, we first asked them to evaluate the three 


brands of tea that were present in the pantry. To direct their 
attention toward the other products in the pantry (including 
the eight target products), we asked the participants to esti- 
mate the overall quality of the brands displayed and to indi- 
cate whether some of these products should be stored in a 
refrigerator rather than a pantry. We then asked each partici- 
pant to return the first booklet containing the pantry picture. 
After a brief distracter task, we gave participants a second 
booklet containing a typical anchoring manipulation. We 
asked participants in the high-external-anchor condition 
(low-external-anchor condition) to indicate whether the 
number of units of each of the eight target products was 
above or below nine (one). We then asked them to estimate 
the number of units of the target products. Participants in 
the control condition estimated the number of units of the 
target products. Finally, we asked all the participants to 
write down how they had estimated the inventory levels for 
these products. 


Hesults 


We consider first the estimations made in the control (no 
external anchor) condition. Figure 3 shows that the mean 
estimated inventory was well below reality when there were 
seven or nine units in inventory, close to reality when there 
were three products in inventory, and slightly above reality 
when there was only one product in inventory. To test size 
effects in the control condition formally, we estimated the 
following linear regression with OLS: 


(5) In(EST,,) = & + B x In(ACT,,) + XG, X CAT, +e, 


where EST, is the estimated inventory of product j by par- 
ticipant i, ACT, is the actual inventory, CAT, are seven 
binary variables accounting for product differences, and € is 
the error term. 

As we expected, the power exponent was statistically 
below 1 (a = b = .43, t-test of difference from 1 = 9.1, p < 
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FIGURE 2 
How Biased Inventory Estimates Influence Daily Inventory, Spollage, and Unmet Demand for 
Overstocking-Averse Households (Purchase Threshold = 6 Units) 


A: With Unbiased Inventory Estimates 
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B: With Biased Inventory Estimates 
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FIGURE 3 
Experiment 1: Effects of External Anchors on 
Inventory Estimations (Geometric Means, 
Confidence Intervals, and Model Predictions) 
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.001), and the intercept was statistically above 0 (a = 
In(a) 2 .58, t 2 11.8, p « .001), which indicates that the 
intercept of the power model (a — 1.79) is statistically larger 
than 1. Five of the seven product-specific intercepts were 
statistically significant. In addition, we compared the fit of 
the power model with that of an alternative linear model 
(EST, = & + B x ACT; + 20, X CAT, + €). As we expected, 
the R-square was stronger for the power model (R2 = .36) 
than for the linear model (R? = .33), and the mean average 
percentage error (MAPE) was lower for the power model 
(MAPE - .58) tban for the linear model (MAPE - .74; 
paired t-test — 21.3, p « .001). In summary, as we predicted, 
consumer inventory estimations in the control condition fol- 
low a compressive power function of actual inventory. 

We now turn to the analysis of the anchoring manipula- 
tion. As a manipulation check, we asked two coders, both of 
whom were unaware of the objective of the experiment, to 
classify consumers’ retrospective protocols into three cate- 
gories. The “internal anchor” category included protocols 
that referred to how much of the product a person usually 
has in inventory. The “visual memory” category included 
protocols that recalled the picture of the pantry. The “other” 
category included the remaining protocols, such as “I just 


guessed.” Consistent with the anchoring literature, which 
shows that people are unaware of the effects of external 
anchors (Mussweiler, Strack, and Pfeiffer 2000), no proto- 
col mentioned the anchoring manipulation. An analysis of 
the 166 usable protocols shows that the frequency of proto- 
cols mentioning internal anchors was higher in the control 
condition (M = 22.1%) than in the two external anchor con- 
ditions (M = 7.1%; X? = 7.8, p < .01). This suggests that, as 
we predicted, consumers were more likely to use internal 
anchors when external anchors were not contextually 
salient. 

Figure 3 shows that the anchoring manipulation shifted 
inventory estimations toward the anchor but did not influ- 
ence the relationship between estimated and actual inven- 
tory (the power curves are parallel). To test anchoring 
effects formally, we estimated the following regression: 


(6) In(EST,,) = a + B X In(ACT;,) 
+ 6 x EXTANCHI, + y x EXTANCHS, 
+ À x EXTANCHI, x In(ACT;,) 
+ 8 x EXTANCHO, x In(ACT;) + 2.0, x CAT, + €,, 


where EST, is the estimated inventory for product j by par- 
ticipant i, ACT, is the (geometric) mean-centered actual 
inventory for product j and participant i? EXTANCHI, is a 
binary variable taking the value of % if participant i was in 
the low-external-anchor condition (anchor = 1) and —4 oth- 
erwise, EXTANCHS, is a binary variable taking the value of 
^ if participant 1 was in the high-external-anchor condition 
(anchor = 9) and —4 otherwise, and CAT, are seven binary 
variables accounting for product differences (j = [1, ..., 7]). 
The simple effects of both external anchors were statisti- 
cally significant and in the expected direction (for the 
anchor on one unit, 6 = —07, t = —2.0, p < .05; for the 
anchor on nine units, y = .27, t = 7.8, p < .01), indicating 
that inventory estimations were assimilated toward the 
anchors. Consistent with the model, the interaction parame- 
ters were not statistically significant (A = .03, t= .6, p = .52, 
and 0 = .2, t = .4, p = .68), indicating that the degree of 
compression of inventory estimations was the same across 
the three conditions. Finally, the intercept was statistically 
different from 0 (a = 1.21, t = 78, p < .01). This indicates 
that participants significantly overestimated their inventory 
when it was only one unit (in the control condition). Four of 
these product-specific intercepts were statistically 


significant. 


2In practice, we divide actual inventory by its mean across 
inventory conditions (five units). In a moderated regression, mean 
centering the variables involved in an interaction allows us to esti- 
mate the main effects of the variables involved ın an interaction 
when the other variables involved in the interaction are at their 
mean. For example, it allows us to estimate the effects of anchor- 
ing on inventory estimations when inventory level is at its average 
level (five units). We use geometric mean centering rather than 
arithmetic mean centering to be consistent with the power func- 
tional form and to prevent negative values whose logs cannot be 
computed. 
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Discussion 


Experiment 1 shows that inventory estimations are assimi- 
lated toward external anchors and are adjusted for the actual 
inventory level through a nonlinear compressive power 
function. Experiment 1 also shows that the rate of adjust- 
ment for the actual size of the inventory remains constant, 
regardless of which reference levels serve as anchors. In 
other words, external anchors do not influence the exponent 
of the power function and thus do not influence the quality 
of adjustments from the reference level. Finally, the proto- 
cols provide indirect evidence that consumers rely on inter- 
nal anchors when external anchors are not salient. However, 
because we did not measure the value of these internal 
anchors, we cannot test the effects of internal anchors or 
determine whether an average home inventory can serve as 
one. Experiment 2 further tests the model by examining the 
effects of internal anchors and by directly manipulating 
product salience. 


Experiment 2: How Internal 
Anchors, Inventory Salience, and 
Inventory Size Influence Inventory 

Estimations 


Procedure 


Experiment 2 used the same procedure and stimuli as 
Experiment 1 but with three important differences. First, we 
did not manipulate external anchors but asked each partici- 
pant to indicate the average inventory of the eight target 
products in their own house (the hypothesized internal 
anchor). Second, we manipulated the perceptual salience of 
the target products in three combined ways. Salient prod- 
ucts were located on the top or middle shelf of the pantry 
(as opposed to the bottom shelf), were separate from other 
products (rather than being crowded together with them), 
and were given multiple facings when available in more 
than one unit (rather than being stacked together in an over- 
Japping fashion). We assigned each of the eight target prod- 
ucts to one of the eight conditions created by the two (high 
or low salience) x four (one, three, seven, or seven units) 
design. As in Experiment 1, we rotated the products across 
the eight inventory size and salience conditions following a 
Latin-square design. Third, as a means to check the effec- 
tiveness of the salience manipulation, we asked participants 
to rate how visible each product was in the picture. 

The participants were 150 undergraduate students who 
were awarded extra credit participation points for a course 
they were attending. Manipulation checks show that the 
salience manipulation was successful. An analysis of vari- 
ance indicated that products in the high-salience condition 
were rated as “more visible" (M = 6.75 on a nine-point 
scale, anchored by 1 = “completely disagree" and 9 = 
"completely agree") than those in the low-salience condi- 
tion (M = 5.89; F(1, 1090) = 24.6, p « .001). However, nei- 
ther the inventory size nor the salience manipulation influ- 
enced the home inventory reported for each product 
(F(3, 1090) = .58, p = .63, and F(1, 1090) = .59, p = 44, 
respectively). This shows that the average home inventory 
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level was not estimated from the inventory level or the 
salience of the product in the study. 


Results 


Figure 4, Panel A, shows the mean estimated inventory as a 
function of the inventory and salience manipulations. As we 
expected, increasing salience made estimations less com- 
pressive and more accurate. To examine the effects of inter- 
nal anchors, we assigned participants to a high- or low- 
internal-anchor group on the basis of their self-reported 
average home inventory level for each product. Across the 
eight products, the average home inventory in the high- 
internal-anchor group was M = 8.5 units versus M = .9 units 
in the low-internal-anchor group. Figure 4, Panel B, shows 
mean estimated inventory as a function of actual inventory 
for both internal anchor groups. As we expected, estima- 
tions in the high-internal-anchor condition were higher than 
those in the low-internal-anchor condition, regardless of 
inventory level, but the two estimation curves remained par- 
allel, suggesting that internal anchors (unlike salience) did 
not influence the elasticity of adjustments. 

To test our predictions that salience reduces the degree 
of compression of estimations and that estimations shift 
toward internal reference levels, we estimated the following 
regression: 


T)  In(EST,) =a +B x In(ACT,) + 8 x INTANCH, 
+ YX SAL, + X x INTANCH; x In(ACT,) 
+ 0 x SAL, x In(ACT,,) + Xo; x CAT; + &,, 


where EST; is the estimated inventory for product j by par- 
ticipant i, ACT, is the geometric mean-centered actual 
inventory for product j and participant i, INTANCH, is a 
binary variable taking the value of % if the home inventory 
of participant i for product j is in the top 5096 of the distrib- 
ution for this product and —4 otherwise, SAL, is a binary 
variable taking the value of A if product j is in the high- 
salience condition for participant 1 and —4 otherwise, and 
CAT, are seven binary variables accounting for product dif- 
ferences (j = [1, ..., 7]). 

As in Experiment 1, the power exponent was statisti- 
cally below 1 (B = .41, t-test of difference from 1 = —28.9, 
p < .001), indicating that the rate of adjustment increases 
more slowly than the actual inventory size. The coefficient 
capturing the simple effect of the internal anchor was posi- 
tive and statistically significant (6 = .09, t = 2.3, p « .05), 
but its interaction with the actual inventory level was not 
(A = .02, t= .5, p = .61). This shows that inventory estima- 
tions were assimilated toward the average home inventory 
for that product but that this internal anchor did not change 
the rate at which estimations were adjusted for the actual 
inventory level. The main effect of the salience manipula- 
tion was positive and statistically significant (y = .22, t = 
6.5, p < .01), and its interaction with the actual inventory 
level was also positive and statistically significant (0 — .15, 
t = 3.7, p < .01). Because of the significant interaction 
between the actual inventory level and salience, the effects 
of salience are not statistically significant when the inven- 
tory level is low (one or three units). This shows that adjust- 
ments are more sensitive when inventory is salient than 
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when it is not. Finally, the intercept was statistically differ- 
ent from 0 (a = 1.23, t = 71.3, p < .01), and four of the 
seven product-specific intercepts were statistically signifi- 
cant, replicating the results of Experiment 1. 


Discussion 


Experiment 2 shows that internal anchors, like the external 
ones, shift estimations toward the reference level but do not 
change the rate at which estimations are adjusted for the 
actual inventory level. Experiment 2 also shows that the 
salience of the product in the pantry increases the rate of 
adjustment. Estimations of less salient products are almost 
entirely driven by the reference inventory (the power curve 
is almost flat). In contrast, estimations of salient products 
are significantly influenced by the actual inventory level 
(the slope of the curve is close to one). 

Taken together, Experiments 1 and 2 provide strong evi- 
dence in support of the model of inventory estimations. Yet 
although judgment and estimation biases are often found in 
a laboratory setting, they can be less apparent, or even 
negated, in the field, in which there is less variation in 
inventory size, reference levels, and salience and consumers 
may bave greater experience with the estimation task. In 
Field Study 1, we investigate the robustness of these effects 
by measuring actual and estimated inventory and product 
salience in six categories. We also study whether estimated 
inventory is a better predictor of category purchase inci- 
dence than actual inventory. In Field Study 2, we test the 
robustness of the model in 23 new categories, and we exam- 
ine whether the degree of compression of a given category 
is associated with the degree of impulse buying, the ease of 
stockpiling, and the promotional elasticity of that category. 


Field Study 1: Inventory Salience 
and Estimation Accuracy 


Procedure 


Over two periods of five days each, we intercepted 121 
adult consumers in four different central Illinois supermar- 
ket parking lots as they were exiting the supermarket, and 
we asked them to estimate their current inventory of six 
product categories in exchange for $9. After completing 
their estimates, participants were given a preaddressed, 
stamped envelope and a brief questionnaire that asked them 
to check their actual inventory levels for these categories 
when they returned home the same day. The same question- 
naire also asked them to rate the visibility of each category 
in their homes by indicating their agreement with the fol- 
lowing sentence: "These [category name] are stored in a 
very visible place." They rated their agreement on a nine- 
point scale, anchored by 1 = “strongly disagree" and 9 = 
"strongly agree." Finally, participants indicated whether 
they had purchased any products from these six categories 
during their latest shopping trip. 

We chose the six categories (apples, canned tuna, hot 
dogs, potatoes, tea bags, and tomatoes) on the basis of a 
pretest involving 16 consumers, which showed that they 
estimated their inventory using discrete package units (as 
opposed to continuous measures, such as ounces) and that 
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there was enough variance in the way different households 
stored them to expect salience effects (e.g., some consumers 
keep tomatoes in a salient container on the kitchen counter, 
whereas others store them in the refrigerator where they are 
less visible). To verify the quality of the measures of actual 
inventory levels, we visited 16 households the day after they 
sent us their pantry checks and measured the number of 
units for these six categories ourselves. When we excluded 
partial units and subsequently purchased units, pantry accu- 
Tacy was well over 90% for this subsample of households. 
Of the 121 consumers intercepted, 90 (74.4%) returned 
their questionnaire within one week, and we included them 
in the analysis. 


Results 


Purchase incidence. Part of our motivation for studying 
biases in inventory estimations is the assumption that esti- 
mations, not actual inventory, drive important decisions, 
such as whether to repurchase from a given category during 
a supermarket shopping trip. Field Study 1 enables us to test 
this assumption by comparing the association between cate- 
gory purchase incidence on the one hand and estimated or 
actual inventory on the other hand. 

We measured purchase incidence using a variable 
(REFILL) that took the value of 1 if at least one purchase 
was made from category j by participant i during the super- 
market shopping trip that occurred just before the estima- 
tion and 0 if otherwise. As we expected, the correlation 
between purchase incidence and estimated inventory was 
negative and statistically significant (r[REFILL;;, EST,] = 
—10, p < .05), whereas the correlation between purchase 
incidence and actual inventory was not statistically different 
from zero (r[REFILL,,, ACT,] = —03, p = .52). To test 
whether the predictive power of estimated and actual inven- 
tory is statistically different, we used a repeated measures 
analysis of variance with EST, and ACT, as the within- 
subjects measures and REFILL, as the between-subjects 
factors. As we expected, the interaction between REFILL, 
and the within-subjects factor (estimated versus actual 
inventory) was statistically significant (F(1, 441) = 3.8, p< 
.05).3 This result implies that purchase incidence is more 
strongly associated with estunated inventory than with 
actual inventory. 


Inventory estimation biases. We found that the mean 
estimated inventory was within 1096 of the mean actual 
inventory level for the six categories. However, this aggre- 
gate accuracy is not a result of consumers being accurate. 
Rather, it is a result of underestimations compensating for 
overestimations. Across the six categories, only 49% of 
inventory estimations were accurate, whereas 28% were 





3The main effect of the within-subjects factor was not statisti- 
cally significant (F(1, 441) = 1.6, p = .20), indicating that mean 
estimated inventory is similar to mean actual inventory. The main 
effect of REFILL, was also not statistically significant 
(F(1, 441) = 2.0, p = .16), which is not surprising, given that it 
measures the average effect of a nonsignificant association 


(between o M ACT,,) and of a significant but weak asso- 
ciation (between FILL, and EST,). 
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underestimations and 2396 were overestimations. For exam- 
ple, Figure 5 shows that the mean inventory estimation of 
tea bags and tomatoes is slightly overestimated at low levels 
(the two lowest quartiles) but is strongly underestimated at 
high levels (the two highest quartiles). Figure 5 also shows 
that these estimates were less compressive and, thus, more 
accurate when tomatoes and tea bags were highly salient 
than when they were not. 

lo test the statistical significance of size and salience 
directly, we estimated the following regression: 


(8) I(EST,) = a+ Bx In(ACT,) + 8 x SAL, 
+ YX SAL, x In(ACT,,) + Zo; x CAT, + &, 


where EST, is the estimated inventory for category j by par- 
ticipant i, ACT, is the actual inventory for category j and 
participant i (it is not mean centered so as to be able to test 
the intercept when the actual inventory is equal to 1), SAL,, 
is a mean-centered binary variable measuring the visibility 
of category j in the pantry of participant i (it is categorized 
using a median split, but the results are unchanged if we use 
the original continuous measure), and CAT, are five binary 
variables accounting for product differences (j = [1, ... , 5p. 
As we expected, the intercept was statistically larger than 
zero (Q = .39, t= 6.2, p < .01), indicating that the intercept 
of the power model is larger than 1 (a = e = 1.48).-Consis- 
tent with the hypothesized size effects, the exponent was 
statistically lower than 1 (b = B = .77, t-test of difference 
from 1 = —7.5, p < .01), indicating that inventory estima- 
tions are also compressive in the field. The main effect of 
salience was not statistically significant (5 = —.09, t = —.9, 
p = .36), which means that as in Experiment 2, salience did 
not influence estimations when there was only one unit in 
inventory. Consistent with the hypothesized salience effects, 
the interaction between salience and actual inventory was 
positive and statistically significant (y = .08, t = 2.0, p< 
.05). As in Experiment 2, inventory estimations were more 
compressive when the category was not particularly visible 
in the pantry, and they were more accurate when the cate- 
gory was visible in the pantry. 


Discussion 


Field Study 1 makes two contributions. First, it shows that 
estimated inventory is more strongly related to category 
purchase incidence than is actual inventory. Importantly, 
this result cannot be explained by mere-measurement 
effects, because inventory estimations were measured as 
consumers were exiting the store, after their purchases had 
been made. In addition, our decision to measure purchase 
incidence with the second questionnaire (completed at 
home) and not during the parking lot interview (when 
inventory estimations were collected) reduces the likelihood 
that these results are driven by self-presentation biases (i.e., 
that consumers reported inventory estimations consistent 
with their purchase decisions). 

Second, Field Study 1 supports the experimental results 
by showing that inventory estimations follow a compressive 
function of actual inventory levels, even when they are 
made by adult consumers for products that they have just 
bought. In addition, it shows that consumers adjust for 
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actual inventory levels more when the products are salient 
(stored in a visible place) than when they are less salient. 
A limitation of Field Study 1 is that it did not measure 
the average product inventory for each household. There- 
fore, its findings should be interpreted as showing that 
respondents with lower actual inventory tend to overesti- 


mate their inventory, whereas respondents with higher 
inventory levels tend to underestimate them. Naturally, the 
average inventory across respondents may not be represen- 
tative of the average inventory of either very light or very 
heavy buyers. However, note that this heterogeneity in aver- 
age inventory level reduces the likelihood of detecting 
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inventory biases in the population (because extreme inven- 
tory levels are actually average levels for these extreme 
consumers). 

Field Study 1 raises three further questions: (1) Are 
these results generalizable to nonfood products? (2) Do 
average inventory levels serve as estimation anchors in the 
field? and (3) Is the degree of compression of inventory 
estimations related to managerially important category 
characteristics? We address the first question by conducting 
a large-scale field study in which we measure estimated and 
actual inventory for 23 new food and nonfood categories. 
We address the second question by incorporating data on 
average inventory level (rather than on current actual inven- 
tory level), which we collected in an additional survey. 
Finally, we address the third question by studying the asso- 
ciation between the power exponent, which measures the 
degree of compression in each category, and three category 
characteristics, which we obtained from secondary data, 
namely, the degree of impulse buying in the product cate- 
gory, the ease of stockpiling the product, and the average 
promotional brand elasticity of the brands in the product 
category. 


Field Study 2: How Estimation 
Biases Vary Across Products 


Procedure 

Field Study 2 used the same procedure as Field Study 1 to 
measure biases in inventory estimations. In Field Study 2, 
however, we measured inventory levels in ounces rather 
than in units for seven products (soft drinks, coffee, sham- 
poo, mayonnaise, laundry detergent, dishwashing detergent, 
and ketchup) for which inventory levels are typically mea- 
sured in ounces because of the large variations in package 
sizes. We measured inventory levels for the other 16 prod- 
ucts (soap, canned soup, spaghetti, vacuum cleaner bags, 
yogurt, toothpaste, frozen meat, eggs, frozen vegetables, 
butter sticks, canned fruit, pasta sauce, cookies, toilet tissue, 
salad dressing, and breakfast cereals) in units. Another dif- 
ference is that we did not measure inventory salience. 

To avoid respondent fatigue, we surveyed participants 
on five to eight products. To verify the accuracy of the 
actual inventory measures, we asked a subgroup of con- 
sumers to telephone one of the researchers immediately 
after they had checked their actual inventory levels. We 
called the consumers who had not telephoned by 7:30 that 
evening and reminded them to check their inventory. There 
were no systematic differences among the results of partici- 
pants who had telephoned, the results of participants who 
needed to be reminded, and the rest of the participants who 
were not contacted. To check accuracy further, we told an 
additional subgroup of consumers to keep their question- 
naire because we would pick them up the next day. During 
the pickup round, we requested permission to inspect their 
actual inventory. With the households who agreed to the 
inspection (approximately 75%), there were no full-unit 
discrepancies with the self-reported inventory. Of the 461 
consumers who participated in Field Study 2, 317 (68.7%) 
returned their questionnaire in a timely manner. Together 


130 / Journal of Marketing, October 2006 


with data we obtained in Field Study 1, we had a total of 
2185 estimations on 29 products (an average of 75 observa- 
tions per category). 

To avoid common method biases, we obtained data on 
impulse buying, ease of stockpiling, and promotional 
response from two independent sources. For impulse buy- 
ing and ease of stockpiling, we used the survey of 108 prod- 
uct categories that Narasimhan, Neslin, and Sen (1996) con- 
ducted and published in Journal of Marketing.^ This survey 
measured impulse buying by asking 100 consumers to rate 
their agreement with the following two statements: “I often 
buy the product on a whim when J pass by it in the store,” 
and "I typically like to buy this product when the urge 
strikes me." We measured ease of stockpiling by asking the 
same consumers to rate their agreement with the following 
two statements: "It is easy to store extra quantities of this 
product in my home,” and “I like to stock up on this product 
when I can." For promotional elasticity, we used data from 
the "Infoscan Topical Marketing Report," generated by 
Information Resources Inc. and published in the P-O-P 
Times (1991). This report provides an estimate of the aver- 
age percentage brand sales increase in response to a 1596 
price cut with an in-store display for 164 product categories 
based on the results of Information Resources Inc.’s Promo- 
tionScan model (Abraham and Lodish 1993) estimated on 
the checkout data of 2400 grocery stores. The category defi- 
nitions of Narasimhan, Neslin, and Sen's (1996) survey and 
of the Infoscan Report matched ours in 25 of the 29 product 
categories. We used data from categories closest to the 
remaining four (e.g., frozen side dishes for frozen 
vegetables). 


Results 


Are these results generalizable to nonfood products? As 
in Field Study 1, we found that the mean estimated inven- 
tory is similar to the mean actual inventory. With four 
exceptions (crackers, ketchup, tomatoes, and potatoes), the 
mean estimations are all within 20% of the mean actual 
inventory for the 29 products (the 6 products surveyed in 
Field Study 1 and the 23 surveyed in Field Study 2). As in 
Field Study 1, this aggregate accuracy does not indicate that 
consumers are accurate at the individual level; instead, it 
indicates that overestimations compensate for underestima- 
tions. Ácross the 29 products, only 4196 of inventory esti- 
mations are accurate (within one unit or one ounce of the 
actual inventory), whereas 3196 are underestimations, and 
25% are overestimations. 

To test the model predictions, we estimated the power 
model shown in Equation 1 for each of the 29 categories. 
Because of the low number of observations for some cate- 
gories, we used the nonlinear Levenberg—Marquardt least 
square algorithm, which allows us to incorporate observa- 


4Scott Neslin, Albert Wesley Frey Professor of Marketing in the 
Tuck School of Business at Dartmouth College, generously pro- 
vided these data. 


tions with a zero-estimated or actual inventory level (we 
obtained similar results when estimating a linearized model 
on nonzero observations with OLS). 

Table 2 shows that the power model fits the inventory 
estimation data well (the mean R? is 54%). All power expo- 
nents are below 1, and the variance from 1 is statistically 
significant at the 596 level for all 29 products. All power 
intercepts are above 1, and the difference is statistically 
significant at the 5% level for 22 of the 29 products and is 
statistically significant at the 10% level for 3 other products. 
In addition, the fit of the power model was superior to the 
fit of a linear model (EST = a + b x ACT) for almost all 
the products. Specifically, the R-square of the power model 
was higher than the R-square of the linear model for 23 of 
the 29 products, and the average fit improvement was 3% 
and was statistically significant. Similarly, the MAPE was 
lower for the power model than for the linear model for 28 
of the 29 products and was statistically Jower for the power 
model than for the linear model across all observations 


inventory estimations for these 29 products follow a com- 
pressive power function of actual inventory level. 

Using the estimated model parameters, we computed 
the crossover inventory level (e%/(! - 8)) for each product. As 
we expected, the crossover inventory level was within the 
range of observed actual inventory levels for all 29 prod- 
ucts. Excluding one outlier (tea bags, which has high inven- 
tory levels when measured in units), we found that low 
inventory levels tended to be overestimated but that as 
actual inventory reached four to six units or 24—42 ounces, 
estimations tended to become accurate. However, when 
actual inventory levels were above these average levels, 
they tended to be strongly underestimated. 


Do average inventory levels serve as anchors in field 
estimations? We asked 37 adult consumers, similar to those 
involved in Field Studies 1 and 2, to estimate their average 
inventory level (as opposed to their current inventory level) 
for these 29 products. We then estimated the following con- 
strained model: 


(MAPE ower = -44 versus MAPEiinear = .61; paired t = 10.5, 
p « .001). All these results support our prediction that (9) EST;/AVG, = a x (ACT,/AVG))P + €,,, 
| TABLE 2 


Fleld Studies 1 and 2: Category-Level Power Regression Results (Estimates, Standard Errors, Fit, and 
Predicted Crossover Inventory Level) 


Category Intercept 

Apples 1.23" (.13) 
Butter sticks 2.03* (.68) 
Canned fruit 1.52** (.25) 
Canned soup 1.81** (.22) 
Canned tuna 1.61** (.22) 
Cereals 1.35** (.16) 
Coffeea 2.61** (.92) 
Cookies 1.46** (.13) 
Dishwashing detergenta 3.40* (1.60) 
Eggs 3.08** (1.09) 
Frozen meat 2.34** (.71) 
Frozen vegetables 1.74** (.28) 
Hotdogs 2.45** (.72) 
Ketchupe 3.08** (.91) 
Laundry detergenta 3:51" (1.71) 
Mayonnalse® 9.75" (1.27) 
Pasta sauce 1.14 (.12) 
Potatoes 1.52 (.47) 
Salad dressing 1.36 (.23) 
Shampoo8 4.87** (2.13) 
Soap 1.79** .21) 
Soft drinks@ 14.36 (11.64) 
Spaghetti 1.43** (11) 
Tea bags 2.07** (.61) 
Toilet paper 1:57” (.24) 
Tomatoes 1.35 (.29) 
Toothpaste 1.36** (.12) 
Vacuum cleaning bags 1.78™* (:26) 
Yogurt 3.29** (.60) 


*Statistically different from 1 at the 1096 level (one-tailed). 
**Statistically different from 1 at the 5% level (one-talled). 
alnventory measured in ounces. 


Exponent r(EST, ACT) N ACT* 
83**  (.04) .93 89 3.4 
63**  (.14) .63 51 6.8 
Ti"  (.08) .86 52 4.2 
69**  (.06) .78 155 6.8 
66*  (.07) .78 98 4.1 
77™  (.06) .88 53 3.7 
73"  (.09) .71 56 34.9 
42"  (.06) .75 53 2.0 
61*  (.11) .60 55 23.1 
54**  (.14) .61 33 11.5 
53"  (.13) .61 39 6.1 
61"  (.08) 80 52 4.1 
57**  (.09) .68 87 8.0 
5/* 3 (.08) .69 55 13.7 
71**  (.09) .67 54 75.9 
60**  (.09) 73 56 27.2 
84**  (.08) .85 53 2.3 
78**  (.10) 74 88 6.7 
66* (11) .70 53 2.5 
55**  (.11) 55 57 33.7 
68**  (.06) .74 147 6.2 
41™* (.16) .50 55 91.5 
53**  (.05) .69 159 2.1 
82™*  (.05) .83 85 56.9 
81**  (.05) .91 52 10.7 
69**  (.10) .71 85 2.6 
75™  (.06) .89 42 3.4 
59**  (.06) .66 140 4.1 
42*  (.07) .66 131 7.8 


Notes: EST = estimated Inventory, ACT = actual inventory, N = number of estimations, and ACT* = crossover inventory (mean estimated inven- 


tory i8 equal to mean actual inventory). 
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where EST,, is the estimated inventory for product j by par- 
ticipant i at the time of the study, ACT; is the actual inven- 
tory, and AVG; is the average inventory for product j mea- 
sured in the additional survey. This model is a simple 
rewriting of the basic power model in Equation 1, in which 
estimated and actual inventory are expressed as a proportion 
of the average inventory of the product rather than as the 
original units. This transformation influences the intercept 
but leaves the power exponent and the model fit unchanged 
from the values shown in Table 2. 

If people use the average inventory level as an anchor, 
inventory estimations are unbiased when actual inventory is 
at its average level, and therefore the mean estimated inven- 
tory is equal to the mean actual inventory level when actual 
inventory is at its average level. In Equation 9, this occurs 
only if the intercept (œ) is equal to 1 (because if ACT, = 
AVG;, EST, = AVG, = ACT). As expected, we found that 
the intercept of this new model was not statistically differ- 
ent from 1 in 25 of the 29 categories. We obtained similar 
results when using the mean actual inventory level, which 
we computed across participants in Field Studies 1 and 2 as 
the category average inventory rather than the mean 
obtained from the new survey. Overall, these results are 
consistent with the hypothesis that average inventory levels 
serve as internal anchors in the absence of salient external 
anchors. 


Is the degree of compression of inventory estimations 
related to managerially important category characteristics? 
As we show in Table 2 and Figure 5, some products, such as 
yogurt, exhibit strong compression and therefore are some- 
what inelastic to actual changes in inventory. For example, 
given that the power exponent of yogurt is .42, if its inven- 
tory increases by 50%, estimations increase by only 19% 
(see Figure 5). Figure 5 also shows that other products, such 
as toilet paper, exhibit little compression and therefore are 
relatively accurate at all levels of inventory. Because the 
power exponent of toilet paper is .81, if its inventory 
increases by 5096, estimations increase by almost the same 
percentage (39%). We expect that these category differ- 
ences in the rate of adjustment to actual inventory are linked 
to three key category characteristics: (1) the likelihood of 
impulse buying, (2) the ability to stockpile, and (3) the aver- 
age brand promotional elasticity in that category. 

The actual inventory of products that are likely to be 
bought on impulse is apt to fluctuate more and in a less pre- 
dictable way than the actual inventory of products whose 
purchases are planned. As a result, we expect that estima- 
tions will be more compressive and, therefore, less accurate 
for products with a high degree of impulse purchasing. In 
contrast, the actual inventory of products that are easy to 
stockpile should be easier to monitor than the inventory of 
products that are difficult to stockpile. We also expect that 
estimations will be less compressive for products that are 
easy to stockpile. Finally, consumers are more likely to 
switch to another brand or to stockpile in response to a pro- 
motional display when they have an accurate understanding 
of their inventory than when they do not know how much of 
the product is left in their inventory and therefore hesitate to 
buy the promoted brand (Wansink, Kent, and Hoch 1998). 
Consumers who have no idea about how much of the prod- 
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uct they have in inventory are more likely to pass a promo- 
tion and follow their habitual purchasing pattern for fear of 
overstocking. Consequently, we expect estimations to be 
more compressive for categories with a high average pro- 
motional elasticity. 

We tested these hypotheses by estimating the following 
model with the nonlinear Levenberg-Marquardt least 
square algorithm: 


(10) | EST, =a x BMP; x B55TO, x B5PROM, x (ACT, )5 
xq oco oe ROMS, 


where EST; is the estimated inventory for category j by par- 
ticipant i, ACT, is the actual inventory for category j and 
participant i, IMP, is a mean-centered binary variable mea- 
suring the degree of impulse buying in the category, STO, is 
a mean-centered binary variable measuring the ease of 
stockpiling the product category, and PROM; is a mean- 
centered binary variable measuring the average promotional 
elasticity in the category (we dichotomized the three binary 
variables with a median split). 

As we expected, the parameter capturing the interaction 
between ACT and IMP was statistically below 1 (y, = .93, 
asymptotic 95% confidence interval [CI] = [.85, .99]), indi- 
cating a lower power exponent (i.e., less accuracy) for cate- 
gories bought on impulse. Conversely, the parameter cap- 
turing the interaction. between ACT and STO was 
statistically above 1 (y, = 1.15, 95% CI = [1.04, 1.26]), indi- 
cating a higher power exponent (i.e., more accuracy) for 
categories that are easy to stockpile. Finally, the parameter 
capturing the interaction between ACT and PROM was 
above 1 but was statistically significant only at the 10% 
level (y3 = 1.07, 90% CI = [1.00, 1.15]), indicating a 
slightly higher power exponent (i.e., more accuracy) for 
categories with high average promotional elasticity. In sum- 
mary, these analyses supported our hypotheses that inven- 
tory estimates are particularly biased for product categories 
that are bought on impulse, are difficult to stockpile, and 
have a low average promotional elasticity. 


General Discussion 


Consumers' biases in inventory estimations can distort stor- 
age and shopping decisions and can lead to overstocking 
and spoilage or to stockouts and unment demand. To under- 
stand the origin of these biases, we developed a model of 
how consumers estimate the amount of product they have in 
inventory. This model predicts that (1) unless an external 
reference level is salient, consumers anchor their estima- 
tions on their average inventory level and insufficiently 
adjust for the actual inventory level; (2) the adjustment from 
the average inventory follows an inelastic power function; 
and (3) the adjustment is more elastic and, thus, more accu- 
rate when inventory is perceptually salient. Drawing on this 
model, we simulated the effects of biased inventory esti- 
mates on the amount of spoilage and unmet demand for 
consumers who are averse to stockouts or to overstocking. 
We then tested the predictions of the model for 8 products 
in two laboratory experiments and for 29 products in two 
field studies. 


The key results of these studies are as follows: First, 
contrary to the assumptions of practitioners and modelers, 
consumers’ estimates of their inventory are better predictors 
of category purchase than their actual inventory. Second, 
although individual inventory estimations are seldom accu- 
rate, the mean estimated inventory of a product category is a 
valid estimation of its mean actual inventory because under- 
estimations tend to compensate overestimations. Third, in 
the absence of salient external reference levels, consumers 
anchor their inventory estimations on their average inven- 
tory level and adjust insufficiently for the actual size of the 
inventory. Fourth, adjustments from the reference level fol- 
low an inelastic psychophysical function (a power function 
with an exponent below 1), and as a result, the quality of the 
adjustments deteriorates as the inventory deviates from the 
reference level. In short, below-average inventory levels are 
slightly underestimated, average inventory levels are accu- 
rately estimated, and above-average inventory levels are 
strongly underestimated. Fifth, inventory estimations are 
more elastic (more sensitive to actual changes in inventory) 
and, thus, more accurate when inventory is salient than 
when it is not. Sixth, the underestimation of high inventory 
levels causes stockout-averse households to repurchase 
when their inventory is too high, thus leading to a signifi- 
cant increase in spoilage. Conversely, the overestimation of 
low inventory levels prevents overstocking-averse con- 
sumers from repurchasing when their inventory is too low, 
thus causing a significant increase in unmet demand. 
Finally, the least elastic and, thus, least accurate estimations 
are those of product categories that are bought on impulse, 
are difficult to stockpile, and have a low promotional 
elasticity. 


Implications for Further Research 


By identifying three possible sources of biases, our model 
contributes to the literature on estimations, which has 
focused primarily on documenting biases rather than on 
explaining why these biases occur and what factors moder- 
ate them. For example, despite the considerable evidence 
that people underestimate the duration of past and future 
events when their duration is greater than five minutes and 
overestimate them when they last less than five minutes, 
there is no consensus about why this occurs (Lee, Hu, and 
Toh 2000; Roy, Christenfeld, and McKenzie 2005). Still, 
this pattern of results, even in this seemingly unrelated con- 
text, can be readily explained by two features of our model: 
(1) People anchor estimations on average duration, and (2) 
adjustments for deviations from the average are inelastic. 
For researchers interested in magnitude estimations, our 
basic finding that deviations from the reference level follow 
a compressive power function has important implications. 
First, it enables them to predict the magnitude, not just the 
direction, of these biases. Second, it enables them to predict 
the average estimation bias for any duration, not just for 
those already measured. Third, it can lead to new testable 
predictions, such as the magnitude of estimation biases will 
become larger as the duration increases, even when the 
magnitude of the bias is measured as a percentage deviation 
from reality (for a proof of this, see Chandon and Wansink, 


in press). Fourth, this model can enable researchers to make 
predictions about the boundary conditions of duration esti- 
mation biases. For example, it would predict that duration 
estimations are more accurate when the event is salient and 
when average duration is used as an anchor than when the 
event is not salient or when external reference levels are 
used as anchors. 

Our finding that inventory estimations follow a com- 
pressive power function of actual inventory is consistent 
with psychophysics research and with a great deal of accu- 
mulated evidence on magnitude estimation studies but is the 
opposite of what signal detection theory would predict. A 
key feature of signal detection theory is its assumption that 
people account for the relative costs of over- and underesti- 
mations (Green and Swets 1988). For inventory estimations, 
overestimations are more costly when inventory is low and 
when stockouts are likely to occur. In contrast, underestima- 
tions are more costly when inventory is high and when 
overstocking is likely. As a result, signal detection theory 
would predict that consumers underestimate low inventory 
levels (to avoid costly overestimations) and overestimate 
high inventory levels (to avoid costly underestimations). 
One explanation for our opposite findings is that in our 
studies, the cost-benefit payoff of the estimations was con- 
stant across inventory levels. Further research could try to 
reconcile the psychophysics and signal detection predic- 
tions by manipulating the costs of over- and underestima- 
tions (e.g., by measuring or manipulating the opportunity 
costs of having too much or too little). Another explanation 
is that wishful thinking leads consumers to bias their esti- 
mations optimistically in the direction of what they hope 
they will be rather than in the direction that will help them 
avoid overstocking and stockouts. Collecting process data, 
especially over time, may help in teasing out these explana- 
tions and in further advancing the understanding of the ori- 
gins of inventory estimation biases. 

Finally, our findings challenge the interpretation of the 
results of purchase quantity and timing models, which 
assume that consumers have accurate, or at least unbiased, 
knowledge of how much of a given product they have in 
inventory (Ailawadi and Neslin 1998; Bell, Chiang, and 
Padmanabhan 1999; Sun 2005). These models are estimated 
on scanner panel data, which contain no information on 
inventory. Therefore, they estimate inventory from a per- 
son's purchase quantity and timing data. Recall our finding 
that estimated inventory predicts repurchase decisions bet- 
ter than actual inventory. This indicates that the inventory 
estimated by these models may be what consumers estimate 
the inventory to be and not what it really is. Thus, these 
models measure the effects on purchases of consumers' 
inventory estimations rather than the effects of their actual 
inventory. Our finding that inventory estimations are inelas- 
tic implies that these models overestimate the effects of 
actual inventory on purchase decisions. Larger changes in 
actual inventory (and, thus, deeper price cuts) may be nec- 
essary to achieve the effect sizes reported in these studies. 
Biases in inventory decisions could easily be incorporated 
into purchase timing and quantity models. One solution 
would be to incorporate a power model of inventory estima- 
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tion within these models. Another would be to convert the 
actual inventory levels predicted by these models directly 
into inventory estimates using the relevant parameters esti- 
mated in the field studies and shown in Table 2. 


Implications for Managers and Consumers 


Our model and the results offer new insights into accelerat- 
ing the consumption of healthful foods. An important find- 
ing of prior research on inventory was the “stock pressure 
effect”; that is, inventory stockpiling accelerated the con- 
sumption of foods (Chandon and Wansink 2002). Unfortu- 
nately, it was convenient snack foods that received the 
greatest boost in consumption frequency when stockpiled. 
Given the rising trend in obesity, responsible managers of 
healthful foods could use our new insights into how con- 
sumers estimate inventory to influence their consumption of 
these foods. One way, as was done with the “Got milk?’ 
campaign, would be to raise inventory salience. This would 
reduce the likelihood of wasting healthful foods, which are 
often more perishable than less healthful ones. It would also 
accelerate the consumption rate of health foods. Studies of 
food-intake diaries have shown that the intake frequency of 
fruits and yogurts increases as their perceived expiration 
date approaches (Wansink 2006). Such efforts can also lead 
people to think of a recent instance in which they consumed 
the food, which in turns increases consumption intentions 
(Wansink and Deshpandé 1994). Because inventory estima- 
tions are anchored on a person’s own typical or normal level 
of inventory, another way of increasing a person’s purchase 
and consumption of a healthful food would be to increase 
his or her perception of what should be a typical or normal 
level of inventory of that food. For example, knowing that 
the average family of four has 11.2 cans of soup in inven- 
tory could lead consumers to alter their inventory (and con- 
sumption) norm for the relatively healthful option of canned 
soup (Wansink and Chandon 2006). 

Our model and the results also offer new insights into 
the targeting of sales promotions to consumer and category 
segments. Consider the two segments of consumers shown 
in Table 1 with identical demand: overstocking-averse con- 
sumers (who only repurchase when their inventory is low) 
and stockout-averse consumers (who repurchase at higher 
levels of inventory). The tendency to overestimate low 
inventory levels makes overstocking-averse consumers too 
conservative and thus limits their annual purchases to 504 
units in the simulation, which is below their annual demand 
of 539 units. In contrast, the tendency to underestimate high 
inventory levels makes stockout-averse consumers too lib- 
eral and thus increases their annual purchases to 588 units, 
which is above their annual demand of 539 units. Which of 
these two segments should be targeted with sales promo- 


tions? To answer this question, we simulated the impact of a 
price reduction that would lower the purchase threshold by 
two units for both segments. We found that this would 
increase purchases by 1296 among overstocking-averse con- 
sumers but by only 6% among stockout-averse consumers. 
The actual increase is likely to be even greater because 
these figures do not account for the consumption accelera- 
tion that higher inventory levels create. Because these 
overstocking-averse consumers probably include those with 
small households or those with small kitchens, multiunit 
promotions (“buy 5 get 1 free") and packaging would seem 
particularly appropriate because they facilitate flexible stor- 
age and reduce spoilage. Although identifying who the 
overstocking-averse consumers are would require specific 
data collection, the same reasoning would apply to category 
differences, suggesting benefits of targeting sales promo- 
tions to categories in which most consumers are averse to 
overstocking rather than to categories in which most con- 
sumers are averse to stockouts. 

Finally, the simulation results show that retailers and 
manufacturers would increase their sales if they helped 
overstocking-averse consumers improve the accuracy of 
their estimations. This is because it would prevent these 
consumers from overestimating their low inventory levels 
and from mistakenly postponing their purchases. In con- 
trast, improving the accuracy of stockout-averse consumers 
would reduce their purchases because it would prevent them 
from making the mistake of repurchasing when they still 
have a lot of product left in inventory. 

What could be done to improve the accuracy of inven- 
tory estimations? The lack of self-knowledge about estima- 
tion strategies revealed by the protocol data and the robust- 
ness of the biases exhibited in the field studies suggest that 
consumers do not learn much from experience, even though 
they often run out of stock and waste overstocked products. 
(This, in itself, raises the issue as to whether consumers 
may be softening the negative consequences of estimation 
errors by adapting their consumption.) 

Our model and our findings suggest that inventory esti- 
mations could be improved by helping consumers recognize 
when their estimations are likely to be particularly inaccu- 
rate, such as when their inventory is lower or higher than 
usual. In these cases, our findings suggest that consumers 
should increase the extremity of their estimations to com- 
pensate for the inelasticity of their intuitive estimates. Mar- 
keters could also improve the accuracy of inventory estima- 
tions by raising the perceptual salience of inventory levels. 
This could be done by changing where and how a product is 
stored, by usage-related advertising, or by transparent pack- 
age designs that facilitate the monitoring of inventory. 
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Factors Influencing the Effectiveness 
of Relationship Marketing: 
A Meta-Analysis 


Relationship marketing (RM) has emerged as one of the dominant mantras in business strategy circles, though RM 
investigations often yield mixed results. To help managers and researchers improve the effectiveness of their 
efforts, the authors synthesize RM empirical research in a meta-analytic framework. Although the fundamental 
premise that RM positively affects performance is well supported, many of the authors’ findings have significant 
implications for research and practice. Relationship investment has a large, direct effect on seller objective 
performance, which implies that additional meditated pathways may explain the impact of RM on performance. 
Objective performance is influenced most by relationship quality (a composite measure of relationship strength) 
and least by commitment. The results also suggest that RM is more effective when relationships are more critical 
to customers (6.g., service offerings, channel exchanges, business markets) and when relationships are built with 
an individual person rather than a selling firm (which partially explains the mixed effects between RM and 


performance reported in previous studies). 


elationship marketing (RM), both in business prac- 
tice and as a focus of academic research, has "expe- 
rienced explosive growth" in the past decade (Srini- 
vasan and Moorman 2005). Morgan and Hunt (1994, p. 22) 
define RM as “all marketing activities directed towards 
establishing, developing, and maintaining successful rela- 
tional exchanges." Most research and practice assumes that 
RM efforts generate stronger customer relationships that 
enhance seller performance outcomes, including sales 
growth, share, and profits (Crosby, Evans, and Cowles 
1990; Morgan and Hunt 1994), but some business execu- 
tives have been disappointed in the effectiveness of their 
RM efforts (Colgate and Danaher 2000). Researchers have 
also suggested that in certain situations, RM may have a 
negative impact on performance (De Wulf, Odekerken- 
Schröder, and Iacobucci 2001; Hibbard et al. 2001). 
Overall, these findings indicate tbat the effectiveness of 
RM efforts may vary depending on the specific RM strategy 
and exchange context; this inconsistency with regard to per- 
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formance suggests the need for a meta-analysis to integrate 
the abundance of accumulated empirical research and to 
understand better the RM strategies that are most effective 
for building strong relationships, the outcomes that are most 
affected by customer relationships, and the conditions in 
which RM is most effective for generating positive seller 
outcomes. Advancing understanding of the primary drivers 
of RM effectiveness can increase the return on firms’ RM 
investments dramatically and provide researchers with 
insights into ways to build more comprehensive models of 
the influence of RM on performance (Reinartz and Kumar 
2003). 

Using Dwyer, Schurr, and Oh’s (1987) seminal article 
on relationships; Crosby, Evans, and Cowles’s (1990) intro- 
duction of relationship quality; and Morgan and Hunt's 
(1994) key mediating variable theory of RM, most research 
has conceptualized the effects of RM on outcomes as fully 
mediated by one or more of the relational constructs of 
trust, commitment, relationship satisfaction, and/or relation- 
ship quality. The existing literature offers a wide range of 
antecedents for these relational mediators, and researchers 
disagree about which one best captures the characteristics 
of a relational exchange that influence performance. For 
example, Morgan and Hunt (1994) propose that trust and 
commitment are both key to predicting exchange perfor- 
mance, whereas others suggest that either trust (e.g., Doney 
and Cannon 1997; Sirdeshmukh, Singh, and Sabol 2002) or 
commitment (e.g., Anderson and Weitz 1992; Gruen, Sum- 
mers, and Acito 2000; Jap and Ganesan 2000) alone is the 
critical relational construct. 

Another school of thought suggests that the global con- 
struct of relationship quality, as reflected by a combination 
of commitment, trust, and relationship satisfaction, offers 
the best assessment of relationship strength and provides 
the most insight into exchange performance (e.g., De Wulf, 
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Odekerken-Schrider, and Iacobucci 2001; Kumar, Scheer, 
and Steenkamp 1995). These different relational mediators 
have been linked empirically to many antecedents and out- 
comes, which leads to the critical question, How does the 
relational mediated model vary across different relational 
perspectives? 

In this article, we systematically review and analyze the 
literature on relational mediators in a meta-analytic frame- 
work (Figure 1) to provide insight into the following four 
research questions: (1) Which RM strategies are most effec- 
tive for building customer relationships? (2) What outcomes 
are most affected by customer relationships? (3) Which 
moderators are most effective in influencing relationship— 
outcome linkages? and (4) How does the RM strategy — 
mediator — outcome linkage vary across different 
mediators? 


Conceptual Framework 
In reviewing the literature pertaining to relational media- 
tors, we identified many constructs with similar definitions 
that operate under different aliases and constructs with 
similar names but different operationalizations. Thus, we 
use a single construct definition (see Table 1) to code exist- 
ing research; we include a construct in the conceptual 
framework only if at least 10 effects emerge to support its 
empirical analysis. Of the many constructs investigated, 
only 18 met these criteria and appear in the model. Our 
nomological placement of each construct is driven by both 
theory and the frequency of placement in extant research. 
Of the studies that include hypothesized relationships with 
relational mediators, more than 90% are consistent with the 
causal ordering of constructs in our framework, with the 


exceptions of conflict and cooperation, which agree with 
our nomological framework in approximately 70% of extant 
studies. 

Although a relationship is, by its very nature, two sided 
and both parties typically share in the benefits of a strong 
relationship, some antecedents and outcomes may have dif- 
ferential effects according to the measurement perspective 
(e.g., dependence). Thus, we adopt terminology to identify 
the perspective of each construct relative to its relational 
mediators. In this framework, "seller" refers to the party 
that implements the RM effort in the hope of strengthening 
its relationship with the “customer,” and the relational 
mediator captures the customer’s perception of its relation- 
ship with the seller. For clarity and consistency, we use 
these customer and seller perspectives even when the two 
parties may not be engaged in a typical exchange transac- 
tion (e.g., a strategic alliance). Thus, we classify 
antecedents and outcomes as “customer focused” when they 
share the same perspective as the relational mediator and as 
“seller focused” when they adopt a perspective opposite 
that of the evaluation of the relational mediator. We develop 
our conceptual framework in four parts, which roughly par- 
allel our research questions, by first reviewing the literature 
on relational mediators, then investigating the antecedents 
and outcomes of these mediators, and, finally, studying 
potential moderators of the impact of relational mediators 
on outcomes. 


Relational Mediators 


Successful RM efforts improve customer loyalty and firm 
performance through stronger relational bonds (e.g., De 
Wulf, Odekerken-Schrider, and Iacobucci 2001; Sirdesh- 


FIGURE 1 
Relational Mediator Meta-Analytic Framework 
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aConstruct had sufficient reported effects to be included in the multivanate causal model. 
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Constructs 


TABLE 1 


Review of Construct Definitions, Aliases, and Representative Studies 


Definitions 


Relational Mediators 


Commitment 


Trust 


Relationship 
satisfaction 


Relationship 
quality 


Antecedents 
Relationship 
benefits 


Dependence 
on seller 


Relationship 
investment 


Seller 
expertise 


Communica- 
tion 


Similarity 


Relationship 
duration 


Interaction 
frequency 


Conflict 


An enduring desire to maintain a 
valued relationship 


Confidence In an exchange 
partners reliability and integrity 


Customers affective or emotional 
state toward a relationship, 
typically evaluated cumulatively 
over the history of the exchange 


Overall assessment of the 
strength of a relationship, 
conceptualized as a composite or 
multidimensional construct 
capturing the different but related 
facets of a relationship 


Benefits received, including time 
saving, convenience, 
companionship, and improved 
decision making 


Customers evaluation of the 
value of seller-provided resources 
for which few alternatives are 
available from other sellers 


Seller's Investment of time, effort, 
spending, and resources focused 
on building a stronger relationship 


Knowledge, experience, and 
overall competency of seller 


Amount, frequency, and quality of 
Information shared between 
exchange partners 


Commonality In appearance, 
lifestyle, and status between 
individual boundary spanners or 
similar cultures, values, and goals 
between buying and selling 
organizations 


Length of time that the 
relationship between the 
exchange partners has existed 


Number of interactions or number 
of Interactions per unit of time 
between exchange partners 


Overall level of disagreement 
between exchange partners 
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Common Allases 


2 


Affective, behavioral, obligation, 
and normative commitment 


Trustworthiness, credibility, 
benevolence, and honesty 


Satisfaction with the 
relationship, but not overall 
satisfaction 


Relationship closeness and 
strength 


Functional and social benefits 
and rewards 


Relative and asymmetric 
dependence, switching cost, 
and imbalance of power 


Support, gifts, resources, 
investments, and loyalty 
programs 


Competence, skill, knowledge, 
and ability 


Bilateral or collaborative 
communication, information 
exchange, and sharing 


Salesperson or cultural 


similarity, shared values, and 
compatibility 


Relationship age or length, 


continuity, and duration with firm 


or salesperson 


Frequency of business contact 
and Interaction Intensity 


Manifest and perceived conflict 
or level of conflict, but not 
functional conflict 


Representative Papers 


Anderson and Weitz 1992; Jap 

and Ganesan 2000; Moorman, 

Zaltman, and Deshpandé 1992; 
Morgan and Hunt 1994 


Doney and Cannon 1997; 
Hibbard et al. 2001; Sirdeshmukh, 
Singh, and Sabo! 2002 


Crosby, Evans, and Cowles 1990; 
Reynolds and Beatty 1999 


Crosby, Evans, and Cowles 1990; 
De Wulf, Odekerken-Schróder, 
and lacobucci 2001 


Hennig-Thurau, Gwinner, and 
Gremler 2002; Morgan and Hunt 
1994; Reynolds and Beatty 1999 


Hibbard, Kumar, and Stern 2001; 
Morgan and Hunt 1994 


De Wulf, Odekerken-Schróder, 
and lacobucci 2001; Ganesan 
1994 


Crosby, Evans, and Cowles 1990; 
Lagace, Dahlstrom, and 
Gassenhelmer 1991 


Anderson and Weitz 1992; Mohr, 
Fisher, and Nevin 1996; Morgan 
and Hunt 1994 


Crosby, Evans, and Cowles 1990; 
Doney and Cannon 1997; Morgan 
and Hunt 1994 


Anderson and Weltz 1989; Doney 
and Cannon 1997; Kumar, 
Scheer, and Steenkamp 1995 


Crosby, Evans, and Cowles 1990; 
Doney and Cannon 1997 


Anderson and Weitz 1992; 
Kumar, Scheer, and Steenkamp 
1995 


TABLE 1 


Continued 
Constructs Definitions Common Allases Representative Papers 
Outcomes i 
Expectation Customers Intention to maintain Purchase intentions, likelihood Crosby, Evans, and Cowles 1990; 
of the relationship in the future, to leave (reverse), and Doney and Cannon 1997 
continuity which captures the likelihood of relationship continuity 
continued purchases from the 
seller 
Word of Likelihood of a customer - Referrals and customer referrals Hennig-Thurau, Gwinner, and 
mouth positively referring the seller to Gremler 2002; Reynolds and 
another potentlal customer Beatty 1999 
Customer Composite or multidimensional Behavioral loyalty and loyalty De Wulf, Odekerken-Schróder, 
loyalty construct combining different and lacobucci 2001; Hennig- 
groupings of intentions, attitudes, Thurau, Gwinner, and Gremler 
and seller performance indicators 2002; Sirdeshmukh, Singh, and 
Sabol 2002 
Seller Actual seller performance Sales, share, sales Reynolds and Beatty 1999; 
objective enhancements including sales, effectiveness, profit, and sales Siguaw, Simpson, and Baker 
perform- share of wallet, profit performance 1998 


ance performance, and other 
measurable changes to the 
sellers business 


Coordinated and complementary 
actions between exchange 


Cooperation 


Coordination and joint actions 


Anderson and Narus 1990; 
Morgan and Hunt 1994 


partners to achieve mutual goals 


mukh, Singh, and Sabol 2002), but the literature offers var- 
ied perspectives on which relational constructs mediate the 
effects of RM efforts on outcomes. Commitment and trust 
are most often studied; commitment is “an enduring desire 
to maintain a valued relationship" (Moorman, Zaltman, and 
Deshpandé 1992, p. 316), and trust is *confidence in an 
exchange partner's reliability and integrity" (Morgan and 
Hunt 1994, p. 23). Another relationship mediator, relation- 
ship satisfaction, is a customer's affective or emotional state 
toward a relationship. Relationship satisfaction reflects 
exclusively the customer's satisfaction with the relationship 
and differs from the customer's satisfaction with the overall 
exchange. Other researchers have suggested that these 
mediators are merely indicators of the global mediator rela- 
tionship quality, which is "an overall assessment of tbe 
strength of a relationship" and is conceptualized as a multi- 
dimensional construct that captures the many different 
facets of an exchange relationship (De Wulf, Odekerken- 
Schróder, and Iacobucci 2001, p. 36; see also Crosby, 
Evans, and Cowles 1990). Its structure and underlying 
dimensions vary across empirical studies, but central to the 
conceptualization is the belief that no single dimension or 
relational construct can fully define the “overall depth or 
climate” of an exchange relationship (Johnson 1999, p. 6). 
Thus, whereas the literature consistently conceptualizes 
a mediating model for the effects of RM on performance, 
the specific relational mediators or composite of mediators 
appear to be driven mainly by researcher discretion; empiri- 
cal comparisons of the differential effects of these relational 
mediators are noticeably absent. For example, some 
researchers propose trust as the critical relational mediator. 


Berry (1996, p. 42) offers "trust as perhaps the single most 
powerful relationship marketing tool available to a com- 
pany,” and Spekman (1988, p. 79) suggests that trust is the 
“cornerstone” of long-term relationships. Alternatively, 
Gundlach, Achrol, and Mentzer (1995, p. 78) propose com- 
mitment as the "essential ingredient for successful long- 
term relationships," and Morgan and Hunt (1994, p. 23) 
suggest “commitment among exchange partners as key to 
achieving valuable outcomes." De Wulf, Odekerken- 
Schröder, and Iacobucci (2001) prefer the overall concept of 
relationship quality to any specific component. In summary, 
there is little agreement among researchers as to which indi- 
vidual or composite relational mediator best captures the 
key aspects of a relationship that most affect outcomes. To 
address this issue empirically, our meta-analytic framework 
compares the relative effects of the different perspectives by 
analyzing relational mediators separately and as a group. 


Antecedents to Relational Mediators 


Customer-focused antecedents. Customers may per- 
ceive value in a relationship when they receive relationship 
benefits from an exchange partner (e.g., time savings, con- 
venience, companionship), which increases their willing- 
ness to develop relational bonds. Relationship benefits have 
been shown to affect relational mediators positively (Mor- 
gan and Hunt 1994; Reynolds and Beatty 1999). Depen- 
dence on the seller reflects the customer's evaluation of the 
value of seller-provided resources for which few alterna- 
tives are available (Hibbard, Kumar, and Stern 2001). The 
literature is mixed regarding the effect of a customer's 
dependence, or relative dependence (i.e., the customer's 
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dependence reduced by the seller’s dependence), on rela- 
tional mediators. Researchers find empirical support for 
both positive and negative influences of relative dependence 
on relational mediators (Anderson and Weitz 1989; Morgan 
and Hunt 1994), which indicates that its impact may be 
contingent on the context. 


Seller-focused antecedents. Researchers have investi- 
gated various RM strategies that sellers can employ to 
strengthen relationships. Relationship investment refers to 
the time, effort, and resources that sellers invest in building 
stronger relationships. Such investments often generate 
expectations of reciprocation that can help strengthen and 
maintain a relationship and, therefore, positively influence 
relational mediators (Anderson and Weitz 1989; Ganesan 
1994). Seller expertise reflects the knowledge, experience, 
and overall competence of the seller. When customers inter- 
act with a competent seller, they receive increased value, 
their relationship becomes more important, and they invest 
more effort to strengthen and maintain it (Crosby, Evans, 
and Cowles 1990; Lagace, Dahlstrom, and Gassenheimer 
1991). 


Dyadic antecedents. Customer- and seller-focused 
antecedents are meaningful from one side of the exchange 
dyad, but other antecedents require the active involvement 
of both exchange partners and are equally meaningful from 
both perspectives. For example, communication, or the 
amount, frequency, and quality of information shared 
between exchange partners (Mohr, Fisher, and Nevin 1996), 
requires both parties to exchange information. Communica- 
tion builds stronger relationships in an exchange by helping 
resolve disputes, align goals, and uncover new value- 
creating opportunities (Morgan and Hunt 1994). Similarity 
is the commonality in appearance, lifestyle, and status 
between individual boundary spanners or the similar cul- 
tures, values, and goals between organizations. Such simi- 
larities between people or organizations may provide cues 
that the exchange partner will help facilitate important goals 
and has been shown to affect relational mediators positively 
(Crosby, Evans, and Cowles 1990; Doney and Cannon 
1997). Relationship duration is the length of time that the 
relationship between the exchange partners has existed, 
whereas interaction frequency refers to the number of inter- 
actions per unit of time between partners. Both provide 
trading partners with more behavioral information in varied 
contexts, which allows for better predictions that should 
increase each party's confidence in its partner's behavior 
(Anderson and Weitz 1989; Doney and Cannon 1997). 
Finally, conflict entails the overall level of disagreement 
between exchange partners; this is often termed "perceived" 
or “manifest” conflict. As conflict increases, the customer is 
less likely to have confidence in the long-term orientation 
of the seller or to invest in building or maintaining a rela- 
tionship; thus, conflict should negatively influence the cus- 
tomer's trust in and commitment toward the seller (Ander- 
son and Weitz 1992). 


Consequences of Relational Mediators 


Customer-focused outcomes. Increased customer loyalty 
is one of the most common outcomes expected from RM 
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efforts, but loyalty has been defined and operationalized in 
many different ways. An expectation of continuity reflects 
the customer's intention to maintain the relationship in the 
future and captures the likelihood of continued purchases. 
However, researchers have criticized this measure of loyalty 
because customers with weak relational bonds and little 
loyalty may report high continuity expectations as a result 
of their perceptions of high switching costs or their lack of 
time to evaluate alternatives (Oliver 1999). Word of mouth 
(WOM) captures the likelihood that a customer will refer a 
seller positively to another potential customer and, there- 
fore, indicates both attitudinal and behavioral dimensions of 
loyalty. Some studies operationalize customer loyalty as a 
composite or multidimensional construct that includes 
groupings of intentions, attitudes, and seller performance 
indicators. We note that relational mediators positively 
influence global measures of customer loyalty, just as they 
do its individua] components (De Wulf, Odekerken- 
Schróder, and Iacobucci 2001; Sirdeshmukh, Singh, and 
Sabol 2002). 


Seller-focused outcomes. Possibly the most important 
outcome of RM efforts is seller objective performance, 
which captures the seller's actual performance enhance- 
ments, including sales, profit, and share of wallet. Some 
researchers have found empirical support for the influence 
of relational mediators on seller objective outcomes (e.g., 
Doney and Cannon 1997; Siguaw, Simpson, and Baker 
1998), but several other studies have failed to find any sig- 
nificant effects, which implies that the effect of RM on 
performance may be context dependent (Crosby, Evans, and 
Cowles 1990; Gruen, Summers, and Acito 2000). 


Dyadic outcomes. Cooperation captures the level of 
coordinated and complementary actions between exchange 
partners in their efforts to achieve mutual goals. Coopera- 
tion promotes value creation beyond that which each party 
could achieve separately, but because one party often 
receives its portion of the value earlier, the other party must 
have enough trust in the relationship to wait for its future 
reciprocation. Researchers have shown that trust and com- 
mitment between exchange partners are critical for cooper- 
ation (Anderson and Narus 1990; Morgan and Hunt 1994). 


Moderators of Relational Mediators’ Influence on 
Outcomes 


The RM model (RM strategies — relational mediator > 
outcomes) we conceptualize herein can be applied across 
many different contexts in which business strategies may 
have varying effects. Therefore, an objective of our meta- 
analysis is to identify and empirically test the influence of 
potential moderators on the linkages in the RM model. 


Contexts influencing relationship importance. Relation- 
ship marketing is based on the premise that building strong 
relationships positively influences exchange outcomes, and 
researchers recognize that exchanges vary across a spec- 
trum from transactional to relational (Anderson and Narus 
1991). For exchanges in which relationships are more 
important, we expect that the relational mediators will have 
a greater impact on outcomes, whereas in highly transac- 
tional exchanges, the relationships between buyers and sell- 


ers may have little influence on outcomes. Extant literature 
identifies three situations in which relationships may be 
more important for the success of an exchange. First, in 
general, services are perceived as less tangible, less consis- 
tent, and more perishable, and customers and sellers are 
more involved in the production and consumption of ser- 
vices than they are for products (Zeithaml, Parasuraman, 
and Berry 1985). This close interaction between customers 
and sellers may make customer-seller relationships more 
critical for services, and the intangibility of the offering 
may make the benefits of trust more critical because evalua- 
tions often are ambiguous. 

Second, channel researchers tend to distinguish between 
channel partner exchanges and direct seller-customer trans- 
actions. Exchanges between channel partners have higher 
levels of interdependence, require coordinated action, and 
rely on the prevention of opportunistic behavior (Anderson 
and Weitz 1989). Thus, coordination improvements and the 
reduction of opportunistic behaviors through strong rela- 
tionships should be more important in a channel context, 
which should lead to a greater impact of relational media- 
tors on performance than their impact in direct exchanges. 

Third, Anderson and Narus (2004, p. 21) differentiate 
consumer and business markets on the basis of the impor- 
tance of relationships, maintaining that a "firm's success in 
business markets depends directly on its working relation- 
ships." If a working relationship is more critical.for a cus- 
tomer's success in business markets, relationships should 
have a greater impact on exchange outcomes in business 
than in consumer markets. 


Individual versus organizational relationships. Cus- 
tomers may form a relationship with an individual boundary 
spanner in the selling organization and/or with the selling 
organization as a whole. This issue of individual versus 
organizational relationships also has significant managerial 
implications as firms continue to try to increase their ser- 
vice efficiencies through the use of technology (e.g., cus- 
tomer relationship management). Experimental research 
shows that when people evaluate another individual, they 
make stronger, quicker, and more confident judgments than 
when they evaluate a group; those judgments also are more 
strongly related to outcomes and behaviors (Hamilton and 
Sherman 1996). Accordingly, we expect that customers' 
judgments based on the relational characteristics of an indi- 
vidual boundary spanner (e.g., trust in the salesperson) will 
be stronger, more confident, and more strongly linked to 
outcomes than their judgments based on the relational char- 
acteristics of a selling firm (e.g., trust in the firm). Post hoc 
findings support this premise. Doney and Cannon (1997, p. 
45) report that "the process by which trust develops appears 
to differ when the target is an organization ... as opposed to 
an individual salesperson,” and Iacobucci and Ostrom 
(1996, p. 69) find that “[iJndividual-to-firm relationships 
[are] also typically short-term and less intense in compari- 
son to individual-level dyads.” Thus, the positive effect of 
relational mediators on outcomes will be greater when the 
relational mediator is targeted toward an individual member 
of the selling organization than when it is targeted toward 
the organization. 


Method 


Collection and Coding of Studies 


The key impetus for RM research was Dwyer, Schurr, and 
Oh's (1987) seminal article, so we searched empirical 
research for the mediators of interest during the period 
1987—2004. We employed various methods in our literature 
search, including (1) a search of the ABl/Informs, 
PsycINFO, and Business Source Premier databases for each 
relational mediator; (2) a search of the Social Sciences Cita- 
tion Index, using the seminal articles for these constructs; 
(3) manual shelf searches of journals that contain research 
on relational mediators; and (4) e-mails sent to researchers 
in the domain asking for their published and unpublished 
works. Our search generated more than 100 published and 
unpublished studies, each of which we evaluated for mea- 
sures of the relationships among antecedents, outcomes, 
and the four relational mediators. Because correlations were 
the most common metric included in these studies (29590), 
we e-mailed authors to request the correlation matrices for 
any studies in which they were not provided. Two indepen- 
dent coders, who were not familiar with tbe study, used the 
definitions in Table 1 to code the studies, and any differ- 
ences (overall agreement 29596) were resolved through dis- 
cussion (Szymanski and Henard 2001). When a single study 
provided more than one effect size estimate for the same 
relationship, we calculated an average. In cases in which the 
multiple effect size estimates from the same study were 
independent, we included them as separate effect size esti- 
mates. This procedure prevents the bias that may occur as a 
result of multiple counts of dependent effect size estimates 
and enables us to code moderators that vary across subsets 
of a sample in a single study (e.g., Brown and Peterson 
1993). Ultimately, we combined 637 correlations from 111 
independent samples drawn from 94 different manuscripts 
to yield a combined N of 38,077 with which to calculate the 
pairwise effect size estimates. 1 


Univarlate Analyses 


We began our analysis by adjusting our basic input mea- 
sure, correlations (r), for corrections due to measurement 
error (scale reliability differences); we report the correlation 
adjusted for reliability (Hunter and Schmidt 1990). We then 
adjusted for sampling error (sample size differences), and 
we report the sample-weighted reliability-adjusted r and its 
95% confidence intervals (CIs).2 We calculated the chi- 
square test (d.f. = 1) for association and then addressed the 
file-drawer problem by computing the classical file drawer 
N (Rosenthal 1979) and the Q statistic test of homogeneity 


1A list of the articles used in our empirical meta-analysis is 
available on request. 

2Before applying the sample weights, we first converted the 
reliability-adjusted r's to variance-stabihzing Fisher’s z scores 
(Rosenthal 1994; Shadish and Haddock 1994). Following standard 
procedures (Shadish and Haddock 1994, p. 268), we reconverted 
them back to r's to report the sample-weighted reliability-adjusted 
r and the 9596 CIs. 
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(Cheung and Chan 2004; Hunter and Schmidt 1990) for 
each relationship.3 

We performed such analyses for the influence of each 
antecedent on the four relational mediators (provided that 
there were four effects for each antecedent), which enables 
us to compare the influence of each antecedent on each 
mediator. Although these mediators measure different 
aspects of a relationship, researchers have argued that they 
are highly related and difficult to distinguish and, therefore, 
can be combined into a composite construct (Crosby, 
Evans, and Cowles 1990; De Wulf, Odekerken-Schróder, 
and Jacobucci 2001; Smith 1998). To facilitate the compari- 
son of relative effects among the different antecedents on 
the overall relationship, we duplicated these analyses for the 
effects of each antecedent on all four mediators as a group. 


Causal Model 


In addition to the pairwise analyses, we aggregated the 
studies to test the nomological causal model implicit in Fig- 
ure 1. This multivariate technique has the advantage of ana- 
lyzing all linkages simultaneously, but it also needs signifi- 
cantly more data because the effects (i.e., correlation 
coefficients) must be available between each construct in 
the model and all other constructs, not just the pairwise 
effects for constructs with proposed relationships (Brown 
and Peterson 1993). Thus, causal models typically are lim- 
ited to only the most commonly studied constructs. We 
determined the average-adjusted intercorrelation among all 
constructs in the framework whose required correlation 
coefficients were reported in three or more studies (Table 
2). Furthermore, to increase the number of constructs that 
met this requirement and to provide a concise synthesis of 
the literature, we grouped all relational mediators together, 
and thus we propose a fully mediated model (Morgan and 
Hunt 1994). Of the 14 antecedent and outcome constructs 
we include in Figure 1, only 6 met this criterion and could 
be evaluated in the causal model. 


Results 


After we report the results of our causal model estimation 
procedure (Cheung and Chan 2005; Furlow and Beretvas 
2005), we provide the results of the pairwise and casual 
model analyses structured around our four focal research 
questions. Because the first two questions focus on the 


3To estimate the publication bias associated with published 
studies, we employed multiple methods: (1) Rosenthal's (1979) 
well-known file drawer method; (2) Orwin's (1983) failsafe N, 
which represents the number of missing studies (set to .05) that 
would bring the effect to .075, or less than the .10 effect level that 
Cohen (1977) classifies as signifying a low effect; and (3) compre- 
hensive meta-analysis software (http://www.meta-analysis.com/ 
html/stat_analysis_overview.html) to compute funnel plots for the 
various relationships. Funnel plots offer a simple scatter 
plot—based visual tool for investigating publication bias in meta- 
analyses (Sterne and Egger 2001). Overall, the funnel plots cor- 
roborate the inferences we drew from the file drawer N and 
Orwin’s failsafe N; namely, the data we use in the meta-analyses 
do not display any evidence of publication bias. 
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effectiveness of antecedents that influence relational media- 
tors (Table 3) and relational mediators that influence out- 
comes (Table 4), we report the findings beginning with the 
most influential constructs, and we concentrate on the 
aggregate results for all mediators (i.e., last row of each 
construct in Tables 3 and 4). Next, we report the results of 
the moderator analyses to understand the context in which 
RM is most effective. Finally, we concentrate on the last 
research question, namely, how the RM mediating model 
varies across different mediators. For this question, we no 
longer focus on the aggregate results but instead evaluate 
the effects that pertain to each mediator and thus provide 
insight into how the effects vary across mediators. 


Causal Model Estimation 


The fit indexes from the structural model testing of the 
causal model indicate that this model fits the data poorly: 
X*(5) = 322.27, p < .01; comparative fit index = .87; 
goodness-of-fit index = .95; and root mean square error of 
approximation = .19 (for an example of this technique, see 
Brown and Peterson 1993). Modification indexes suggest a 
revised causal model that includes direct paths from depen- 
dence on seller and relationship investment to seller objec- 
tive performance. The revised model results in indexes that 
indicate a good fit to the data: y2(3) = 12.15, p < .01; com- 
parative fit index = .89; goodness-of-fit index = .99; and 
root mean square error of approximation = .04.4 


Which RM Strategies Are Most Effective for 
Building Customer Relationships? 


As we show in Table 3, not all RM strategies (antecedents) 
are equally effective for: building relationships. The average 
of the sample-weighted reliability-adjusted correlations 
among antecedents and mediators is .41, and they range 
from .13 for relationship duration to the largest absolute 
effect of —67 for conflict. All paths from antecedents to 
relational mediators are supported in the pairwise analyses, 
except for the path from interaction frequency to relation- 
ship satisfaction. Most of these findings appear to be robust 
with regard to the number of null studies needed to render 
the observed effects zero (mean file-drawer N is 3152). 
Only three linkages appear susceptible to a file-drawer 
problem: interaction frequency — relationship satisfaction 
(N = 2), relationship duration — relationship satisfaction 
(N = 18), and relationship duration — relationship quality 
(N = 25). In the Q-statistic test for homogeneity, with one 
exception (i.e., seller expertise — relationship satisfaction), 
all the tests for homogeneity are significant. 

Several insights can be drawn from the evaluation of the 
relative impact of different RM strategies on building strong 


*The median sample size from our meta-analysis of the studies 
included in the causal model is 2839. Modification indexes indi- 
cate that a direct path from relationship duration to seller objective 
performance could improve the model fit, but our evaluation of the 
parsimony-adjusted fit indexes suggests that the slight improve- 
ment in fit is more than offset by a loss in parsimony. The signifi- 
cance and pattern of effects do not change with this additional 
path, so we do not add it. 


TABLE 2 
Average Reliabllity-Adjusted Intercorrelations Among Constructs in Causal Model 





Constructs RBEN 
Relation Benefits (RBEN) [.87] 
SD 


Number of studies 
Cumulative sample size 


Dependence on Seller (DEPS) 12 
SD .29 
Number of studies " 4 
Cumulative sample size 886 

Relationship Investment (RINV) 42 
SD . 08 
Number of studies =. 7 
Cumulative sample size 1911 

Communication (COMM) 49 
SD .15 
Number of studies 10 
Cumulative sample size 2380 

Relationship Duration (RDUR) .15 
SD —. .08 
Number of studles 3 
Cumulative sample slze 1097 

Relational Mediator (RMED) | 48 
SD .18 
Number of studies 18 
Cumulative sample size 5108 

Seller Objective Performance (SOP) 23 
SD .04 
Number of studies 3 
Cumulative sample size 600 


DEPS RINV COMM RDUR RMED SOP 


[.85] 

13 [.82] 

21 

5 

1273 

28 AT [.85] 

21 13 

12 9 
3260 2893 

14 02 18 [.99] 

17 06 13 

7 7 6 

2150 3496 2282 

19 45 51 11 [.85] 
25 18 19 13 

33 24 38 26 

9296 8564 9803 10,720 

29 44 30 12 35 [.92] 
31 31 30 12 22 
8 7 12 8 47 
2860 2813 3149 4293 16,469 


Notes: Entries on the diagonal In brackets are weighted mean Cronbach’s alpha coefficients. We included constructs in the causal model when 
three or more correlation coefficients were available among that construct and all other constructs In the model. 


customer relationships. Conflict (r = —.67) has the largest 
absolute impact on the relational mediators of all 
antecedents, in support of the importance of resolving prob- 
lems and disagreements to prevent relationship-damaging 
conflicts (alternatively, the presence of conflict may seri- 
ously undermine the effect of other RM antecedents). That 
the largest effect is negative extends to the RM domain the 
finding that people pay more attention to negatives than to 
positives (Shiv, Edell, and Payne 1997); this warrants fur- 
ther investigation. The seller expertise (r — .62) and commu- 
nication (r = .54) antecedents have the greatest positive 
influence on relational mediators. The great impact of seller 
expertise suggests the importance of training boundary 
spanners and the potential detriments of staffing call centers 
with inexperienced or unskilled employees. The influence 
of seller expertise also seems to apply across all four rela- 
tional mediators, in support of Vargo and Lusch's (2004, p. 
3) claim that "skills and knowledge are the fundamental 
unit of exchange,” such that sellers’ skills and knowledge 
are the most important value-creating attributes. Similarly, 
the large positive effect of communication on all mediators 
is consistent with its role in both uncovering value-creating 
opportunities and resolving problems. 


Relationship investment (r = .46), similarity (r = .44), 
and relationship benefits (r = .42) are the next most influen- 
tial RM strategies. The strong positive impact of the seller’s 
relationship investments and customer relationship benefits 
indicates that managers should engage in proactive RM 
spending. The importance of similarities between buyers 
and sellers suggests that without common reference points, 
exchange partners may find it difficult to move the 
exchange from a purely economic or transactional basis to a 
relational basis. The last three antecedents—dependence on 
seller (r = .26), interaction frequency (r = .16), and relation- 
ship duration (r = .13)—have notably smaller effects on 
relational mediators. 

The causal model analysis generates the same rank 
order of relative effects of the antecedents on relational 
mediators as the pairwise analyses, which increases our 
confidence in the univariate results. Communication (B = 
29, p < .01), relationship investment (D = .23, p < .01), rela- 
tionship benefit (B = .18, p « .01), and dependence on seller 
(B = .05, p « .01) all have significant, positive effects on 
relational mediators, but relationship duration (B = .02) fails 
to influence relational mediators significantly in the multi- 
variate analysis. 
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What Outcomes Are Most Affected by Customer 
Relationships? 


In Table 4, we show that customer relationships do not 
equally influence all exchange outcomes. The average of 
the correlations among relational mediators and: outcomes is 
.55, ranging from a low of .35 for seller objective perfor- 
mance to a high of .70 for cooperation. All paths from rela- 
tional mediators to outcomes are supported. None of these 
results appears to be susceptible to a file-drawer problem; 
all paths would require more than 375 null studies to gener- 
ate a zero effect, with a mean file drawer N of 6153. All 
Q-statistic tests for homogeneity are significant, demon- 
strating statistical heterogeneity and supporting a moderator 
analysis. 

Relational mediators have the largest combined influ- 
ence on the dyadic outcome of cooperation (r, = .70), fol- 
lowed by WOM (r = .61). This finding reinforces the impor- 
tance of relationship building for a high level of customer 
cooperation. The greater impact of relational mediators on 
WOM (r = .61) than on the expectation of continuity (r = 
56) or on customer loyalty (r = .52) lends support to Reich- 
held's (2003, p. 48) premise that WOM may. be the best 
indicator of "intense loyalty" Only customers who have 
strong relationships with sellers are willing to risk their own 
reputation by giving a referral. 

Of the five outcomes, relational mediators have the least 
influence on seller objective performance (r = .35). Thus, 
although customer relationships positively influence perfor- 
mance outcomes, in support of efforts put into RM strate- 
gies, the actual effect on performance is lower than that on 
the other four outcomes. This finding is not surprising; rela- 
tional mediators are more closely related to loyalty and 
cooperation than is objective performance, which often 
depends on other, nonrelational factors (e.g., the economy). 

The causal model includes only one outcome, but we 
confirm the significant influence of relational mediators on 
seller objective performance (P = .16, p < .01). In addition, 
although the impact of relationship benefits and communi- 
cation strategies on seller objective performance is fully 
mediated by the relational mediators, the influence of 
dependence and relationship investment is only partially 
mediated; both dependence on seller (B = .22, p « .01) and 
relationship investment (B = .34, p « .01) also have large, 
direct effects on seller performance. 


Which Moderators Are Most Effective in 
Influencing Relationship-Outcome Linkages? 


In Table 5, we present the influence of moderators on the 
linkage between relational mediators and outcomes.? The 
premise that customer relationships have a greater impact 
on exchange outcomes in situations in which relationships 
are more critical to the success of the exchange is supported 





5We carried out the moderator analysis using the procedure that 
Brown (1996) and Grewal and colleagues (1997) employ. How- 
ever, the results we report in Table 5 must be interpreted cau- 
tiously because in the majority of the nonsignificant cases, the 
power of the test is relatively small (Cohen 1977; Fern and Mon- 
roe 1996). On the basis of our power analysis, we have flagged the 
relationships in Table 5 that we believe researchers would be pre- 
mature in dismissing. 


for the impact of all mediators on customer loyalty among 
services, channels, and business customers. The correlation 
of all mediators with customer loyalty is .58 for service ver- 
sus .43 for product-based exchanges (p « .05), .65 for chan- 
nel versus .46 for direct interactions (p « .01), and .56 for 
business versus .46 for consumer markets (p « .05). We find 
a similar effect in business markets for the impact of rela- 
tionships on seller objective performance, for which the 
influence of all mediators is r = .36 in business markets ver- 
sus r = .25 in consumer markets (p < .01). In summary, the 
significant moderation of the influence of relationships on 
customer loyalty across services, channels, and business 
markets, as well as on performance in business markets, 
provides support for our premise that customer relation- 
ships have a greater impact on exchange outcomes in situa- 
tions in which relationships are more critical to success. 

Contrary to our expectations, relational mediators' 
influence on the expectation of continuity is greater in con- 
sumer than in business markets, mostly because of commit- 
ment's influence on the expectation of continuity (r — .46 
business, r = .71 consumer; p « .05). Because commitment 
taps a customer's desire to maintain a valued relationship, 
whereas the expectation of continuity captures a customer's 
intent to maintain the relationship, consumers may be better 
able to convert their attitudes or desire into an intention than 
are business buyers because consumers have a higher 
degree of control over their actions. Consistent with the 
theory of planned behavior, the link between an attitude and 
an intention should be stronger as control increases (Ajzen 
and Fishbein 1980). Thus, the stronger impact of commit- 
ment on the expectation of continuity (which results from 
higher levels of control) in consumer markets than in busi- 
ness markets may offset the typically greater importance of 
relationships in business markets. 

As we proposed, relationships have a greater impact on 
customer loyalty when the target of the relationship is an 
individual person (r = .56) than when the target is an orga- 
nization (r = .46; p « .05). Similarly, the impact of relational 
mediators on cooperation is greater (r — .68 for interper- 
sonal, r = .55 for interorganizational; p < .05) when the cus- 
tomer's relationship is targeted toward a person employed 
by the seller than when it is targeted toward the seller over- 
all. We provide additional support for this finding in Table 
5, in which we show that of the 16 moderation tests, 81% 
are in the expected direction, and the impact of all media- 
tors on seller objective performance is significant at the p « 
10 level (r = .40 for interpersonal, r = .31 for 
interorganizational). 


How Does the RM Strategy — Mediator — 
Outcome Linkage Vary Across Mediators? 


The preceding research questions focus on the effects of the 
four mediators as a group. In this subsection, we investigate 
the individual linkages to identify when mediators operate 
differently; we begin with the front half of the model: the 
RM strategy — relational mediator linkage (Table 3). The 
effectiveness of RM strategies varies across different rela- 
tional mediators. We consider the differential effects of rela- 
tionship investments and benefits on mediators together 
because they are logically related. Sellers' relationship 


The Effectiveness of Relationship Marketing / 147 


'SISÁTEUB J0]8Jepoui Jejnonred yey] uuopied 1ou pip e^ yeu} ejeorpuj oi usep B esn eA ‘| UEU} sse| S| JO}BIePOW JO [GAR] euo JO} SƏ jo JEqUINU eu ueuA JO 
JEW JOU 8| HONO SIU} UBYAA "I018Jepouu JO |e^e| uoee 18 $126jje oau} 1s89j ye Bujpuy jo Ajijqeqoud uoud B eu; einsue oj aou JO XIS S! Saye ABI jo Jequunu [810] eui ueu Ajuo suosugduloo 
jno Áure am 'Ajeuoggeledo 'sjeApeue eui wo WOM Burddoup pejseBBns s129jje jo 1equinu payu, eu; 'uopippe uj "'suosueduioo eu} Wo sdnou6qns oju| pepoo eq jou pinoa yeu) sejpnys 
peddojup eA (9681 UMOJg) suosueduioo 1891-1 0} pejoe[qns 'siseujuered ui 81984 jo 1equunu [810] eui ui^ *[e^e; JojeJepoul LOBE 10) pe1ejunooue s4286 eane eu MOUS SELIG [eo eu | :S91oN 
'(966| eouoj pue Ue '//61 ueuo?) peyejepouu ÁpugoyiuBis Jou sB Burssjusip ui eunyeureud eq Aew sjeuoreesei yeu} 
sdjusuomele1 (,8, }duosuedns ui) Ajnuep| e^ '%08 0} amod eui eaow 0} peujnbei seipnis jo 1equinu pue 'jewod 'ezis ejo peyedxe eu jo sjseq eu UC "J[guis ÁJGARSJ8GJ 8| 196) OL UIA poep 
-OS8B Jemod eiu 'diusuoge|e! ueAJD e jo 126jje eu ejeJepoui AnugogiuDIs jou 860P ejqeueA JojeJepoui B uou 10} sesso eui jo Ajuo[eu e u| 'Ajsnogneo pejeJd1eju| eq pjnous sinsei jueoyiuDisuoNs 
'(pej[m-euo) Go" > d, 


—————————————————M—M—————— M ——— M a — 


(0€) SG" (Yt) 89° (€) 19 (ev))oy  (v220&* (1209 (9969 (4) 29 Gy uope1edooo + siojeipeui [Iv 
— — — — — — — — 0 uogajedooo + Ápjenb diusuonsjeu 
— — — — — — — — q uopge1edooo «- uomosjsnes djysuonejey 
(FL) v9' (6) OZ (1) 99 (€2)z9° (rL)99° (OL) OZ’ (12) 49 (&) ez ve uonBiedooo + 1sni| 
APL) OF (2) 6z (M wo  (sDos (6) ix (0er 1Dzv (8) ee 9L uopBJedooo e jueuujululo7) 
ally) LE (cc) Or .(el)Se (es)9e (Le) re (61) 9€ (ly) 9€ (21) Ze S9 eougwuoped eajoelqo 1ej[es <— suopipeow |y 
aly) ZV (Z) 6¢° — — — — — — 9 eouBuuoped eAgnoe[qo 1ejes <— Áyenb diusuogsjeH 
eZ) 6L (S) tr (c) ec (9) or — — — — Z eouguuoped eAnoe[qo 1ejjes + uopogjsyes djusuongjeu 
(ZL) Le’ (€L) 8€ (9) 9c (92)se°  (oDOoe& (2) Le (pe) v&€ (v) Za ce eousuuoped eAnoeíqo 1ejes + ISM 
(8L) 9€ (2) 8z Fr) be (bee  (uD2& (z) ze xoDor (9) oz 0c eouguuoped eAnoe[qo sejas + jueunjuuoo 
(6€) 9t (01) 9e (66) 9%  (1D99 .o0por (oO so deger (s oc 0G AyeÁo| Jeuiojsno + sJojBIpeui |y 
(8) v (4) 19° A ep (Z) og — — 29) or (L) vv 6 ÁyeÁo| 1euojsno + Aypenb diysuopejoy 
JU) EF (2) es (2) ce (z) o9 — — — — 6 ÁyeÁo| Jeujojsno <— uoposjsnes djysuopRjey 
(pL) OS (S) 1€ (91D)! 86" (vp) er. «UDev. (1) ee (ZL) Zp’ (s) eg Oc Areoj 1eulojsno & 1snJ| 
(0L) 28 (zæ) co (6) co (c) es (6) es (Dee 8) ev (z) oz cL AyfeAo| Jewoysno <— jueuujulujo) 
(Ze) es (ej) | .(s)iS ) (ee)es  (eD2s — (oove (yz) es’ (SL) 9e 8r Aynujuos jo uoiejoedxe + suoyeipew |y 
— — — — — — — — € Áynuguoo jo uogngjoedxe + Ayrenb diusuonsjeu 
— — — — — — — — G Aynuguoo jo uongjoedxe + uopoejsnes djusuonsjed 
(ZL) Se (p) Og (60 zo | (ce))vs (ese (2) Z9 (si) 9g (9) IF vc Áynujguoo jo uogejoedxe «c 1snJJ. 
(ZL) es (vy) po | .(9 i£ agw (Obes (p vs (8 or (9) ee 9L Áynuguoo jo uojej2edxe <— jueunmuulo2 
[eguopgEzjueB1o jenpjupu| 4eumsuot sseujsng Pæja jeuuByg  jonpoid  eojues SPAI Sdjysuo}jBjoy pe1eJepow 
———————— o —————— 0 ———— ——————— B Oo 
sdiusuojejeu &]oXIBIN 1Joumnsuo2 soDuBuox3 28110 soBDuBu2x3 AUD 
[gBuonezjueBuo SnsJeA sseu[sng Sn8J9A Jouusu? poseg-1onpoJd [20L 
SNSIVA |BNPIAIPU| Sn8J9A e2|AJoS 


€————————————— H-"——— ————ÓÁAXEE-— 
Se9uio2ingo uo $129)j3 ,SJ018|pejJ [BuO[1BJ6H UO SJOJBIAPOW Jo eouengu| 


S 3 18V.L 


te 


148 / Journal of Marketing, October 2006 


investments normally generate customer relationship bene- 
fits, but in some cases, an investment may not be desired or 
generate any actual benefit. Relationship investment has the 
least impact on commitment (r = .34) of all the relational 
mediators (mean of other relational mediators, r = 52), with 
no overlap in the CIs. Thus, sellers can strengthen their 
overall relationships through investments (possibly by gen- 
erating feelings of reciprocity), but the relative impact on 
customer commitment is minimal. Alternatively, customer 
relationship benefits have the greatest impact on customer 
commitment (r — .51), especially compared with customer 
trust (r = .33, no overlap in CI), which suggests that cus- 
tomers value these benefits and want to maintain them. This 
discrepancy may occur because many relationship invest- 
ments do not generate value for the customer and therefore 
do not lead to customer commitment. Although investments 
that do not generate customer value may strengthen rela- 
tionships by generating debts of reciprocity, they will not 
necessarily generate an enduring desire to maintain a valued 
relationship. 

As we might have expected, dependence has a greater 
positive effect on commitment (r = .37) than the other medi- 
ators (mean of other relational mediators, r — .19, no over- 
lap.in CD, which reflects customers' desire to maintain a 
relationship with the seller on which they are dependent. 
The relatively limited effect of dependence on customer 
trust (r = .21) may be due to customers’ concerns that sell- 
ers will take advantage of their dependence. 

Although similarity often is hypothesized to influence 
trust by reducing uncertainty and serving as a cue to facili- 
tate goals, we find that similarity actually has a greater 
impact on commitment (r = .63) than on trust (r = .41). This 
greater impact on commitment might be explained by 
research on stereotype behaviors, which suggests that 
people want to strengthen and maintain relationships with 
"jn-group" members and that similarity is a proxy for cus- 
tomers' perceptions of a seller's fit with their in-group 
(Devine 1995). 

The influence of interaction frequency on trust is much 
greater (r = .30) than that of the three other mediators (mean 
of other relational mediators, r = —.01, no overlap in CT). As 
customers interact more frequently with sellers, they appear 
to gain more information about their partner, which reduces 
their uncertainty about future behaviors and improves trust; 
however, the frequency of their interaction has little effect 
on other relational mediators. 

We now turn our attention to the back half of the model, 
the relational mediator — outcome linkage (Table 4), for 
which we find that relational mediators have, differential 
effects on most of the outcomes studied. Commitment (r = 
.58) has the greatest influence on customer loyalty (mean of 
all other relational mediators, r = .47, no overlap of CD, as 
we might expect from these two similar constructs. 

Moreover, relationship quality has the greatest influence 
on objective performance (r = .63), followed: by trust (r = 
.35), relationship satisfaction (r — .32), and commitment 
(r = .27), and the Cls of relationship quality, trust, and com- 
mitment do not overlap. These findings indicate that RM 
researchers may need to take a multiple mediator or com- 
posite view when they measure customers' relationships to 


capture their impacts on objective performance. Different 
dimensions of a relationship may be synergistic, and supe- 
rior performance may be possible only when the relationship 
is sufficiently strong on all critical aspects. Trust (r = .73) is 
most critical for cooperation compared with the other medi- 
ators (mean of other relational mediators, r = .66, no overlap 
of CD), in support of its role in coordinating actions among 
partners to create value and achieve mutual outcomes. 


Discussion 


We provide evidence that the intervening role of relational 
mediators between RM strategies and exchange outcomes is 
more complex than is currently suggested in the extant 
research, but the fundamental premise that RM and strong 
relationships positively affect performance is well sup- 
ported. Several of our findings offer important implications 
for improving the effectiveness of RM research and practice 
(for a summary of key findings and implications, see Table 
6). 

First, RM strategies/antecedents have a wide range of 
effectiveness in terms of generating strong relationships, 
though specific strategies appear to be most effective for 
strengthening specific aspects of a relationship. Overall, 
expertise and communication are the most effective 
relationship-building strategies across all elements of a rela- 
tionship, whereas the other strategies often have differential 
effects across the different mediators. For example, generat- 
ing relationship benefits, promoting customer dependency, 
and increasing similarity to customers are more effective 
strategies for increasing customer commitment than for 
building trust, whereas relationship investment and interac- 
tion frequency have the opposite effect. Therefore, when 
comparing the relative effectiveness of RM strategies, the 
results depend on the relational mediator under investiga- 
tion. These findings indicate that RM may be improved by 
taking a more fine-grained approach in which managers tar- 
get RM strategies at specific relational weaknesses. 

Second, we find that objective performance is influ- 
enced most by relationship quality (a composite measure of 
relationship strength) and least by commitment, which sup- 
ports a multidimensional perspective of relationships in 
which no single or "best" relational mediator can capture 
the full essence or depth of a customer-seller relationship 
(Hennig-Thurau, Gwinner, and Gremler 2002; Johnson 
1999). Previous research (Berry 1996; Doney and Cannon 
1997; Spekman 1988) that offers either commitment or trust 
as the key, central, or cornerstone relational mediator may 
be focused too narrowly; a relationship may be truly effec- 
tive only when most or all of its key aspects are strong. 
Therefore, research that focuses only on commitment and 
generalizes from its impact on customer intention or inter- 
mediate behaviors to its effect on seller performance may 
prove misleading. For example, commitment has the great- 
est impact on customer loyalty and the smallest impact on 
objective performance. 

Third, the large, direct effects of dependence and rela- 
tionship investment on seller objective performance in the 
causal model suggest that these antecedents influence per- 
formance through alternative, mediated pathways. Although 
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TABLE 6 
Summary of Key Findings and Implications 





Key Findings 


Antecedents 
Relationship marketing strategies/antecedents have a 
wide range of effectiveness for generating strong 
relationshlps. Expertise and communication are most 
effective, then relationship Investment, similarity, and 
relationship benefits; dependence, frequency, and 
duration are relatively ineffective. 


The negative impact of conflict is larger In magnitude 
than the positive effect of any other RM strategy. 


Specific RM strategies appear most effective for 
strengthening one aspect of a relationship. 
Relationship benefits, customer dependency, and 
similarity are more effective for increasing commitment 
than for building trust; the opposite Is true for 
relationship investment and frequency. 


Outcomes 
Relationship quallty (a composite measure of relatlonship 
strength) has the greatest Influence on objective 
performance, and commitment has the least. 


Surprisingly, relationship investment has a large, direct 
effect on seller objective performance, In addition to its 
frequently hypothesized Indirect mediated effect. 


Dependence has a large, direct effect on seller objective 
performance but a relatively small Impact on relational 
mediators. 


Of all outcomes, relationships have the greatest Influence 
on cooperation and WOM and the least on objective 
performance. 


Moderators 
Relationship marketing is typically more effective when 
relationships are more critical to customers, such as for 
(1) service versus product offerings, (2) channel versus 
direct exchanges, and (3) business versus consumer 
markets. 


Customer relationships often have stronger effects on 
exchange outcomes when thelr target is an individual 
person than when it is a selling firm. 


Research and Managerial Implications 


Selection and training of boundary spanners is critical; 
expertise, communication, and similarity to customers are 
some of the most effective relationship-building 
strategies. Expertise’s impact supports Vargo and Lusch’s 
(2004) premise that “skills and knowledge” are the most 
Important seller value-creation attributes. 


All proactive RM efforts may be wasted if customer 
conflict is left unresolved. 


Relationship marketing may be improved through a fine- 
grained approach that targets specific relational 
weaknesses. The relative effectiveness of RM 
strategies depends on the relational mediator 
investigated. 


No single relational mediator captures the full essence or 
depth of a customer-seller relationship; the findings 
support a multidimensional perspective of relationships. 
Extant research focused on a single relational medlator 
may provide misleading guidance. 


The classic mediating model of RM (Morgan and Hunt 
1994) should be adapted to Include alternative mediated 
pathways (e.g., reciprocity). 


Dependence is not an effective relationship-bulldlng 

strategy but can improve performance in other ways, 

possibly by increasing switching costs and barriers to 
exit. 


Relationship marketing efforts may be effectively 
extended across many other nontraditional buyer—seller 
interactions (e.g., Interdepartmental groups) for which 
cooperation Is often critical for success. Word-of-mouth 
behaviors may be the best discriminator of true customer 
loyalty (Reichheld 2003). 


Researchers must take care when extending findings 
across contexts in which relationship Importance may 
vary. Managers might target RM expenditures to 
customer segments with the highest deslre for strong 
relationships to improve returns. 


Hesearchers should differentiate the effects of customer 
relationships with boundary spanners from those with 
firms. Strategies such as team selling, salesperson 
disintermediation, and the use of call centers should be 
evaluated In light of the impact of interpersonal 
relationships. 





dependence is not very effective at building relationships, it 
can improve performance by increasing switching costs and 
barriers to exit, which may make it an effective 
performance-enhancing strategy but not an effective RM 
strategy. However, relationship investment both builds cus- 
tomer relationships and directly improves performance, 
which suggests that the extant relational-mediated frame- 
work is not comprehensive and that additional mediators 
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(e.g., reciprocity) must be investigated to explain the impact 
of RM on performance fully. The importance of capturing 
the direct effect of relationship investment (B = .34) is rein- 
forced by its greater impact on objective performance than 
the effect of the relational mediators (B = .16) in the causal 
model. 

Fourth, the findings that strong relationships appear to 
be more effective for building customer loyalty and improv- 


ing seller performance for (1) service versus product offer- 
ings, (2) channel versus direct exchanges, and (3) business 
versus consumer markets lend support to the premise that 
RM may be a more effective strategy in situations in which 
relationships are more critical. This finding calis into ques- 
tion sellers’ efforts to force RM strategies in, contexts in 
which the customer's relational needs are unclear; it also 
may explain the less-than-desirable results of RM on per- 
formance that have been documented in previous studies 
(e.g., Reinartz and Kumar 2003). Because these situational 
moderators are coarse proxies for customers' relationship 
needs, RM effectiveness likely varies across other factors 
that influence customers' needs for strong relationships. In 
turn, researchers must take care when extending RM 
research to these different contexts. 

Fifth, the results suggest that customer relationships 
have stronger effects on exchange outcomes when their tar- 
get is an individual person than when their target is a selling 
firm. Thus, RM strategies focused on building interpersonal 
relationships between boundary spanners (e.g., dedicated 
salesperson, social entertaining) may be more effective than 
those focused on building customer-firm relationships (e.g., 
team selling, frequency-driven loyalty programs). Social 
psychology's individual and group judgment theory 
(Hamilton and Sherman 1996, p. 336), which posits “differ- 
ences in the outcomes of impressions formed of individual 
and group targets, even when those impressions are based 
on the very same behavioral information,” has several 
implications for the marketing domain and may provide a 
parsimonious explanation for previous marketing research 
(Doney and Cannon 1997; Iacobucci and Ostrom 1996). 
The post hoc finding that conflict has a more negative 
impact (p < .01) on customer-firm relationships than on 
customer-individual relationships is also consistent with 
this theory because judgments about individuals are more 
resilient to disconfirming events than are judgments about 
groups (Hamilton and Sherman 1996). Thus, managers may 
want to use boundary spanners or salespeople rather then 
centralized service centers to resolve conflicts because cus- 
tomers' relationships with salespeople may witbstand con- 
flict better than their relationships with selling firms. 


Managerial Implications 


Most promising for managers is that five of the strategies 
with the greatest impact are either seller focused or dyadic, 
in support of the effectiveness of proactive relationship- 
building strategies undertaken by sellers. Business execu- 
tives focused on building and maintaining strong customer 
relationships should note that the selection and training of 
boundary spanners is critical; expertise, communication, 
and similarity to customers are the most effective 
relationship-building strategies. The next most effective 
strategy is for managers to make relationship investments 
and generate relationship-based benefits for customers; fur- 
thermore, relationship investment has the added benefit of 
influencing performance directly. However, managers must 
recognize that these proactive efforts will be wasted if they 
leave customer conflict unresolved because the negative 
influence of conflict on customer relationships is greater in 
magnitude than that of any other strategy. Thus, some firms 


could generate higher returns by reallocating their RM 
investments to conflict resolution. Extending service recov- 
ery research into the RM domain to develop strategies for 
“relationship recovery" also might be worthwhile. A strat- 
egy of increasing customer dependence does not appear to 
be an effective way to build relationships, but it seems to 
influence seller performance directly. Neither relationship 
duration nor interaction frequency is a good driver of strong 
customer relationships. 

Of all the outcomes we analyze, relationships have the 
greatest influence on cooperation and WOM. The impact on 
cooperation implies that RM efforts may be effectively 
extended across many nontraditional buyer-seller interac- 
tions (e.g., alliances, interdepartmental groups); in these 
situations, cooperation is often critical for success. Simply 
stated, firms that depend on WOM strategies for new cus- 
tomers should implement effective RM programs. 

Some results indicate that a more targeted effort may 
improve RM efficiency. Because RM strategies appear to 
operate through different mediators that affect outcomes 
differentially, a manager who desires cooperation between 
two groups after a merger and who recognizes that trust iS 
the relational mediator with the greatest influence on coop- 
eration should select RM strategies that influence trust best 
(i.e., communication and interaction frequency). Marketers 
with a portfolio of customers, channels, and products could 
improve the return on their RM expenditures by targeting 
their spending toward segments in which RM is more likely 
to pay off, such as customers who purchase more services, 
channel versus direct customers, and business versus con- 
sumer segments. 

Managers may also want to leverage the potentially 
stronger impact on customer loyalty and seller performance 
in relationships that involve an individual boundary span- 
ner. For firms that experience low turnover, focusing their 
RM efforts on building customer-salesperson bonds may be 
a productive strategy, though developing strong relation- 
ships may prove difficult for firms that want to move cus- 
tomers from dedicated salespeople to offshore call centers 
for various reasons. These firms should recognize that a 
lack of seller expertise, dissimilarities between boundary 
spanners and customers, ineffective communication, and 
shifts from interpersonal to person-firm relationships can 
negatively affect customer-seller relationships. 


Limitations 


Meta-analyses have several strengths, but they also contain 
inherent limitations. First, the constructs we include are con- 
strained to variables for which sufficient primary data are 
available. Thus, our framework should be considered a sum- 
mary of the most commonly studied RM constructs, not an 
exhaustive list or even a list of the most important con- 
structs. For example, mutual dependence, seller disclosure, 
and functional conflict have been shown to be important 
constructs for RM, but because of data unavailability, we 
could not include them in our meta-analysis. Second, hetero- 
geneity in effect sizes remained even after we accounted for 
any variability due to the moderator variables in the study, 
which indicates that the effect sizes we report should be con- 
sidered averages and may vary with the inclusion of unmea- 
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sured moderating conditions. Third, because of the limited 
number of studies for some moderator variables, our study 
has limited power to reject null hypotheses. 


Future Research Directions 


After nearly two decades of RM research, marketers’ efforts 
may need to shift from significant testing to identifying 
which, and in what conditions, RM strategies generate the 
highest return on RM investment. Our synthesis of the 
extant literature identifies several avenues that require fur- 
ther study. 

Research should expand the constructs included in our 
RM-mediated framework and determine which aspects or 
dimensions should be included to obtain a multifaceted 
view of relational exchanges. Although commitment and 
trust play critical roles, other candidates might include rela- 
tionship satisfaction, exchange efficiency, equity, relational 
norms, and reciprocity. The absence of any measure of reci- 
procity between exchange partners is especially notable 
because it has been identified as “the core of marketing 
relationships” (Bagozzi 1995, p. 275) and may help explain 
the pattern of effects surrounding the impact of relationship 
investments and benefits on relational mediators. Integrat- 
ing reciprocity into the relational-mediating framework may 
also explain the large, direct effect of relationship invest- 
ment on performance, such that people’s inherent desire to 
repay “debts” generated by sellers’ investments may lead to 
performance-enhancing behaviors, independent of trust or 
commitment. 

In addition to taking a multidimensional perspective of 
relationships, the scope of RM research should expand to 
investigate potential interactions among the relational medi- 
ators and identify relational synergies. For example, the 
strong linkage between relationship quality and objective 
performance may be due to interactions among the different 
facets of a relationship. 

Even some of the high-impact antecedents and impor- 
tant outcomes in our framework appear in relatively few 
primary studies (i.e., conflict, seller expertise, and WOM), 
which suggests the need for additional research. Seller 


expertise, beyond product-specific expertise, might include 
overall customer knowledge, industry expertise, creativity, 
process knowledge, and intraorganizational facilitation. 
Strategies that remedy conflict-laden events, such as service 
or relationship recoveries, also are critical to incorporate 
into both practice and further research. 

The relatively small correlations between customer 
focal antecedents (relationship benefit, dependence on 
seller) and relational mediators are surprising because we 
took the relational mediators from the customer’s perspec- 
tive as well. This finding may be due to a misspecification; 
we may not have studied some critical customer-focused 
antecedents. Thus, researchers should investigate other 
customer-focused antecedents, such as perceived exchange 
efficiency, perceived relationship investments, and liking, to 
identify other key drivers of a strong relationship from the 
customer’s perspective. 

The heterogeneity across nearly all linkages, even after 
we account for the moderators we included, demands 
research to determine other moderators that may influence 
RM effectiveness (e.g., relationship age, customer control, 
customer involvement, relationship orientation of the cus- 
tomer). For example, as the customer’s need for a relation- 
ship increases, RM strategies may become more effective. 
Thus, researchers should develop a measure of the relation- 
ship orientation of the customer to support the segmentation 
of RM efforts. Marketers could then target their RM efforts 
toward customers with the highest susceptibility for RM. 
Contrary to most existing RM research, our results and 
social psychology theory suggest that researchers should 
differentiate between individual-individual and individual- 
firm relationships. 

In summary, we provide insight into the most effective 
RM strategies, the conditions that moderate this effective- 
ness, and how the links between both antecedents and con- 
sequences of relational mediators depend on the mediator 
being investigated. These insights provide managers with 
opportunities to improve the returns on their RM invest- 
ments and researchers with directions to build more robust 
models of the influence of RM on outcomes. 
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Achieving Marketing Objectives 
Through Social Sponsorships 


The corporate social responsibility literature has focused on creating broad associations, such as simply being 
“socially responsible,” and on the simple transfer of positive affect from sponsored causes to sponsoring firms. Both 
views fail to recognize the power of social initiatives as a means for differentiating among socially responsible firms 
and, in particular, for reinforcing the brand's positioning. The authors adopt a more traditional branding perspective 
and show that the fit between a firm's specific associations and a sponsored cause can reinforce or blur the firm's 
positioning, influence liking for the sponsorship, and bolster or undermine the firm's equity. They also show that 
communications decisions can mitigate the negative effects of low fit. Finally, they show that sponsorship effects 


can persist for as long as a year despite day-to-day exposure to other brand communications. 


popular promotional tool. The logic is that con- 

sumers will view a firm more favorably if it supports 
social causes, and surveys suggest that this 1s true.! For 
example, 80% of respondents to a national survey report 
that social sponsorships create a more favorable impression 
of the firm; two-thirds of respondents also report that they 
would be more likely to purchase from a firm that supports 
social causes (Reilly 2000). However, a variety of theory 
and evidence indicate that the impact of social sponsorships 
may be more complex (Hoeffler and Keller 2002; Jagre, 
Watson, and Watson 2001; Lichtenstein, Drumwright, and 
Braig 2004; Pracejus and Olsen 2004; Rifon et al. 2004; cf. 
sen and Bhattacharya 2001) and that when the sponsored 
cause is viewed as not fitting with the firm's image, spon- 
sorships may even be harmful (Speed and Thompson 2000). 
We test this proposition and show that high-fit sponsorships 
(1.e., those for which partners are perceived as congruent on 
some key dimension) can increase brand equity, whereas 
low-fit sponsorships (i.e., those for which partners are per- 
ceived as incongruent) can dilute brand equity. Further- 
more, we show that these effects occur because low fit blurs 
the firm's positioning and decreases the favorability of atti- 
tudes toward the sponsorship. 


Cane support of social causes has emerged as a 


ISponsors can be either firms or brands, but for simplicity, we 
use “firms.” 
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Recognizing that not all firms have a natural high fit 
with a chosen cause, we also show that the negative effects 
of low fit can be reduced (1) through communications deci- 
sions that create perceived fit and (2) by altering the mes- 
sage source. Creating fit—that is, making salient how the 
two organizations are congruent—may be as effective as a 
naturally high-fit cause, whereas communicating through 
the sponsored organization rather than through the firm can 
reduce the perception of low fit and its associated risks. 
Finally, we show that fit effects may persist for as long as a 
year. 


Contribution 


The idea that low fit between a brand and other entities with 
which it is associated (e.g., extensions, alliance partners) 
can be harmful is not a new one (e.g., Aaker 1990; Keller 
and Aaker 1992; John, Loken, and Joiner 1998; Milberg, 
Park, and McCarthy 1997). However, we make several con- 
tributions. First, it might be expected that intrinsically posi- 
tive activities, such as social sponsorships, are less likely to 
suffer from low fit than are purely commercial activities, 
such as brand extensions or alliances. In contrast, we show 
that the benefits of intrinsically favorable actions can be 
mitigated and even reversed by low fit (cf. Hoeffler and 
Keller 2002). Second, the corporate social responsibility lit- 
erature has focused on creating broad associations, such as 
being "socially responsible," and on the simple transfer of 
positive affect from sponsored causes (Brown and Dacin 
1997; Rifon et al. 2004; Smith 2004; cf. Sen and Bhat- 
tacharya 2000). We adopt a more traditional branding per- 
spective and show that the fit between a firm's specific 
associations and a sponsored cause can reinforce or blur the 
firm's positioning. We also show that the unexpectedness of 
low fit leads to increased elaboration on the sponsorship 
and that this elaboration is negatively biased, leading to less 
favorable attitudes toward the sponsorship. Third, we 
extend work on how message variables moderate the effects 
of fit (Bridges, Keller, and Sood 2000), showing that mes- 
sage content and source can each reduce the unfavorable 
effects of low fit. Fourth, we offer the first experimental evi- 
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dence of the enduring effects of sponsorship, showing 
effects that endure for up to a year. Fifth, we add to the lit- 
erature on sponsorship recall. Prior work has shown a bias 
in favor of recalling semantically related pairings (Johar 
and Pham 1999; Pham and Johar 2001). We show that com- 
munications that create fit between semantically unrelated 
organizations can lead to high levels of recall even after a 
one-year delay. 


Theoretical Background 


What Is Fit, and Why Does It Matter? 


In the management literature, the fit construct embodies the 
idea of transferability of expertise or synergies in activities, 
such as when there is similarity in products, technologies, 
or markets (Rumelt 1974) or complementarity of skills and 
activities (Porter 1987). A similar conceptualization under- 
lies Sen and Bhattacharya’s (2001) examination of fit in the 
domain of internal social initiatives. In the brand extension 
and brand alliance literature, the focus is often on con- 
sumers’ perceptions of fit. Although it is assumed that ideas 
of transferability and synergy often underlie perceived fit, it 
is recognized that what is transferred or enhanced may be 
intangible associations (Aaker and Keller 1990; Broniar- 


czyk and Alba 1994; John, Loken, and Joiner 1998; Licht- 
enstein, Drumwright, and Braig 2004; Loken and John 
1993; Simonin and Ruth 1998). We adopt a similar 
consumer-based conceptualization. Fit between a firm and a 
sponsored cause is high when the two are perceived as con- 
gruent (i.e, as going together), whether that congruity is 
derived from mission, products, markets, technologies, 
attributes, brand concepts, or any other key association 
(Bridges, Keller, and Sood 2000; Park, Milberg, and Law- 
son 1991). 

Fundamentally, fit matters because high-fit sponsorships 
are consistent with what would be expected from the firm, 
whereas low-fit sponsorships are not. As we discuss in 
greater detail subsequently, this difference affects the clarity 
of the firm's positioning, and it alters processing of spon- 
sorship information to influence how well the sponsorship 
is liked. In turn, clarity and attitude toward the sponsorship 
influence affective and behavioral responses to the firm, 
two key components of equity (Keller 1993). We outline 
these propositions in Figure 1. 


Clarity of positioning. Consistency in communication 
helps build clarity, or the extent to which people know what 
to expect from the firm (Erdem and Swait 1998; Keller 
1993; Park, Jaworksi, and MacInnis 1986). Because high-fit 


FIGURE 1 
Conceptual Framework and Study 1 Path Estimates (In Boxes) 
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Notes: Path estimates are standardized regression coefficients (b*). All ps < .001. We also examined elaboration as a mediator of the effect of 
fit on attitude toward the sponsorship. We find that the effect of fit on attrtude is partially mediated by elaboration on the sponsorship 
(Dft, eteboraton = -49, p < .001; D&aboraton, attitude = - 19, p < .01, and Ds, atitude = .70, p < .001). The direct effect of fit may occur because 
low fit Is negatively valued; however, this result may simply reflect that our thought measures do not capture intensity in the way that our 
scaled measures do (i.e., thoughts are coded as positive, neutral, or negative). 
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sponsorships are consistent with what people expect from 
the firm, such sponsorships should increase clarity. Because 
low-fit sponsorships are inconsistent with what people 
expect from the firm, such sponsorships should reduce 
clarity. 


Attitude toward the sponsorship. On its surface, a social 
sponsorship is an intrinsically favorable action, one that we 
expect consumers to like. However, the interpretation of 
marketing activities, as is the case with the interpretation of 
any behavior, often involves two stages (Campbell and Kir- 
mani 2000; Gilbert 1989; Johar and Simmons 2000). People 
first make a relatively effortless inference based on a sur- 
face interpretation (e.g., a salesperson’s flattering remark 
reflects his or her true beliefs, a product with an unfamiliar 
brand name is of low quality). Then, and only if they allo- 
cate sufficient processing capacity, people may "correct" 
this inference through a more effortful process that accounts 
for other accessible inputs (e.g., the salesperson has an 
interest in ingratiating him- or herself with the customer, an 
unfamiliar brand is made by a well-respected firm). Simi- 
larly, we expect that a social sponsorship will be interpreted 
according to its surface meaning—that is, as a good act that 
is worthy of liking—unless people engage in further elabo- 
ration, in which case they may consider any other accessi- 
ble inputs to judgment. Low fit plays a dual role in this 
process: (1) It increases cognitive elaboration, and (2) it 
makes countering (negative) inputs accessible. 

Why would these effects occur? Low fit should increase 
elaboration because unexpected events engender thought, 
often in the form of an attributional search to understand the 
reasons for the event (Weiner 1985). Low fit should make 
countering inputs accessible because low fit (i.e., incon- 
gruity, unexpectedness) is negatively valued (Mandler 
1982). Through a process of attitude priming (Fazio and 
Williams 1986; Houston and Fazio 1989), this negative 
affect should engender other negative thoughts, leading to a 
less favorable attitude toward the sponsorship. Similar 
effects of fit on attitudes have been found for brand 
allances and extensions (Aaker and Keller 1990; Boush 
and Loken 1991; Broniarczyk and Alba 1994; Simonin and 
Ruth 1998), but in neither of these domains have increased 
elaboration and specific negative cognitions been 
demonstrated. 


Firm equity. In turn, clarity of positioning and attitude 
toward the sponsorship should influence affective and 
behavioral responses to the firm, two key components of 
equity (Keller 1993). Clarity should have an effect because 
it alters the value of the brand as a signal. When clarity is 
high, buyers who value the brand's benefits perceive less 
risk in choosing the brand and feel less need to gather infor- 
mation to make that choice (Erdem and Swait 1998). Atti- 
tude toward the sponsorship should have an effect because 
it is a new affective association for the firm. Consistent with 
this prediction, responses to brands have been shown to be 
correlated with attitudes toward brand alliances (Simonin 
and Ruth 1998) and extensions (Erdem 1998; Sullivan 
1990). In summary, we expect that the firm's equity 
increases with high-fit sponsorships and decreases with 
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low-fit sponsorships and that clarity of positioning and atti- 
tude toward the sponsorship mediate these effects. 


Hypotheses 


On the basis of the foregoing discussion, we propose the 
following two hypotheses: 


H,: Low-fit (versus high-fit) sponsorships lead to (a) 
decreased clarity of positioning, (b) greater cognitive elab- 
oration (i.e., more thoughts), (c) less favorable thoughts, 
(d) a less favorable attitude toward the sponsorship, and 
(e) reduced equity, as measured by affective and behav- 
ioral responses to the firm. 

H,: Clarity of positioning and attitude toward the sponsorship 
mediate the effect of fit on equity. 


Study 1: Natural Fit 


We tested these hypotheses in an experiment in which we 
asked respondents to read a press clipping that contained 
four business news items. One item reported on the opening 
of an expanded online store by one of two firms and either 
did or did not mention the firm's sponsorship of a nonprofit 
organization. When a sponsorship was mentioned, the spon- 
sored organization was either high or low in fit with the 
company's image, as revealed by pretests. After respondents 
read these items, they responded to our dependent 
measures. 

Because we are interested in the ability of sponsorships 
both to dilute and to enhance equity, we focus on familiar, 
well-liked firms. Furthermore, we focus on the effects of 
“natural fit’—that is, the extent to which the sponsored 
cause is perceived as being congruent with the image of the 
sponsor, independent of efforts to create a perceived fit 
between the organizations. For example, Alpo and the 
Humane Society are high in natural fit because both are 
strongly associated with pets; this association readily comes 
to mind. In contrast, Alpo and the Special Olympics are low 
in natural fit because they share no highly accessible associ- 
ation. Alpo is associated with pet food, and the Special 
Olympics is associated with athletic competitions for the 
mentally disabled. 

Natural fit is of interest for several reasons. First, firms 
may engage in low-fit sponsorships because of a sincere 
interest in the cause or a belief that fit is irrelevant. Second, 
cost efficiencies are greater if there is no need to spend on 
efforts to create fit. Finally, because marketers do not con- 
trol all the contexts in which consumers encounter informa- 
tion about their activities, a sponsorship that does not 
depend on such control for its effectiveness is highly 
attractive. 


Method 


Pretests. We attempted to identify a pair of equally 
well-liked, familiar companies (C4, and C5) that we could 
pair with a set of equally well-liked, familiar nonprofits (N, 
and N2), such that for Cj, N; would be a high fit and N5 
would be a low fit, whereas for Co, N; would be a low fit 
and N, would be a high fit. Thus, our fit manipulation 
would not be confounded with any differences between 
nonprofits. 


Alpo and Sports Authority paired with the Humane 
Society and the Special Olympics met our criteria. The two 
firms were equally well liked (Ms = 6.10 and 6.17; Fi 27 < 
1) and familiar (Ms = 6.50 and 6.57; Fi 27'< 1); we 
measured both liking and familiarity with three (seven- 
point) semantic differential items: “negative/positive,” 
“unfavorable/favorable,” and “bad/good” (Cronbach's a = 
.97), and “unfamiliar/familiar,” “did not recognize/ 
recognized,” and “had not heard of/had heard of’ (Cron- 
bach's & = .96). Furthermore, the two nonprofits were also 
equally well liked (Ms = 6.25 and 6.36; F, 2; = 1.63, p> 
-21) and familiar (Ms = 6.13 and 6.15; F; 2; < 1). Fit, which 
we measured as  "dissimilar/similar, ‘“inconsistent/ 
consistent,” “atypical/typical,” “unrepresentative/representative,” 
“not complementary/complementary,” “low fit/high fit,” and 
"does not make sense/makes: sense” (Cronbach's « = .99), 
varied as we expected (Ms = 6.45 and 1.80; Fy 25 = 500.09, 
p < .0001), and this effect did not differ by firm (F < 1). 
Furthermore, fit did not differ between firms for high-fit 
nonprofits (Ms = 6.65 and 6.25; F; 25 = 1.57, p > .22) or for 
low-fit nonprofits (Ms = 1.84 and 1.75; F, 95 < 1). 

Respondents and design. Two hundred thirty-six stu- 
dents in nonmarketing professional seminars participated in 
the study without compensation. Their average age was 35, 
and 52% were male. The design is a 2 (fit: high versus 
low) X 2 (sponsorship: present versus absent) x 2 (com- 
pany: Alpo versus Sports Authority) x 2 (measurement 
order: sponsorship questions before versus after equity 
measures) completely randomized between-subjects design. 
This design provides different controls for the low-fit and 
high-fit conditions (sponsorship: present versus absent is 
crossed with fit) to ensure that we isolate the effects of 
sponsorship per se. Particularly in the high-fit condition, 
simply thinking about the nonprofit and the firm at the same 
time may lead to more favorable thoughts about the firm, 
for example, being exposed to the Humane Society might 
make a person think about how much he or she likes dogs, 
which may make this person think more favorably about a 
firm such as Alpo, which produces dog food. Our control 
conditions isolate this effect from the effect of sponsorship 
because they expose respondents to both organizations in 
the absence of a sponsorship relationship. The measurement 
order factor allows us to determine whether effects on firm 
equity occur spontaneously or whether they occur only if 
consumers are directed to think about and evaluate the 
sponsorship. : 

Materials. The materials we used appear in Appendix A. 
Each clipping contains four news items. The first two items 
are constant across conditions. The third item varies across 
conditions and describes the promotion of an employee of 
either the Humane Society or the Special Olympics. The 
last item also varies across conditions and describes a new 
Web site for either Alpo or Sports Authority. In the sponsor- 
ship condition, it also announces the sponsorship program. 
There is a no-sponsorship control for every pairing of a 
company with a nonprofit. The only difference between no- 
sponsorship and sponsorship conditions is the presence of 
the sponsorship. 

Measures. We measured fit as we did in the pretest. We 
measured elaboration by asking respondents to list all the 


thoughts they had while thinking about the sponsorship. 
Two coders, who were blind to the conditions, categorized 
thoughts as thoughts about fit (e.g., "The organizations 
seem to go together," "Pet food and the Special Olympics 
have nothing to do with each other"), the sponsorship (e.g., 
"This is a great thing," "It is actually a little offensive"), 
efficacy for the firm ("They may open more of a market for 
themselves,” “I feel that Alpo will not sell more food"), the 
firm's motivation (e.g., "Seems as though Alpo is trying to 
help," "What a cry for publicity"), and other thoughts (e.g., 
“My dog eats Alpo,” “T like pets"). Fit thoughts were further 
classified as high, uncertain, or low fit. Inferences about the 
firm’s motivation were classified as social, uncertain, or 
self-interested motivation. All other thoughts were classi- 
fied as favorable, neutral, or unfavorable. Agreement 
between coders was 81%, and there were no significant dif- 
ferences in their coding for any thought category (ps > .45). 
Discrepancies were resolved by discussion. 

We measured clarity of positioning by rated agreement 
(seven-point scales) with the items "clearly communicates 
what it stands for," "has an image that is difficult to under- 
stand," and "conveys a clear image in all its actions" (Cron- 
bach's a = .94). We measured attitude toward the sponsor- 
ship with the same attitudinal items used in the pretest for 
companies and nonprofits. We measured firm equity with 
these same three attitudinal measures and three behavioral 
intention measures, the rated likelihood to “consider,” “pur- 
chase,” and “recommend” (Cronbach's œ = .96).2 


Procedure. We told respondents that we were interested 
in their responses to news stories. They read the clipping 
and then responded to our dependent measures. We pre- 
sented questions about the sponsorship and the firm in 
counterbalanced order, followed by all other measures. We 
then debriefed respondents and told them that the articles 
were fictitious. 


Results 


The results conform to our predictions. When fit is low ver- 
sus high, clarity of positioning is lower, more thoughts are 
generated in response to the sponsorship, these thoughts are 
less favorable, attitude toward the sponsorship is less favor- 
able, and firm equity (operationalized as affective and 
behavioral responses to the firm) is lower. Furthermore, 
clarity and attitude toward the sponsorship mediate the 
effect of fit on firm equity. Except where we subsequently 
note, none of these effects vary as a function of measure- 
ment order (ps > .11) or company (ps > .30). We present 
means in Table 1. 


Manipulation check. Perceived fit varies as we expected 
(Ms = 1.59 and 6.45; Fir. 152 = 3574.97, p< 0001), and it 
does not vary by firm (F, 152 = 1.83, p > .17). Furthermore, 
perceived fit does not vary by exemplar in the high-fit con- 
dition (Ms = 6.48 and 6.42; F; 76 < 1) or in the low-fit con- 
dition (Ms = 1.51 and 1.67; F; 76 = 1.61, p > .20). 


2Because not everyone buys pet supplies or sports-related prod- 
ucts, we asked these questions as follows: "If you or a friend had 
the need to buy the kind of products sold by (Alpo, Sports Author- 
ity), how likely would you be to...?" 
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Clarity of positioning. As H,, predicted, when fit is low 
rather than high, respondents perceive the firm’s positioning 
as less clear (Ms = 2.86 and 6.17; F;, 151 = 744.70, p< 
.0001). Furthermore, control group comparisons show that 
clarity is lower with a low-fit sponsorship than with no 
sponsorship (Ms = 2.86 and 5.15; F, 113 = 205.75, p < 
.0001) and is higher with a high-fit sponsorship than with 
no sponsorship (Ms = 6.17 and 5.09; F, 114 = 91.98, p < 
0001). 


Number and favorability of thoughts. As H,, predicted, 
more thoughts are generated when fit is Jow than when it is 
high (Ms = 3.53 and 2.07; Fy 147 = 34.88, p < .0001). 
Because low fit is unexpected, we expect that low fit is 
noted more often than high fit, but the overall increase in 
thoughts is not attributable only to thoughts about low fit. 
When we omit fit thoughts, the total number of thoughts 
remains higher with low fit (Ms = 2.64 and 1.82; Fy 147 = 
12.84, p « .0005). An examination of individual thought 
categories shows that though low fit leads to more thoughts 
about fit (Ms = .89 and 29i FE 147 = 32.76, P « .0001), it 
also leads to more thoughts about the sponsorship (Ms = 
1.10 and .79; Fy 147 = 4.60, p < .03). Thus, the increased 
elaboration associated with low fit is focused not solely on 
fit but on other aspects of the sponsorship as well. No other 
effects on the number of thoughts are significant. 

As H,, predicted, thoughts are less favorable overall 
when fit is low than when it is high (Ms = —.16 and 1.12; 
F; 147 = 39.65, p < .0001).3 Furthermore, this difference is 
not attributable solely to thoughts about fit. Thoughts 
remain less favorable with low fit when we omit fit thoughts 
(Ms = .41 and .93; F1, 147 = 7.54, p < .006). Consistent with 
the idea of attitude priming, individual thought categories 
reveal relatively little else systematic other than the overall 
pattern of less favorable thoughts with low fit. Low fit leads 
to less favorable thoughts about fit (1.e., more thoughts 
about low fit; Ms =~.57 and .19; Fy 147 = 59.08, p < .0001), 
the sponsorship (Ms = .11 and .48; F, 147 = 9.11, p < .003), 
and the efficacy of the sponsorship for the firm (Ms = —.14 
and .21; Fi, 147 = 20.93, p < .0001). This last difference is 
marginally greater for Alpo (F; 147 = 2.69, p < .10) but is 
significant for both companies (ps < .02). The effect on 
sponsorship thoughts varies by company (F; 147 = 3.77, p < 
.05) and holds only for Sports Authority (p « .001) but not 
for Alpo (p » .43). Thoughts about motivation vary by com- 
pany (Fi 147 = 10.08, p < .001), becoming less favorable 
(i.e., more about self-interest) with low fit for Alpo (Fy 74 = 
5.86, p < .01) but more favorable (i.e., more about social 
motivation) with low fit for Sports Authority (F 73 = 4.59, 
p < .03). : 


Attitude toward the sponsorship. As Hyg predicted, 
when fit is low rather than high, attitude toward the spon- 
sorship is less favorable (Ms = 4.17 and 6.52; F} 148 = 
195.06, p < .0001). The effect is marginally greater when 
the attitude measure is taken before the firm equity measure 
(Fi, 14g = 2.72, p < .10), but it is significant in either case 
(ps < .0001). 


3We coded thoughts about low fit and self-interested motivation 
as unfavorable thoughts. 


Firm equity. As Hie predicted, when fit is low rather 
than high, the firm's equity is lower (Ms = 4.93 and 6.50; 
F 148 = 260.21, p < .0001). Furthermore, control-group 
comparisons show that firm equity is lower with a low-fit 
sponsorship than with no sponsorship (Ms = 4.93 and 5.60; 
Fi 114 = 15.12, p < .0002) and is higher with a high-fit 
sponsorship than with no sponsorship (Ms = 6.50 and 5.62; 
Fi 114 = 72.99, p < .0001). The effect of a high-fit sponsor- 
ship on firm equity is marginally greater for Alpo (Fy, 114 = 
2.80, p < .09), but it is significant for both firms (ps < 
0001). 


Mediation. H, predicted that the effect of fit on equity is 
mediated by clarity of positioning and attitude toward the 
sponsorship. Our measurement model includes four latent 
variables: fit, clarity, attitude toward the sponsorship, and 
firm equity. Fit is represented by three composite measures, 
which we constructed by randomly dividing our seven items 
into one three-item parcel and two two-item parcels (Kish- 
ton and Widamon 1994; Marsh et al. 1998). Clarity and atti- 
tude toward the sponsorship are represented by our three 
single-item measures in each case. Firm equity is repre- 
sented by two composite measures, our three-item measure 
of attitude toward the firm and our three-item, measure of 
behavioral intention toward the firm. 

The measurement model provides a strong fit to our 
data. Although the chi-square is significant (65.79, d.f. = 
38, p < .003), the comparative fit index (.99; Bentler 1989) 
and the nonnormed fit index (.98; Bentler and Bonett 1980) 
are both acceptably high. Composite reliabilities for the 
latent factors all exceed .91, indicating high internal consis- 
tency (Hatcher 1994, adapted from the work of Fornell and 
Larcker 1981). Variance extracted estimates for the latent 
factors all exceed .84, indicating a high amount of variance 
captured by the factors, compared with error variance (For- 
nell and Larcker 1981). Furthermore, all factor loadings are 
high (>.87) and highly significant (p <-.001), indicating 
convergent validity. Although the factors are highly corre- 
lated, as our model predicts, discriminant validity is also 
good.4 All factors exhibit discriminant validity, as a chi- 
square difference test (ps < .001) and a confidence interval 
test (ps < .001; Anderson and Gerbing 1988) indicate. A 
variance extracted test shows discriminant validity for all 
pairs of variables except fit and clarity, for which the 
squared correlation between factors (.89) slightly exceeds 
the variance extracted for clarity (.86). Thus, only 1 of 18 
tests fails to show good discrimination; this is about the rate 
that would be expected to occur by chance at the standard 
alpha of .05. Furthermore, the one exception is for the 
method (variance extracted) that does not employ a signifi- 
cance test. For the other two methods, we employ the highly 
stringent alpha of .001. 


4Factor correlations: 
Attitude Toward 
Clarity the Sponsorship Equity 
Fit .94 78 83 
Clarity — 74 .84 
Attitude toward the 
sponsorship — — 19 
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Our theoretical model, which we illustrate with the 
ovals and paths in Figure 1, also exhibits strong fit. 
Although the model chi-square is significant (66.75, d.f. = 
40, p < .005), both the comparative fit index (.99) and the 
nonnormed fit index (.98) indicate acceptably high fit. Each 
standardized path coefficient (b*) is nontrivial (>.36) and 
significant (ps < .001). Furthermore, the R-square for all 
latent endogenous variables is sizable (.89, .61, and .77 for 
clarity, attitude toward the sponsorship, and firm equity, 
respectively). Finally, clarity and attitude toward the spon- 
sorship completely mediate the effect of fit. When a direct 
path from fit is added to the model, this path is not signifi- 
cant (b* = .16, p > .332), and model fit is not improved 
(y2 = .93, p > .334).5 

When we measure all variables with scaled measures 
collected at the same point in time, common methods vari- 
ance is a potential problem (Doty and Glick 1998). 
Although we cannot completely rule out common methods 
variance, we use methods to limit such bias. First, we vary 
measurement order, so that respondents do not always eval- 
uate firm-related measures before sponsorship-related mea- 
sures, or vice versa. Second, we use thought data to validate 
our process model, as described previously. Finally, in a 
regression-based mediation analysis (Kenny, Kashy, and 
Bolger 1998), we employ a nonscaled measure of fit (.e., 
our fit manipulation), and again, we find that the effect of fit 
on equity is mediated by clarity of positioning (z = 2.90, p < 
.003) and attitude toward the sponsorship (z = 4.58, p < 
.001). Because this analysis employs our fit manipulation, it 
also confirms fit as a causal variable within the context of 
the mediation analysis. 


Discussion 


A social sponsorship can be an effective way to enhance the 
firm’s equity; however, a poor choice of a cause can elimi- 
nate and even reverse this effect, creating actual harm, as is 
shown by our comparisons with a no-sponsorship control 
group. Low fit decreases clarity, leads to negative elabora- 
tion on the sponsorship and a less favorable attitude toward 
the sponsorship, and dilutes the firm's equity, as measured 
by affective and behavioral responses. These results have 
generality across exemplars and measurement contexts. The 
same firm benefits or is harmed, depending on which of two 
equally well-liked and familiar nonprofits (i.e., high fit or 
low fit) it sponsors. The same nonprofit organization has 
either a favorable or an unfavorable effect, depending on 
which of two equally well-liked and familiar firms (i.e., 
high fit or low fit) is its sponsor. The effects of fit on 
thoughts, attitude toward the sponsorship, and firm equity 
occur regardless of whether equity or sponsorship measures 
are taken first; thus, these effects do not depend on thoughts 
engendered by previous measures (Feldman and Lynch 
1988). Furthermore, our thought data validate our process 
interpretation. Even without the prompting provided by 
scaled measures, people think about whether a sponsorship 
is consistent with the firm’s image, they have more favor- 


5We also tested a model in which the effect of fit on attitude 
toward the sponsorship is mediated by clarity. It yielded signifi- 
cantly poorer fit than our theoretical model: X? = 67.73, p < .001. 
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able thoughts when they perceive a sponsorship as high fit, 
and these effects vary with our fit manipulation. 

Our mediation analysis confirms that fit affects firm 
equity through clarity and attitude toward the sponsorship. 
This result suggests that social sponsorships should work 
through both routes. A firm may gain a positive attitude 
toward a low-fit sponsorship by adding attractive elements 
(e.g., a celebrity spokesperson) but still blur the brand 
image and dilute equity if fit is low. Similarly, a firm may 
reinforce the brand image with a high-fit sponsorship but 
still dilute brand equity if the sponsorship is disliked for 
other reasons (e.g., heavy-handed promotional efforts). 

Finally, our data offer insights into a long-standing idea 
about fit. In the domains of brand extensions and corporate 
social responsibility, it has been argued that low fit engen- 
ders attributions that the firm is motivated by a desire to 
gain market value rather than by a desire to deliver value to 
consumers (Aaker 1990; Keller and Aaker 1992; cf. Brown 
and Dacin 1997). Rifon and colleagues (2004) find support 
for this prediction in the context of a sponsorship of a health 
information Web site. In contrast, we do not obtain this 
effect. Our thought listings (i.e., undirected responses to the 
sponsorships) show no consistent effect of fit on attributed 
motivations. Notably, we also took a subsequent scaled 
measure of motivation and obtained the predicted effect, but 
this measure did not exhibit discriminant validity. Thus, our 
data suggest that scaled measures of motivation, or any spe- 
cific belief related to the sponsorship, should be interpreted 
with caution. Although thought listings are imperfect as 
well, scaled measures prompt judgments about constructs 
(e.g., motivation) that might not have otherwise been salient 
for many respondents. When this occurs, respondents may 
construct a judgment (e.g., based on attitude priming) that 
they might not otherwise make (Feldman and Lynch 1988; 
Simmons, Bickart, and Lynch 1993). Thus, the use of alter- 
native measures, such as undirected thoughts or true behav- 
ioral measures, to validate that proposed responses occur 
spontaneously can greatly improve our understanding of 
how people respond to social sponsorships. 


Study 2: Created Fit, Message 
Source, and the Persistence of 
Effects 


Many firms may not have a high natural fit with many 
causes or may have an ongoing relationship with a low-fit 
cause, particularly with the current interest in corporate 
social responsibility and the common practice of sponsoring 
a standard set of causes (e.g., education, housing, health 
care). We now examine strategies that may help counter the 
negative effects of low fit. Specifically, we consider how 
message content can be used to create perceived fit and how 
message source may reduce the risks associated with low 
fit. We also examine how well the effects of the sponsorship 
persist over a one-year period. 


Created Fit 


Bridges, Keller, and Sood (2000) describe a relational com- 
munication strategy for brand extensions that is intended to 
increase perceived fit. When salient parent-brand associa- 


tions are not relevant to the extension category but non- 
salient associations are, the relational strategy involves 
emphasizing the relevant nonsalient associations. In the 
context of social sponsorships, there are at least two ways 
that a nonsalient shared association can be made salient and 
thus create perceived fit: (1) with product-related donations 
that evoke a shared association and (2) with messages that 
explain how the firm fits with the cause. As an example of 
using product-related donations to create fit, the Ford Motor 
Company (2002) recently helped make the vintage Red Bus 
Fleet of Glacier National Park safe for continued'use. With- 
out this donation, there is no apparent link between Ford 
and national parks; with it, the idea that parks require trans- 
portation services becomes salient. As an example of using 
a message to create fit, Den Tek Oral Care (2006), a sponsor 
of the American Diabetes Association, includes in its spon- 
sorship communications the information that diabetes can 
lead to tooth decay, bad breath, dry mouth, and gum dis- 
ease. Because many people may not know about diabetes- 
related dental problems, the sponsorship might otherwise 
seem to be a bad fit. 

To maximize the probability of successfully creating 
perceived fit, we use both strategies to create fit for one of 
our low-fit sponsorships from Study 1, namely, Alpo's 
sponsorship of the Special Olympics. Specifically, the spon- 
sorship announcement includes the information that the 
Special Olympics participants will receive pet-related gifts, 
and it also explains that caring for pets increases the self- 
esteem of mentally disabled children. We expect that if we 
are successful in creating fit, we will observe benefits simi- 
lar to those of natural fit. 


Message Source 


People are less critical of messages from sources they 
regard as unbiased or disinterested (Artz and Tybout 1999; 
Birnbaum and Stegner 1979; Weiner, LaForge, and Goolsby 
1990). We expect that people will perceive the firm as more 
self-interested than the sponsored nonprofit in communicat- 
ing its sponsorship activity. Study 1’s thought listings show 
that people are aware that sponsorships help achieve mar- 
keting goals; many respondents commented on the efficacy 
of the sponsorship for increasing sales, gaining publicity, 
and so forth. Although the sponsorship benefits the non- 
profit as well, that organization apparently has less to gain 
from communicating that the firm has given it support. 
Therefore, we expect that sponsorships are eváluated less 
critically when the message source is the sponsored non- 
profit than when it is the sponsoring firm. 

Because the issue of source bias has received relatively 
little systematic investigation, we consider ,alternative 
hypotheses about the form of this effect. Birnbaum and 
Stegner (1979) examine source bias in a study in which 
respondents estimated the value of a used car on the basis of 
its blue book value and the source's estimate. When they 
perceived the source as self-interested, their estimates were 
lower than when they perceived the source as disinterested. 
That is, source bias had a main effect on judgments. Thus, 
we might expect that a nonprofit message source will have a 
main effect on the favorability of responses to the sponsor- 
ship (i.e., a positive effect regardless of fit). 


In contrast, Artz and Tybout (1999; see also Wiener, 
LaForge, and Goolsby 1990) examine source bias as a mod- 
erator of responses to message content. People expect 
experts to speak in quantitative terms, whereas they expect 
nonexperts to speak in more qualitative terms. Messages 
that violate these expectations are less effective in creating 
positive attitudes but only if people perceive the source as 
self-interested. If people perceive the source as disinter- 
ested, they do not "penalize" the message when it violates 
their expectations. Thus, source bias moderates the effect of 
violated expectations on judgments. This finding is particu- 
Jarly relevant to our situation because low-fit sponsorships 
violate expectations, whereas high-fit sponsorships do not. 
Thus, we might expect a nonprofit source to increase the 
favorability of responses only when fit is low. 


Persistence of Sponsorship Effects 


Whether sponsorship effects endure is of crucial interest to 
sponsoring firms. We examine effects on firm equity and 
sponsorship recall after a one-year delay, during which 
respondents were exposed to brand messages only as they 
occurred routinely in their daily lives. Although our exami- 
nation of the persistence of effects on firm equity is largely 
exploratory, we expect that sponsorship recall is greater 
with created fit than with low fit. Prior research has shown 
that recall of sponsorship partners is not only based on pure 
recollection but also incorporates constructive processes 
(Johar and Pham 1999). When attempting to recall a spon- 
sorship, people use heuristics such as market prominence or 
semantic relatedness to help them identify sponsorship part- 
ners. À result of this partially constructive process is that 
people recall high-natural-fit sponsorships better than low- 
fit sponsorships. Because created fit involves creating a 
semantic link between otherwise unrelated organizations, 
we expect that people will recall created-fit sponsorships 
better than the same low-fit pairing. Both low fit and cre- 
ated fit begin with low-natural-fit pairings and thus engen- 
der elaboration, but the created-fit sponsorship gains an 
advantage from the creation of this associative link. Study 
1’s data suggest that this is not a common outcome of the 
elaboration for low-fit sponsorships. Only 6% of Study 1’s 
low-fit respondents reported finding such a link. 


Hypotheses 


On the basis of the foregoing discussion, we propose the 
following three hypotheses: 


Ha: Low-fit (versus created-fit) sponsorships lead to (a) 
decreased clarity of positioning, (b) a less favorable atti- 
tude toward the sponsorship, and (c) reduced equity, as 
measured by affective and behavioral responses to the 
firm. 


H,: A nonprofit message source (versus a company message 
source) leads to a more favorable response. This effect 
takes one of two forms: (a) greater clarity, a more favor- 
able attitude toward the sponsorship, and higher firm 
equity with a nonprofit message source, regardless of fit, 
or (b) greater clarity, a more favorable attitude toward the 
sponsorship, and higher firm equity, but only when fit 1s 
low. 

Hs: A created-fit sponsorship is better recalled than the same 
low-fit pairing 
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Method 


Materials (see Appendix B) and procedures are similar to 
those of Study 1. The design is a 2 (fit: low versus 
created) x 2 (message source: company versus nonprofit) 
factorial with a separate-sample, no-sponsorship control. 
Because Study 1 showed high generality of effects across 
exemplars and no substantive effects of measurement order 
for the effects of interest, we examine effects for only one 
company and always take sponsorship measures before 
company measures. We selected Alpo as the company and 
the Special Olympics as the nonprofit. The low-fit condition 
is identical to the low-fit condition in Study 1. In the 
created-fit condition, the sponsorship includes a gift of a pet 
and pet food to the Special Olympics participants. The gift 
of a pet is intended to evoke a connection of Alpo to pets 
and children; its products nourish the pets that are part of 
many happy childhoods. The announcement also offers an 
explanation of how the organizations fit together, noting 
that caring for pets increases the self-esteem of the mentally 
disabled. We collected all measures, except sponsorship 
recall, just after exposure to the sponsorship. We measured 
firm equity again after a one-year delay; we collected recall 
at the same time. One hundred twenty students from non- 
marketing professional training seminars participated in the 
study without compensation. Their mean age was 36, and 
49% were male. 


Results 


In general, the results conform to our predictions. As we 
expected, created fit offers benefits similar to those of natu- 
ral fit. Furthermore, a nonprofit message source (versus a 
company source) can increase clarity, the favorability of 
attitude toward the sponsorship, and firm equity. Evidence 
from our thought measures suggests that these effects are 
more reliable when fit is low. In addition, recall of the spon- 
sored cause is greater with created fit than with low fit. We 
made no prediction about the persistence of effects, but we 
find that the effect of fit on firm equity persists over a one- 
year period. 

Manipulation checks. As we expected, perceived fit is 
greater when fit 1s created than when it is low (Ms = 4.81 
and 2.17; Fy 76 = 125.98, p < .0001); this effect is not mod- 


erated by message source (p > .20). Perceived fit is also 
greater with a nonprofit than with a company as the source 
(Ms = 4.09 and 2.89; Fy 76 = 26.06, p < .0001). Although 
we did not predict this effect, it is consistent with the idea 
that low-fit sponsorships are not judged as harshly with a 
nonprofit message source. 


Clarity, attitude toward the sponsorship, and firm 
equity. We present means in Table 2. As H, predicted, clar- 
ity 1s lower, attitude toward the sponsorship is less favor- 
able, and firm equity is lower with low fit than with created 
fit (Ms = 3.72 and 5.96; F; 76 = 119.69, p < .0001; Ms = 
3.87 and 6.15; F; 76 = 79.19, p < .0001; and Ms = 4.77 and 
6.25; Fi, 76 = 67.87, p < .0001). As Hy, predicted, these 
judgments are more favorable with a nonprofit message 
source than with a company source (Ms = 5.43 and 4.25; 
F;, 76 = 32.88, p < .0001; Ms = 5.62 and 4.40; F; 76 = 22.96, 
p < .0001; and Ms = 5.74 and 5.27; Fy 76 = 6.79, p < .01). 
For clarity, there is also a fit x source interaction (F; 76 = 
6.15, p < .01), but it is not of the exact form that H3, pre- 
dicted, namely, that the simple effect of source will be sig- 
nificant only for low fit. Instead, the effect is larger for low 
ht, but it is significant in any case (F, 3g = 27.32, p < .001 
and F; 33 = 6.92, p < .01). No other fit x source interactions 
are significant. 

We made no predictions with regard to control-group 
comparisons; however, clarity is greater with created fit 
than with no sponsorship (for company source: Ms = 5.63 
and 5.19; Fy sg = 4.66, p < .03; for nonprofit source: Ms = 
6.30 and 5.19; Fi, 5g = 79.48, p < .0001) and is lower with 
low fit than with no sponsorship (for company source: Ms = 
2.88 and 5.19; Fi, sg = 276.09, p < .0001; for nonprofit 
source: Ms = 4.56 and 5.19; Fi, 5g = 6.80, p < .01). Firm 
equity exhibits a similar pattern (created fit: for company 
source: Ms = 6.00 and 5.45; F, 5g = 5.13, p < .02; for non- 
profit source: Ms = 6.49 and 5.45; F; 5g = 27.68, p < .0001; 
low fit: for company source: Ms = 4.55 and 5.45; F}, 5g = 
15.07, p < .0003; for nonprofit source: Ms = 5.00 and 5.45; 
F; 5g = 3.97, p < .05). Note that though a nonprofit source 
increases the favorability of these judgments, it never com- 
pletely overcomes the negative effects of low fit. 


Thought listings. Again, we use respondents' undirected 
thoughts to validate our process explanation. The effects of 


TABLE 2 


Study 2 Means 
Os Iml o 


Low Fit 
Company NPO 


Company NPO 


Created Fit 


Fit Means Source Means 


Control Source Source Source Source Low Created Company NPO 
Scaled Measures 
Clanty of positioning 5.191 2.8882 4,56b2 5.6382 6.30b2 3.728 5.965 4.258 5.435 
Attitude toward the 
sponsorship — 3.15 4.60 5.65 6.65 3.878 6.155 4.408 5.62b 
Firm equity 5.451 4.552 5.002 6.002 6.492 4.778 6.25b 5.278 5.74b 


ee eee ee 
Notes: We indicate significant main effects of fit and source by pairs of means with different alphabetical superscnpts under the “Fit Means" and 
"Source Means” headings, respectively. When there is a fit x source interaction, we indicate any significant simple effects by palrs of 
means with different alphabetical superscripts under the headings "Low fit” and "Created fit" All simple-effect means that differ signifi- 
cantly from the no-sponsorship control mean have a different numencal superscript from the control mean. All ps < .05. NPO = nonprofit 


organization. 
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created fit substantively replicate the effects of natural fit 
from Study 1; however, we do not report these effects in 
greater detail. With regard to the effects of message source, 
thought listings are consistent with the idea that a nonprofit 
message source makes respondents less critical of low fit. 
Fit thoughts exhibit a fit x source interaction (F, 75 = 3.76, 
p < .05); only when fit is low, a nonprofit source leads to 
marginally less unfavorable thoughts about fit than does a 
company source (Ms = —.10 and —42; F, 4; = 2.99, p < 
.09). For both number and favorability of thoughts about the 
sponsorship, there is a fit X source interaction (F1, 75 = 4.66, 
p < .03 and F}, 75 = 3.73, p < .05); only when fit is low, a 
nonprofit source leads to more thoughts and more favorable 
thoughts about the sponsorship (number: Ms = .42 and 
1.20; F) 37 = 6.47, p < .01; favorability: Ms = .05 and .60; 
Fi, 37 = 7.86, p < .008); again, only when fit is low (fit x 
source: Fy 75 = 4.34, p < .04), there is a decrease in the 
favorability of other thoughts with a nonprofit source (Ms = 
.52 and .05; Fy 37 = 8.44, p < .006). Although this proposi- 
tion is highly speculative, it may be that the negative atti- 


tude priming associated with low fit becomes diverted from - 


the sponsorship when a nonprofit source engenders less 
critical processing. There are no other effects on number or 
favorability of thoughts. 


Separate-sample tests. We conducted additional tests to 
increase our understanding of the created-fit and message 
source variables. Because the gift in the created-fit condi- 
tion increases the economic value of the sponsorship, we 
conducted a separate-sample test to determine whether the 
results for created fit are attributable to the gift. We find that 
the gift has little or no effect on perceived fit, attitude 
toward the sponsorship, and firm equity, whereas the fit 
explanation (i.e., caring for pets increases self-esteem) 
accounts for most of the variance in these measures. For 
clarity, the gift has an effect only when the fit explanation is 
given. Thus, the effects of created fit are not attributable to 
increased value but rather to fit per se. The design is a 2 (fit 
explanation: present versus absent) x 2 (gift: present versus 
absent) factorial; we randomly assigned 76 respondents to 
conditions. Both the fit explanation and the gift increase 
perceived fit; however, the effect of the gift is much weaker 
(fit explanation: Ms = 2.84 and 4.39; Fi, 7; = 62.79, p < 
.0001; gift: Ms = 3.42 and 3.81; Fy 5, = 3.83, p < .05). For 
attitude toward the sponsorship and firm equity, the gift has 
no impact (ps > .43), whereas the fit explanation increases 
favorability (attitude: Ms = 5.08 and 6.25; F, 71 = 16.30, 
p < .0001; firm equity: Ms = 4.92 and 5.59; F; 7, = 10.01, 
p < .002). For perceived clarity, there is a fit explanation x 
gift interaction (F; 79 = 4.10, p < .04) such that the gift has 
an impact only if the fit explanation is given (no explana- 
tion: Ms = 3.90 and 4.08; F; 34 < 1; explanation: Ms = 4.06 
and 5.33; F, 36 = 13.34, p < .0008). 

For message source, we conducted a second separate- 
sample test to determine whether respondents view the 
sponsored nonprofit (versus the sponsoring firm) as less 
self-interested in announcing the sponsorship. In a within- 
subjects design, 60 respondents reported how, they would 
perceive the motivations of Alpo and the Special Olympics 
as a message source. They rated (on seven-point scales) the 


likelihood that the source would "have self-serving 
motives," "be trying to win people's liking or patronage," 
“be a trustworthy source of information,” “make the spon- 
sorship sound better than it is," “have unselfish motives,” 
and "be simply trying to inform the public" (we reverse- 
scaled some items; Cronbach's a = .70). As we expected, 
respondents viewed the nonprofit as less likely to have self- 
interested motives (Ms = 3.15 and 4.42 for the Special 
Olympics and Alpo, respectively; F, 4g = 44.96, p < .0001). 
Nonetheless, further investigation into the antecedents of 
the favorable nonprofit source effect is needed. Study 2's 
respondents did not note any difference with regard to 
motives in their undirected thoughts, so it remains possible 
that the beneficial effect of the nonprofit source is derived 
purely from affect transfer or some mediating cognition 
other than thoughts about self-interest. However, that there 
are beneficial effects is not in question; benefits show up in 
both scaled measures and undirected thoughts. 


Persistence of effects. One year later, we met with 47 of 
the original 120 respondents, again during professional 
training seminars. They were evenly spread across condi- 
tions. Given this small sample, we examine only effects that 
are significant on our immediate measures for our whole 
sample and for the portion that constitutes our follow-up 
sample. We then use a repeated measures analysis (immedi- 
ate versus delayed) to track changes in our follow-up sam- 
ple over time. Otherwise, we might mistakenly conclude 
that an effect decays over time when, instead, it is absent in 
our delayed measures because of low power or sample bias. 
Using this criterion, we are able to examine only effects of 
fit and cannot examine control-group comparisons. 

The impact of fit on firm equity lessens over time 
(Fi 34 = 5.89, p < .02), but it remains highly significant 
after a delay of one year. Immediately after exposure to the 
sponsorship information, this sample's equity scores for 
Alpo were 4.96 for low fit and 6.39 for created fit (Fj. 35 = 
35.16, p « .0001). One year later, these equity scores were 
5.19 and 5.96 (F; 44 = 10.52, p < .002). 

We also collected sponsorship recall measures. As Hs 
predicted, recall is greater with created fit than with low fit. 
This effect occurs for both free recall and aided recall. Fur- 
thermore, the previously documented tendency (Johar and 
Pham 1999) to incorrectly recall semantically related pair- 
ings (i.e., false hits) is reduced by created fit. Free recall 1s 
scaled as follows: When a respondent recalled no sponsor- 
ship or recalled only that the sponsored cause “had some- 
thing to do with pets," we code recall as incorrect, or as 0. 
Although the sponsorship involved the gift of pets and pet 
food, the Special Olympics is not an organization that “has 
something to do with pets.” When a respondent recalled that 
the sponsored cause “had something to do with the mentally 
disabled" or "had something to do with athletics," we 
scored this response as partially correct, or as 1, because the 
Special Olympics deals with both. When the respondent 
recalled that the sponsored cause was the Special Olympics, 
we scored this response as accurate, or as 2. No one recalled 
a specific nonprofit other than the Special Olympics. As we 
predicted, free recall is better with created fit than with low 
fit (Ms = .60 and .09; Fy 45 = 4.78, p < .03). We reach the 
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same conclusion if we simply compare the proportions of 
respondents correctly recalling the Special Olympics (35% 
versus 5%; y? = 5.28, p < .02). 

For the aided-recall task, multiple-choice options were 
"no sponsorship," "the Special Olympics," and "the 
Humane Society" We coded aided recall as incorrect if 
respondents indicated that there was no sponsorship or that 
the sponsored nonprofit was the Humane Society and as 
correct if they indicated that the sponsored nonprofit was 
the Special Olympics. The proportion of respondents who 
recalled correctly is significantly greater with created fit 
than with low fit (68% and 2696; Y? = 5.84, p < .01). Fur- 
thermore, the proportion of respondents who incorrectly 
identified the semantically related Humane Society as the 
sponsored nonprofit is marginally lower in the created-fit 
condition than in the low-fit condition (30% and 57%; %2 = 
3.08, p « .07). 


Discussion 


Created fit and the persistence of effects. As is the case 
with high natural fit, created fit increases the favorability of 
responses relative to low-fit sponsorships and to no spon- 
sorship, at least for familiar, well-liked firms. Furthermore, 
the created-fit-sponsorship was better recalled than the 
identical low-fit pairing, consistent with previous work that 
shows that semantically related pairings are better recalled. 
When recall is high, we might expect that less communica- 
tion is required to maintain sponsorship effects over time. 
As a rough test of this possibility, we examined the extent to 
which an aided-recall question helped respondents who had 
not previously free-recalled the sponsorship. Although an 
aided-recall question helped only 2196 of the low-fit 
respondents, it helped 5796 of the created-fit respondents 
(y? = 3.74, p < .05). Thus, simple communication reminders 
or point-of-purchase cues may be effective in keeping 
created-fit (and perhaps high-fit) sponsorships salient. 
Given the strong effects we observed, further investigation 
of created fit is warranted. For example, although it seems 
unlikely, it is possible that the heightened recall resulted 
from the provision of additional interesting information in 
the sponsorship announcement rather than from the creation 
of fit, per se. | 

Notably, one-year follow-up measures of firm equity 
indicate that low-fit sponsorships may also have enduring 
effects, even though they are not as well recalled. Equity 
means for both low fit and created fit changed little over 
time. That the effects of fit remained strong over a year dur- 
ing which respondents were likely exposed to other brand 
messages is impressive. 


Message source. Communicating through the sponsored 
nonprofit can also be beneficial, though it does not com- 
pletely overcome the negative effects of low fit. The main 
benefit is a reduction in the impression of misfit and inap- 
propriateness for low-fit sponsorships, as is shown by the 
perceived fit and clarity measures and by the respondents’ 
undirected thoughts. We also observed benefits of a non- 
profit source for high-fit sponsorships, but the absence of 
these effects in our thought measures leads us to interpret 
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these benefits more cautiously. However, this lack of ver- 
balization may reflect a ceiling effect; high-fit sponsorships 
are viewed favorably even when the company is the mes- 
sage source. 


General Discussion 


Much interest in recent years has centered on building, 
maintaining, and leveraging brand equity. Although much 
of the brand extension (e.g., Broniarczyk and Alba 1994; 
John, Loken, and Joiner 1998; Loken and John 1993), brand 
alliance (e.g., Simonin and Ruth 1998), and cobranding 
(e.g., Samu, Krishnu, and Smith 1999) literature has 
focused on brand-specific associations, the corporate social 
responsibility literature has focused on creating broad asso- 
ciations, such as being “socially responsible," and on the 
simple transfer of positive affect from sponsored causes 
(Brown and Dacin 1997; Rifon et al. 2004; Smith 2004; cf. 
Lichtenstein, Drumwright, and Braig 2004; Sen and Bhat- 
tacharya 2001). We adopt a more traditional branding per- 
spective and show that the fit between a firm’s specific 
associations and a sponsored cause can reinforce or blur the 
firm’s positioning. We also show that the unexpectedness of 
low fit leads to increased elaboration on the sponsorship, 
that this elaboration is negatively biased, and that it is 
accompanied by less favorable attitudes toward the sponsor- 
ship. Finally, we examine the effects of message content 
and source and offer the first experimental evidence of the 
persistence of sponsorship effects over an extended time. 


Overview of Results and Managerial Implications 


In Study 1, we show that sponsorship fit can blur or rein- 
force the firm’s positioning, influence liking for the spon- 
sorship, and, through these effects, influence firm equity. 
These results have straightforward implications for sponsor- 
ship management. First, fit should be a key consideration in 
the development of sponsorships. Supporting a familiar, 
well-liked cause is not enough to ensure a good outcome 
and may even be harmful. Low fit, even with a well-liked 
cause, can dilute the firm's positioning, create dislike for 
the sponsorship, and, as a result, lower firm equity. In con- 
trast, high fit reinforces the firm's positioning, creates 
favorable attitudes toward the sponsorship, and builds firm 
equity. 

Second, management should pay special attention to 
factors that influence liking for the sponsorship. Sponsor- 
ships influence firm equity through their effects on clarity 
of positioning and attitude toward the sponsorship. 
Although high fit reinforces the firm's positioning, it may 
be only one of many inputs to liking for the sponsorship. 
For example, choosing a well-liked cause, avoiding unat- 
tractive elements (e.g., heavy-handed promotions), and pro- 
viding engaging ways for consumers to participate (e.g., 
walkathons, social events) may all contribute to a more 
favorable attitude toward the sponsorship. 

In Study 2, we show that message content and source 
can mitigate the negative effects of low fit, including poor 
sponsorship recall, and we demonstrate that sponsorship 
effects can persist for as long as a year despite day-to-day 


exposure to other brand communications. Again, manage- 
rial implications are straightforward. First, if the firm 
wishes to sponsor a low-natural-fit cause, management 
should create fit by making shared associations salient. 
Created-fit sponsorships can obtain essentially the same 
benefits as high-natural-fit sponsorships, including good 
recall, increased clarity of positioning, more favorable atti- 
tudes, and greater firm equity. There is plenty of room for 
creativity here. We created fit between Alpo and the Special 
Olympics by explaining that caring for pets increases the 
self-esteem of the mentally disabled. The Ford Motor Com- 
pany likely created fit with a national park when it 
explained that it used its expertise in transportation to sup- 
port the park's transportation needs. 

Second, it is risky to assume an understanding of which 
causes people will perceive as high fit. Instead, it is impor- 
tant to verify how the intended audience will view the rela- 
tionship. DenTek Oral Care's sponsorship of the, American 
Diabetes Association probably seems logical (i.e., high fit) 
to people within these two organizations but not to people 
who are unfamiliar with the dental health effects of dia- 
betes. For this audience, fit needs to be created. The caveat 
to verify fit also applies to actions that are intended to cre- 
ate fit. We expected that the gift of a pet and pet food to the 
Special Olympics participants would have a substantial 
impact on perceived fit with Alpo. The gift did increase per- 
ceived fit, but its effect was small compared with that of the 
explanation that caring for pets increases self-esteem. 

Third, firms should consider using the sponsored cause 
as a message source. This decision may boost the effective- 
ness of high-fit sponsorships and can reduce some of the 
risk if the firm is committed to a low-fit sponsorship for 
which it is difficult to create fit. This strategy may be par- 
ticularly helpful when transitioning into a high-fit sponsor- 
ship portfolio. Finally, it appears that sponsorships do not 
require extensive ongoing communications support to have 
enduring effects. We show that sponsorship effects can 
remain strong for up to one year with no interim reinforce- 
ment and as consumers are subject to ordinary day-to-day 
exposure to brand communications. Particularly for created- 
fit sponsorships, simple reminders may have strong effects. 
Free recall of these sponsorships was high, but aided recall, 
wbich provided reminders in the form of multiple-choice 
options, was considerably higher. , 


Limitations and Research Issues 


We examined the effects of one brief exposure to a sponsor- 
ship. Given that we find effects on firn equity one year 
Jater, the real-world potential for sponsorship effects, both 
positive and negative, seems great. However, our results can 
most confidently be generalized to situations such as those 
we examined: familiar and well-liked firms and causes, 
causes that do not evoke too many fearful thoughts (e.g., no 
images of horrible disease), minimal exposures to basic 
sponsorship information, overall communication levels 
similar to those for the firms and causes we examined, very- 
low-fit and very-high-fit sponsorships, and so forth. In par- 
ticular, we would not expect our predictions and findings 
with regard to the effect of clarity of positioning to be repli- 


cated for firms whose positioning is disliked rather than 
liked by consumers. Furthermore, although we focused on 
what we believe are the most common mediators of fit 
effects—clarity of positioning and attitude toward the spon- 
sorship—it seems likely that other mediators operate in 
some contexts. Next, we discuss a few of the variables that 
seem most promising for further research. 


Repeated exposures to less simple communications. 
When a brand extension is launched, it is common for con- 
sumers to be repeatedly exposed to well-developed adver- 
tisements. In contrast, the typical extension study offers a 
single exposure to a simple product description. Studies that 
create a more realistic information environment find that the 
negative effect of low fit (i.e., low category similarity) 
decreases as information about the extension, exposures, 
and consumer innovativeness increase (Klink and Smith 
2001; Lane 2000). Although social sponsorships rarely 
receive communication support equal to that of brand 
extensions—indeed, minimal exposures are common— 
repeated and well-produced messages may mitigate the 
effect of low fit. It is possible to point to apparently suc- 
cessful sponsorships in which fit is questionable, for exam- 
ple, the support of the arts and cultural events by financial 
institutions. People may have learned through repeated 
exposure that financial institutions and cultural events go 
together. If repeated exposure to Jow-fit relationships can 
create fit, high levels of promotion may offer another route 
for overcoming the risks of low fit. However, this strategy 
may require a level of commitment best reserved for special 
situations, for example, when a firm seeks to overcome an 


image problem. 


High fit/negative associative links. Some sponsorships 
are intended to overcome specific negative associations 
(Hoeffler and Keller 2002; Lichtenstein, Drumwright, and 
Braig 2004). For example, oil and chemical companies, 
which are often associated with environmental pollution, 
frequently sponsor environmental causes. Notably, our 
pretests indicate that people perceive these kinds of rela- 
tionships as high fit. For example, manufacturers of alco- 
holic beverages were rated as having a high fit with Moth- 
ers Against Drunk Driving. Àn important question for 
future research is whether high fit in these situations has the 
same beneficial effects we observed in the current studies or 
whether such sponsorships backfire by reminding con- 
sumers of the negative associations. In related findings, 
Szykman, Bloom, and Blazing (2004) report that respon- 
dents infer more society-serving motives when a message 
about the dangers of drinking and driving is delivered by 
Mothers Against Drunk Driving than when it is delivered by 
Budweiser. 


The role of persuasion knowledge. In the domains of 
brand extensions and corporate social responsibility, it has 
been argued that low fit engenders the attribution that the 
company’s action (e.g., creation of a brand extension, sup- 
port of a social cause) is a self-interested attempt to gain 
market value rather than a genuine attempt to deliver value 
(Aaker 1990; Keller and Aaker 1992; Rifon et al. 2004; cf. 
Brown and Dacin 1997). This prediction is based on two 


Achieving Marketing Objectives Through Social Sponsorships / 165 


ideas: (1) that people have “persuasion knowledge” (Fries- 
tad and Wright 1994, 1995), or intuitive beliefs about mar- 
keters’ motives and tactics that they use to interpret mar- 
keters’ actions, and (2) that low fit somehow activates or 
shapes the use of this knowledge. For example, Rifon and 
colleagues (2004) argue that the greater elaboration associ- 
ated with low fit increases the likelihood that people will 
retrieve persuasion knowledge and use it to infer that the 
company is motivated by self-interest (cf. Campbell and 
Kirmani 2000). Relying on scaled measures of perceived 
motivation, Rifon and colleagues find the predicted effect. 
In contrast, our thought listings from both Study 1 and 
Study 2 show no evidence that this effect occurs sponta- 
neously. Instead, we observe the effect only when we 
employ scaled measures. Furthermore, these scaled mea- 
sures do not exhibit discriminant validity. Thus, our results 
suggest that the predicted attributions were not made spon- 
taneously but rather were constructed on the spot, through 
attitude priming, to respond to our measures (Feldman and 
Lynch 1988; Simmons, Bickart, and Lynch 1993). We spec- 
ulate that sponsorship fit may have the predicted effect on 
inferred motivations only when motivation is made salient. 
For example, consumers may be vigilant about marketing 
efforts of a firm that has recently experienced negative pub- 
licity, and therefore they may be quick to think about the 
firm’s motives. If inferred motives vary with fit, a low-fit 
sponsorship may be a particularly poor way to attempt to 
rebuild the firm’s reputation, whereas a high-fit sponsorship 
may be a particularly effective way. 

We also find some evidence that consumers’ use of per- 
suasion knowledge may both increase and decrease the 
marketer’s effectiveness. Our respondents’ comments 
reflected the marketing-savvy attitude reflected in today’s 
popular culture. When fit was low, they chided the mar- 
keter: "Alpo will not sell more food.” When fit was high, 
they implicitly applauded the marketer’s efforts: “By doing 
this, they may open more of a market for themselves.” This 
result implies that marketers may impede their effectiveness 
by any activity that consumers regard as unskilled use of 
marketing tactics and that they may boost their effective- 
ness by winning consumers’ approval of their skill as mar- 
keters. Thus, high-fit sponsorships may help firms achieve 
marketing objectives because of consumers’ use of persua- 
sion knowledge. 


Appendix A 
Study 1 Materials 


Alpo 
The clipping for the high-fit, sponsorship-absent condition 
is as follows: 


Businesses and Organizations in the News 


P&G has introduced a home-use version of its fruit and 
vegetable wash for restaurant and food service operations. 
The wash is formulated to easily remove dirt and 
unwanted residues, including up to 9396 of wax, 9596 of 
handling residues, and is 9896 more effective than water at 
removing pesticides. The product will be available at gro- 
ceries and mass merchandisers throughout the U.S. 


166 / Journal of Marketing, October 2006 


The American Heart Association achieved its goal of 
increasing access to life-saving automated external defib- 
rillators (AEDs) for air travelers with the new FAA 
requirement that commercial airlines carry AEDs as part 
of on-board medical emergency equipment. AEDs are 
small, easy-to-use devises that can analyze heart rhythms 
of cardiac arrest victims to determine if shock is necessary 
and, 1f warranted, deliver a life-saving jolt of electricity to 
the heart. 


The Humane Society today announced the promotion of 
Elizabeth Hansen to the post of Executive Vice President. 
Ms. Hansen has been an advocate for the humane treat- 
ment of animals for over 20 years. In her five years as Pro- 
gram Director of the Humane Society, she has spear- 
headed educational campaigns to increase awareness of 
animal protection issues. 


Alpo Petfoods today announced the opening of its 
expanded online store, www.Alpo.com. The store enables 
consumers to securely purchase an assortment of pet food 
and related products online. The site includes the ability 
for returning customers to purchase without re-entering 
personal information and to interact in real time with cus- 
tomer service representatives. 


In low-fit conditions, we replaced the Humane Society 
item with the following: 


The Special Olympics today announced the promotion of 
Elizabeth Hansen to the post of Executive Vice President. 
Ms. Hansen has been an advocate for athletic programs 
for the mentally retarded for over 20 years. In her five 
years as Program Director of the Special Olympics, she 
has spearheaded educational campaigns to increase aware- 
ness of the benefits of athletics for the mentally retarded. 


In sponsorship-present conditions, we added the follow- 
ing to the Alpo item: 


Coincidentally, Alpo also announced its new sponsorship 
of the [Humane Society or Special Olympics for high-fit 
versus low-fit conditions], which it initiated with a 
$25,000 cash donation. The company will also institute a 
program whereby its employees can receive paid leave to 
participate as volunteers in [Humane Society, Special 
Olympics] events. 


The Sports Authority 


The clipping for the high-fit, sponsorship-absent condition 
is as follows: 


Businesses and Organizations in the News 


P&G has introduced a home-use version of its fruit and 
vegetable wash for restaurant and food service operations. 
The wash is formulated to easily remove dirt and 
unwanted residues, including up to 93% of wax, 95% of 
handling residues, and is 98% more effective than water at 
removing pesticides. The product will be available at gro- 
ceries and mass merchandisers throughout the U.S. 


The American Heart Association achieved its goal of 
increasing access to life-saving automated external defib- 
rillators (AEDs) for air travelers with the new FAA 
requirement that commercial airlines carry AEDs as part 
of on-board medical emergency equipment. AEDs are 
small, easy-to-use devises that can analyze heart rhythms 
of cardiac arrest victims to determine if shock is necessary 


and, if warranted, deliver a life-saving jolt of electricity to 
the heart. 


The Special Olympics today announced the promotion of 
Elizabeth Hansen to the post of Executive Vice President. 
Ms. Hansen has been an advocate for athletic programs 
for the mentally retarded for over 20 years. In her five 
years as Program Director of the Special Olympics, she 
has spearheaded educational campaigns to increase aware- 
ness of the benefits of athletics for the mentally retarded. 


The Sports Authority today announced the opening of its 
expanded online store, www.TheSportsAuthority.com. 
The store enables consumers to securely purchase an 
assortment of sporting goods online. The site includes the 
ability for returning customers to purchase without re- 
entering personal information and to interact 1n real time 
with customer service. 


In low-fit conditions, we replaced the Special Olympics 
item with the following: 


The Humane Society today announced the promotion of 
Elizabeth Hansen to the post of Executive Vice President. 
Ms. Hansen has been an advocate for the humane treat- 

ment of animals for over 20 years. In her five years as Pro- 
gram Director of the Humane Society, she has spear- 

headed educational campaigns to increase awareness of 
animal protection issues. 


In sponsorship-present conditions, we added the follow- 
ing to the Sports Authority item: 


Coincidentally, The Sports Authority also announced its 
new sponsorship of the [Special Olympics or Humane 
Society for high-fit versus low-fit conditions], which it 
initiated with a $25,000 cash donation. The company will 
also institute a program whereby its employees can 
receive paid leave to participate as volunteers in [Special 
Olympics, Humane Society] events. 


Appendix B | 
Study 2 Materials 


The clipping for the no-sponsorship control condition is 
identical to the low-fit, sponsorship-absent control for Alpo 
from Study 1: 


Businesses and Organizations in the News 


P&G has introduced a home-use version of its fruit and 
vegetable wash for restaurant and food service operations. 
The wash is formulated to easily remove dirt and 
unwanted residues, including up to 93% of wax, 95% of 
handling residues, and is 98% more effective than water at 
removing pesticides. The product will be available at gro- 
ceries and mass merchandisers throughout the U.S. 


The American Heart Association achieved its goal of 
1ncreasing access to life-saving automated external defib- 
rillators (AEDs) for air travelers with the new FAA 
requirement that commercial airlines carry AEDs as part 
of on-board medical emergency equipment. AEDs are 
small, easy-to-use devises that can analyze heart rhythms 
of cardiac arrest victims to determine if shock 1s necessary 
and, if warranted, deliver a life-saving jolt of electricity to 
the heart. 


The Special Olympics today announced the promotion of 
Elizabeth Hansen to the post of Executive Vice President. 


Ms. Hansen has been an advocate for athletic programs 
for the mentally retarded for over 20 years. In her five 
years as Program Director of the Special Olympics, she 
has spearheaded educational campaigns to increase aware- 
ness of the benefits of athletics for the mentally retarded. 


Alpo Petfoods today announced the opening of its 
expanded online store, www.Alpo.com. The store enables 
consumers to securely purchase an assortment of pet food 
and related products online. The site includes the ability 
for returning customers to purchase without re-entering 
personal information and to interact in real time with cus- 
tomer service representatives. 


In the low-fit, company-source condition, the Special 
Olympics item is unchanged from the control, and we 
added the following to the Alpo item: 


Coincidentally, Alpo also announced its new sponsorship 
of the Special Olympics, which it initiated with a $25,000 
cash donation. The company will also institute a program 
whereby its employees can receive paid leave’to partici- 
pate as volunteers in Special Olympics events. 


In the low-fit, nonprofit-source condition, the Special 
Olympics item contains the following addition (and the 
Alpo item is slightly edited): 


The Special Olympics also announced Alpo’s new spon- 
sorship of the organization, which ıt initiated with a 
$25,000 cash donation. The company will also institute a 
program whereby its employees can receive paid leave to 
participate as volunteers in Special Olympics events. 


Coincidentally, Alpo Petfoods today announced the open- 
ing of its expanded online store, www.Alpo.com. The 
store enables consumers to securely purchase an assort- 
ment of pet food and related products online. The site 
includes the ability for returning customers to purchase 
without re-entering personal information and to interact in 
real time with customer service representatives. 


In the created-fit, company-source condition, the Spe- 
cial Olympics item is again unchanged from the control, 
and we added the following to the Alpo item: 


Coincidentally, Alpo also announced its new sponsorship 
of the Special Olympics, which it initiated with a $25,000 
cash donation. Citing research that shows that caring for 
pets increases the self-confidence of the mentally dis- 
abled, Alpo reported that it will arrange for interested Spe- 
cial Olympics participants to receive a dog or cat and a 
lifetime supply of pet food. The company will also insti- 
tute a program whereby its employees can receive paid 
leave to participate as volunteers in Special Olympics 
events. 


In the created-fit, nonprofit-source condition, the Spe- 
cial Olympics item contains the following addition (and the 
Alpo item is slightly edited as previously): 


The Special Olympics also announced Alpo’s new spon- 
sorship of the organization, which it initiated with a 
$25,000 cash donation. Citing research that shows that 
caring for pets increases the self-confidence of the men- 
tally disabled, the organization reported that Alpo will 
arrange for interested Special Olympics participants to 
receive a dog or cat and a lifetime supply of pet food. The 
company will also institute a program whereby its 
employees can receive paid leave to participate as volun- 
teers in Special Olympics events. 


Achieving Marketing Objectives Through Soclal Sponsorships / 167 


m~ 


Coincidentally, Alpo Petfoods today announced the apen- 
ing of its expanded online store, www.Alpo.com. The 
store enables consumers to securely purchase an assort- 
ment of pet food and related products online. The site 


includes the ability for returning customers to purchase 
without re-entering personal information and to interact in 
real time with customer service representatives. 
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Rajagopal Raghunathan, Rebecca Walker Naylor, & Wayne D. Hoyer 


The Unhealthy = Tasty Intuition and 
Its Effects on Taste Inferences, 
Enjoyment, and Choice of Food 
Products 


Across four experiments, the authors find that when information pertaining to the assessment of the healthiness of 
food items is provided, the less healthy the item is portrayed to be, (1) the better is its inferred taste, (2) the more 
it is enjoyed during actual consumption, and (3) the greater is the preference for it in choice tasks when a hedonic 
goal is more (versus less) salient. The authors obtain these effects both among consumers who report that they 
believe that healthiness and tastiness are negatively correlated and, to a lesser degree, among those who do not 
report such a belief. The authors also provide evidence that the association between the concepts of "unhealthy" 
and "tasty" operates at an implicit level. The authors discuss possibilities for controlling the effect of the unhealthy = 
tasty intuition (and its potential for causing negative health consequences), including controlling the volume of 
unhealthy but tasty food eaten, changing unhealthy foods to make them less unhealthy but still tasty, and providing 


consumers with better information about what constitutes "healthy" 


tion habits has emerged as an important topic of 
inquiry in recent years (Oakes 2005; Oakes and 
Slotterback 2005; Seiders and Petty 2004; Verbeke 2006; 
Wansink 2004a, b; Wansink and Huckabee 2005). From an 
evolutionary perspective, the easy availability of food pre- 
sents a unique quandary for developed countries and 
societies. On the one hand, it has virtually obliterated prob- 
lems resulting from malnourishment. On the other hand, it 
has generated problems stemming from the overconsump- 
tion of food. For example, obesity is growing at an unprece- 
dented rate, not just in the United States, where nearly two- 
thirds of the population are overweight or obese, but in 
other developing nations as well (Seiders and Petty 2004). 
To many consumers and consumer activists, organiza- 
tions (e.g., lobbyists for grocery manufacturers) and restau- 
rants (e.g., McDonald's, Hardee's) that are interested in pro- 
moting unhealthy eating habits are to blame for consumers' 
propensity to overconsume (Brownell and Horgen 2003; 


A^» the health consequences of food consump- 
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Nestle 2003). Consequently, lawsuits have been filed 
against fast-food companies such as McDonald's (Parloff 
2003). Attorneys representing these companies have argued 
that consumers are aware of the ingredients in their offer- 
ings and that they understand the weight and health conse- 
quences of overconsumption. In their view, consumers are 
ultimately responsible for the negative consequences of 
their own actions.! 

The objective of this research is to examine the role of a 
subtle (and currently unrecognized) factor in the overcon- 
sumption of food that is perceived as unhealthy. What if 
people consume food that is considered unhealthy not 
despite its perceived unhealthiness but rather because of it? 
That is, what if part of the attractiveness of food lies in its 
perceived unhealthiness? This can happen if consumers 
intuitively believe that the unhealthier the food, the tastier it 
is. The operation of such a belief would increase the 
chances that people will overconsume food portrayed as 
less (versus more) healthy because they expect that such 
food will taste better. Consistent with this idea, we propose 
that the perceived unhealthiness of food has the ironic effect 
of enhancing its attractiveness. Consequently, under cir- 
cumstances in which a hedonic (e.g., enjoyment) goal is 
more (versus less) salient, people will choose options they 
perceived as unhealthier, even if no information about their 
tastiness relative to other options is available. Across the 
experiments we conduct in this research, we obtain support 
for this proposition. 





lThe fast-food chains appear to be winning the battle, at least 
for now. In October 2005, the U.S. House of Representatives 
passed a bill to block lawsuits blaming restaurants and food com- 
panies for causing obesity or weight gain (Zhang 2005). 


Journal of Marketing 
Vol. 70 (October 2006), 170—184 


Our research generates two theoretically interesting and 
substantively meaningful insights. First, from a theoretical 
standpoint, we document both direct and indirect support 
for our main thesis, namely, that people subscribe to the 
unhealthy = tasty intuition and that this intuition affects 
inferences about food choice, actual enjoyment of foods, 
and choice of food products. These notions are particularly 
intriguing because there is little rigorous evidence that 
healthiness and tastiness are indeed inversely related to 
each other. Second, our research suggests that the influence 
of the unhealthy = tasty intuition can be implicit; as such, 
even people who may not report believing that unhealthy 
food is tastier make judgments and choices as if they sub- 
scribed to such a view. 

We structure the rest of the article as follows: We begin 
by briefly commenting on the potential sources for the 
unhealthy = tasty intuition. Then, we derive predictions of 
how the intuition affects three variables of theoretical and 
substantive interest: (1) taste inferences of food products 
when information pertaining to their healthiness is pro- 
vided, (2) differences in enjoyment of the same ‘food prod- 
uct when it is presented as being more versus less healthy, 
and (3) choice between two snack options as a function of 
differences in the propensity to pursue a hedonic goal. We 
report results from four controlled experiments that test 
these hypotheses and conclude with a discussion of the 
implications of our findings. 


Theory 


Sources of the Unhealthy = Tasty Intultion 


Intuitions or lay beliefs and theories may be generated 
internally, through personal experience and self-observation 
(Ross and Nisbett 1991), or externally, from environmental 
cues (Morris, Menon, and Ames 2001). We posit that both 
types of sources underlie the unhealthy = tasty intuition. 


Internal source. We posit that the unhealthy = tasty intu- 
ition is generated internally on the basis of a'belief in a 
compensatory relationship between the “wholesomeness” 
of stimuli and their “hedonic potential.” That is, we believe 
that the unhealthy = tasty intuition is a specific manifesta- 
tion of a more general principle, according to which there is 
an inverse relationship between things that are “healthy,” 
“nourishing,” and “good for you” and those that are “enjoy- 
able,” “fun,” and “exciting.” Evidence for an assumed com- 
pensatory relationship between the wholesomeness and 
hedonic potential of stimuli emerged across a separate 
series of experiments we conducted. For example, the 
results from one such experiment revealed that compared 
with an automobile portrayed as attractive and fun to drive, 
one that was portrayed as unattractive and less enjoyable to 
drive was inferred to be safer, even though the more attrac- 
tive automobile was objectively safer (as per the Insurance 
Institute of Highway Safety). We speculate that a belief in 
the more general unwholesome = fun intuition is rooted in 
religious messages, such as the Protestant work ethic, 
according to which a person is morally obligated to accord 
greater priority to necessities over luxuries (e.g., Berry 


1994; Weber 1998; see also Maslow 1962). Implicit in this 
view are the ideas that comfort and Juxury do not come eas- 
ily and that a person needs to work hard to earn the right to 
indulge (e.g., Kivetz and Simonson 2002). Thus, the Protes- 
tant work ethic makes a clear distinction between stimuli or 
activities that are of a “serious,” “worthy,” and “work- 
oriented” nature and those that are of a “trivial,” “frivolous,” 
and "fun-related" nature and suggests that stimuli/activities 
fall neatly into one or the other category but rarely into 
both. In the context of food, this translates into a belief that 
healthy food is less tasty. 


External source. Operating in concert with the internal 
source for the unhealthy = tasty intuition is an external one. 
Through the mass media or personal communication, 
people are constantly exposed to views that are compatible 
with the intuition. Several recent articles in popular maga- 
zines (e.g., "Is Fat the Next Tobacco?" which appeared in 
Fortune [Parloff 2003]), popular books (e.g., Fast Food 
Nation, by Eric Schlosser [2001]), and movies (e.g., Mor- 
gan Spurlock's Super Size Me, which appeared in theaters 
nationwide in 2004) explicitly or implicitly suggest an 
inverse relationship between healthiness and tastiness. Con- 
sumers are also likely to have been exposed to the 
unhealthy = tasty view through personal communication. 
For example, parents and guardians appear to find value in 
propagating such a message because it encourages children 
to consume hedonically unpleasant food that is perceived as 
salubrious (e.g., eating broccoli) and discourages them from 
overconsuming hedonically appealing fare that is poten- 
tially harmful (e.g., candy). 

In summary, we propose that the unhealthy = tasty intu- 
ition is generated internally through its consistency with the 
more general unwholesome = fun intuition and/or exter- 
nally through repeated exposure to views that are compati- 
ble with it.? We believe that these sources help generate the 
hypothesis that unhealthiness is inversely related to tasti- 
ness. After this association has been generated, psychologi- 
cal mechanisms help perpetuate it, as we discuss next. 


Influence of the Unhealthy - Tasty Intuition on 
Decision Making 


Figure 1 depicts the influence of the unhealthy = tasty intu- 
ition on three conceptually separate stages of the decision- 





2It is worthwhile to mention an alternative source for the intu- 
ition: direct experience. A person may arrive at the intuition 
through lay inferencing (Kelly and Thibaut 1969), a process that 
involves personal observation that unhealthier food is tastier. 
There are two reasons we do not believe that this is a valid source 
for the intuition. First, lay inferencing presupposes objectivity on 
the part of the inference maker. However, we know that because of 
the prevalence of external messages consistent with the 
unhealthy = tasty intuition, consumers are unlikely to be objective 
Judges of the correlation between healthiness and tastiness. Sec- 
ond, to assess accurately the true correlation between healthiness 
and tastiness, it is first necessary to assign relatively stable health- 
iness and tastiness ratings to various types of food. However, rele- 
vant findings indicate that ratings of the healthiness and tastiness 
of food likely depend heavily on contextual factors (Drewnowski 
1997; Oakes 2005). 
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making process. Research has shown that consumers gener- 


ate inferences about missing product attributes on the basis 


of lay theories, that is, beliefs and intuitions about the world 
(Broniarczyk and Alba 1994; Sujan and Dekleva 1987). 
Indeed, inferences about missing product attributes are dri- 
ven by intuitions even in the face of contradictory cues. For 
example, in a study in which participants were asked to 
make inferences about a camera’s durability, the partici- 
pants relied more heavily on the intuition that durability and 
warranty are positively correlated than on the actual corre- 
lation (in the presented data) between durability and shutter 
speed, an association that is.not supported by intuition (Bro- 
niarczyk and Alba 1994). 


Influence of intuition on taste inferences. Consistent 
with these findings, we predict that as a consequence of 
belief in the unhealthy = tasty intuition (see Figure 1, Stage 
I), 


H,: The healthier a food item is perceived to be, the lower is 
its 1nferred tastiness. 


Influence of intuition on enjoyment. Inferences about 
missing attributes are likely to act as hypotheses that can 
bias judgments about these attributes (Deighton 1984; Hoch 
and Ha 1986; Shiv, Carmon, and Ariely 2005). The opera- 
tion of this hypothesis confirmation bias was captured in 
shiv, Carmon, and Ariely's (2005) experiments, which were 
designed to demonstrate the placebo effects of marketing 
actions. For example, in their first experiment, they find that 


consumers who pay a discounted price for an energy drink 
that purportedly increases mental acuity derive less benefit 
from the drink (measured in terms of number of mental 
puzzles solved) than do participants who pay full price. 
Consistent with the operation of the hypothesis confirma- 
tion bias, the discounted price serves as a hypothesis about 
the (lack of) effectiveness of the drink, which is confirmed 
on consumption of the drink (and reflected in the subse- 
quent puzzle-solving task). In the context of the current 
research, we expect that consumers will judge the same 
food product as more tasty when it is portrayed as less 
healthy. We state tbis prediction more formally as follows 
(see Figure 1, Stage 2): 


H5: When actual tastiness of a food product is controlled for, 
the healthier the food is portrayed to be, the lower is its 
judged tastiness. 


Influence of intuition on choice. Product choice is often 
a function of consumption goals (Bettman, Luce, and Payne 
1998). In the context of food choice, researchers have sug- 
gested that, in general, the hedonic goal of enjoyment is 
salient and important (Dhar and Simonson 1999; Food Mar- 
keting Institute 1996; Shiv and Fedorikhin 1999; Tepper 
and Trail 1998). For example, Shiv and Fedorikhin’s (1999) 
experiments examine how a person's mood can influence 
how he or she resolves the trade-off between the enjoyment 
and the health benefits of food offerings. Likewise, Dhar 
and Simonson's (1999) experiments examine conditions 
that lead to "highlighting" (a focus on either an enjoyment 


FIGURE 1 
Model of the Influence of the Unhealthy = Tasty Intultion on Judgments and Decisions 
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or a health goal in choosing an entrée and a dessert) versus 
"balancing" (a focus on both goals). Given thatienjoyment 
is likely to be a significant goal when people choose food 
products, the unhealthy = tasty intuition suggests that con- 
sumers with a more (versus less) salient hedonic goal will 
have a relatively greater preference for options they per- 
ceived as unhealthy (see Figure 1, Stage 3). 


H3: Given a choice between two food products that vary in 
healthiness, people with a more (versus less) salient 
enjoyment goal will choose the less healthy option. 


Implicit and explicit influence of intuition. Thus far, we 
have suggested that consumers' taste inferences, the judged 
tastiness of food, and product choice will be inflüenced in a 
manner consistent with the unhealthy = tasty intuition. 
Here, we explore whether explicitness of belief in the intu- 
ition moderates this general pattern. If we find that even 
consumers who do not explicitly report a belief in the intu- 
ition make decisions as if they did, important theoretical 
and substantive implications would emerge. From a theo- 
retical standpoint, such a finding would suggest that the 
influence of the intuition can occur outside the awareness of 
consumers making consumption decisions in the context of 
food. From a substantive standpoint, the finding would 
point to the difficulty of controlling the influence of the 
intuition. ' 

The possibility that even people who do not report a 
belief in the intuition will make decisions as if they believed 
in it is consistent with the mechanism of hidden.covariation 
detection (e.g., Lewicki 1986; Lewicki and Hill 1989; 
Lewicki, Hill, and Czyzewska 1997). Lewicki and col- 
leagues find that people can learn the relationship between 
variables implicitly, that is, without being able to articulate 
their learning. In one experiment (Lewicki 1986, Experi- 
ment 1a), participants were exposed to information about 
some traits (e.g., kindness, capability) of women such that 
these traits covaried with their hair length. Later, the same 
participants were asked to assess these traits in another set 
of women with long and short hair. Participants’ judgments 
of these targets were consistent with the covariations 
between hair lengths and traits in the initial stimuli. Impor- 
tantly, even participants who were unable to articulate these 
covariations explicitly made judgments as if they had 
learned them. Thus, people are able to acquire information 
nonconsciously about covariations in a data set without 
being able to report such learning explicitly (Lewicki and 
Hill 1989, p. 240). 

Consistent with Lewicki and colleagues' (e:g., Lewicki 
1986; Lewicki and Hill 1989; Lewicki, Hill, and Czyzewska 
1997) work, we expect that as a result of exposure to the 
many external sources that suggest a negative correlation 
between healthiness and tastiness, consumers will have 
implicitly learned to associate unhealthiness with tastiness. 
Thus, we expect that even consumers who do not explicitly 
report believing that unhealthier food is tastier will make 
judgments and decisions as if they did. However, because 
there is evidence that explicitly (versus implicitly) learned 
and articulated relationships are likely to have significantly 
stronger influences on judgments (Allen and Janiszewski 
1989), we expect that the influence of the unhealthy = tasty 


intuition will be stronger among people who report explicit 
belief in it. Thus: 


H4: The pattern of results we predict in H,—H3 holds for both 
people who report an explicit belief in the unhealthy = 
tasty intuition and those who do not, but it is stronger 
among the former group. 


Experiment 1: Implicitness of Bellef 
in Intuition 


We designed Experiment 1 to establish that people implic- 
itly subscribe to the unhealthy = tasty intuition. To address 
this objective, we used the implicit association test (LAT; 
Greenwald, McGhee, and Schwartz 1998; Greenwald, 
Nosek, and Banaji 2003), an established tool for demon- 
strating the type of implicit association we wish to 
demonstrate. 


Participants and Procedure 


One hundred thirty-eight undergraduate students partici- 
pated in this experiment (which we conducted using IBM 
computers and the Direct RT software) for course extra 
credit. Participants were told that their task was to catego- 
rize correctly stimuli shown on the middle of their screens. 
We selected stimuli from one of the following four cate- 
gories (see the Appendix) for the categorization tasks: (1) 
pictures of unhealthy foods, (2) pictures of healthy foods, 
(3) words associated with the tastiness or enjoyment of food 
(e.g., tasty, delicious), and (4) words associated with the 
lack of taste or enjoyment (e.g., flavorless, unpalatable). In 
line with established protocol (Greenwald, McGhee, and 
Schwartz 1998), participants completed seven blocks of tri- 
als, five of which were practice blocks designed to familiar- 
ize participants with the target stimuli and the categoriza- 
tion labels. The critical blocks for analysis were Block 4, in 
which the category labels were “Healthy Foods/Enjoyable” 
versus “Unhealthy Foods/Not Enjoyable,” and Block 7, in 
which the category labels were “Unhealthy Foods/ 
Enjoyable” and “Healthy Foods/Not Enjoyable” (see Table 
1). 

Stimuli from all four categories (healthy food images, 
unhealthy food images, enjoyment words, and lack of 
enjoyment words) were presented for classification in the 
critical blocks for 48 trials across both blocks. An implicit 
belief in the intuition that unhealthy foods are tastier (and 
that healthier foods are less tasty) would be reflected in 
faster responses when stimuli were to be categorized in the 
condition in which the classification task was congruent 
with the intuition (i.e., when participants paired unhealthy 
foods with words associated with tastiness and enjoyment) 
than in the incongruent condition (i.e., when participants 
paired healthy foods with words associated with tastiness 
and enjoyment). Explicit measures of belief in the 
unhealthy — tasty intuition were collected (using paper-and- 
pencil measures) before the IAT procedure to compare par- 
ticipants' explicit and implicit attitudes. We assessed explic- 
itness of participants' beliefs in the intuition by eliciting 
their agreement with the following two items (1 — "strongly 
disagree," 9 = "strongly agree"): (1) "Things that are good 
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TABLE 1 
Experiment 1: Sequence of Trial Blocks In IAT 


Number of Items Assigned to Left-Key Items Assigned to Right-Key 

Block Trials Function Response Response 

1 20 Practice Healthy food Images Unhealthy food images 

2 28 Practice Good taste/enjoyment words Bad taste/lack of enjoyment words 

3 24 Practice Healthy food Images + good Unhealthy food images + bad 
taste/enjoyment words taste/lack of enjoyment words 

4 ' 48 Critical test block Healthy food images + good Unhealthy food Images + bad 
taste/enjoyment words taste/lack of enjoyment words 

5 40 Practice Unhealthy food Images Healthy food images 

6 24 Practice Unhealthy food images + good Healthy food images + bad 
taste/enjoyment words taste/lack of enjoyment words 

7 48 Critical test block Unhealthy food Images + good Healthy food images + bad 


taste/enjoyment words 


taste/lack of enjoyment words 





for me rarely taste good,” and (2) “There is no way to make 
food healthier without sacrificing taste.” 


Data Preparation 


Following the revised JAT scoring algorithm (Greenwald, 
Nosek, and Banaji 2003), we subjected all data to the fol- 
lowing criteria: (1) We eliminated trial response latencies 
greater than 10,000 milliseconds (of the 13,248 trials in the 
critical blocks, 2 were above this threshold and thus were 
eliminated); (2) we excluded participants whose response 
times were less than 300 milliseconds on more than 10% of 
the critical trials (we discarded 2 of the participants, result- 
ing in a final sample of 136 participants); (3) we included 
all response latencies, even those for false responses; and 
(4) we computed a difference score between the two critical 
blocks of trials ([congruent: unhealthy food + enjoyable and 
healthy foods + not enjoyable] — [incongruent: healthy 
foods + enjoyable and unhealthy foods + not enjoyable] 
divided by the pooled standard deviation of response laten- 
cies across both blocks). The resultant measure is the IAT D 
effect (Greenwald, Nosek, and Banaji 2003). 


Results and Discussion 


Response latencies were significantly lower when the stim- 
uli were presented 1n the congruent (versus incongruent) 
condition. Mean response time when participants were 
asked to classify stimuli into the congruent categories (pair- 
ing unhealthiness with enjoyment) was 767.76 millisec- 
onds, compared with 1137.91 milliseconds when partici- 
pants were asked to classify stimuli into the incongruent 
categories (pairing healthiness with enjoyment); this differ- 
ence in response time was significant (D = .648, SD = .497, 
t(135) = 17.19, p « .0001, d = 1.42). That participants were 
able to respond significantly faster when asked to group 
unhealthy foods with words describing tastiness and enjoy- 
ment than with words describing lack of taste and lack of 
enjoyment suggests that participants have a stronger 
implicit association between unhealthiness and tastiness 
than they do between healthiness and tastiness. 
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To test whether the unhealthy = tasty intuition influ- 
ences the response times of even the participants who did 
not report explicit belief in the intuition, we first averaged 
the items used to measure explicitness of belief to form an 
explicitness-of-belief index. We then assessed the mean dif- 
ference in response time for the congruent versus incongru- 
ent condition at each level of reported explicitness of belief 
in the intuition (see Irwin and McClelland 2001). The 
results revealed that this difference was significant at every 
level of belief. That is, at all reported levels of explicit dis- 
agreement and agreement with the intuition, the difference 
in response latencies when participants were asked to pair 
healthiness with enjoyment versus when they were asked to 
pair unhealthiness with enjoyment was significantly differ- 
ent from zero (all t's > 3.23, all ps « .01). Participants at all 
levels of belief were able to classify unhealthiness with 
enjoyment more quickly than they were able to classify 
healthiness with enjoyment. ; 

Thus, the results of Experiment 1 are consistent with 
our hypothesis that consumers implicitly subscribe to the 
view that healthiness and tastiness are inversely related to 
each other, regardless of their level of explicit belief in this 
relationship. As such, we expect that even consumers who 
do not report agreeing that healthiness and tastiness are 
inversely related to each other will make judgments and 
decisions as if they did; we tested this hypothesis in the next 
three experiments. 


Experiment 2: Taste Inferences of 
Crackers 


The results from Experiment 1 demonstrated that people 
implicitly associate unhealthiness with tastiness. Thus, we 
expect that the influence of the unhealthy = tasty intuition 
on decision making can take place outside of awareness 
such that even participants who do not report believing that 
healthiness and tastiness are inversely related to each other 
make decisions as if they believe in such a relationship. In 
general, the objective of Experiment 2 was to test this pre- 


diction in the context of inference making. Specifically, the 
objective of this experiment was twofold: First, we wanted 
to assess whether a food product portrayed as unhealthy 
would be inferred to taste better (Hj) than one portrayed as 
healthy. Second, we wanted to assess whether such an infer- 
ence is made both by people who explicitly report believing 
that tastiness is inversely related to healthiness and, to a 
lesser degree, by those who do not (Hy). 


Participants, Stimuli, and Procedure 


One hundred ten undergraduate students (51 men and 59 
women) took part in this experiment for course extra credit. 
Participants were told that the purpose of the experiment 
was to assess consumers' taste perceptions of three new 
brands of cheddar-flavored snack crackers. To lend credibil- 
ity to the cover story, participants were told that a national 
cracker manufacturer was interested in introducing these 
brands of crackers into the local market. The brands were 
portrayed as varying in terms of unhealthiness. After being 
exposed to nutritional information for the cracker versions, 
participants were asked to infer the tastiness of the three 
versions. 


Unhealthiness manipulation. The most natural way to 
manipulate unhealthiness perceptions is to portray offerings 
as differing in the content of an ingredient (e.g., fat) that 1s 
generally accepted as being unhealthy. However, altering 
the content of fat (or any other ingredient) can alter taste 
perceptions directly (i.e., without altering perceptions of 
unhealthiness). As a result, varying content of a specific 
ingredient would confound our prediction with the compet- 
ing possibility that the ingredient may independently 
enhance inferred tastiness of the crackers. To overcome this 
problem, we chose to manipulate perceptions of unhealthi- 
ness by varying the purported content of "bad" versus 
“good” fat, while controlling for the crackers' total fat con- 
tent. This allowed us to manipulate the perceived unhealthi- 
ness of the crackers selectively while controlling for any 
(potential) effects due to a direct association of fat content 
with taste. 

The unhealtbiness manipulation unfolded as follows: 
Before seeing the nutritional information for each cracker 
version, participants were presented with the following 
information about the characteristics of bad (saturated) and 
good (unsaturated) fat: 


Medical opinion suggests that consuming unsaturated fat 
(good fat) can raise the level of high-density lipoproteins 
(HDL), which carry cholesterol from the heart to the liver 
and thereby eliminate excess cholesterol; in contrast, con- 
suming saturated fat (bad fat) can raise the level of low- 
density lipoproteins (LDL), which carry cholesterol from 
the liver to the rest of the body and thereby lead to 
blocked coronary arteries. (adapted from Harvard School 
of Public Health Nutrition Source 2004) 


Then, participants were told that each of the three new 
brands of crackers contained 13 grams of total fat per serv- 
ing. One brand (the healthy alternative) was portrayed as 
containing 11 grams of good fat and 2 grams of bad fat, and 
another (the unhealthy alternative) was portrayed as con- 
taining 2 grams of good fat and 11 grams of bad fat. A third, 
“control” version was portrayed as containing 6.5 grams of 


both types of fat. All participants were exposed to informa- 
tion about all three versions of the crackers in a counterbal- 
anced manner. 


Dependent and other variables. After exposure to infor- 
mation about the fat content of the three cracker versions, 
participants were asked to rate “How tasty do you think 
these crackers would be?” and “How much do you think you 
would enjoy eating them?” on a ten-point scale (1 = “not at 
all?" 10 = “very”) for each cracker version. To test whether 
the influence of the unhealthy = tasty intuition can occur 
implicitly, we assessed explicitness of participants’ beliefs in 
the intuition by eliciting participants’ agreement with the 
same two statements we used in Experiment 1. Finally, par- 
ticipants provided information on their liking for cheddar- 
flavored snack crackers and also indicated their perceptions 
of the unhealthiness of the three cracker versions. 


Results 


We dropped 4 participants from the analyses for indicating 
that they did not like any type of cheddar-flavored snack 
cracker (a rating of 1 on the scale that assessed liking). This 
left 106 participants (48 men and 58 women). 


Manipulation check. We first wanted to assess whether 
our unhealthiness manipulation worked as intended. We 
regressed perceived unhealthiness of crackers on their pur- 
ported bad fat content. The results revealed a positive 
monotonic relationship between purported bad fat content 
and perceived unhealthiness (B = 4.39, t(101) = 9.32, p < 
001), revealing that the unbealthiness manipulation worked 
as we intended. l 


Taste inferences. The items used to measure tastiness 
and enjoyment of the crackers were highly correlated 
(Cronbach’s o = .86); thus, we averaged them to form a 
tastiness index. The items used to measure explicitness of 
belief in the intuition were also highly correlated (Cron- 
bach's œ = .73); thus, we averaged them to form an 
explicitness-of-belief index, as in Experiment 1. Using the 
taste inference index of the crackers as the dependent 
variable, we conducted a mixed analysis of variance 
(ANOVA) with three levels of portrayed unhealthiness of 
the cracker (healthy, control, unhealthy) as a repeated factor 
and explicitness of belief in the intuition as a continuous 
between-subjects factor. The results revealed a marginal 
main effect of unhealthiness (F(2, 103) = 2.44, p = .09). As 
we predicted, mean inferred tastiness was highest for the 
cracker portrayed as containing the highest amount of bad 
fat (M = 7.33) and lowest for the cracker portrayed as con- 
taining the lowest amount of bad fat (M — 5.16), with the 
control cracker falling in between (M = 6.38). Although the 
differences between the inferred tastiness of the unhealthy 
and the control cracker and those between the healthy and 
the control cracker were not significant (F(1, 104) = 1.53 
and 1.84, respectively, both ps » .12), the focal contrast 
between tastiness ratings of the unhealthy and healthy 
crackers was significant (F(1, 104) = 17.02, p < .001). 
These results offer evidence consistent with H, and indicate 
that consumers infer the taste of food products when infor- 
mation relevant to assessing their healthiness is provided in 
a manner consistent with the unhealthy = tasty intuition. 
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Role of explicitness of belief. The results of the ANOVA 
also revealed a significant unhealthiness x explicitness-of- 
belief interaction (F(2, 103) = 12.12, p < .001), suggesting 
that the inference that crackers with more bad fat tasted bet- 
ter was made to differing degrees, depending on partici- 
pants’ strength of belief in the unhealthy = tasty intuition. 
To test whether the unhealthy = tasty intuition influenced 
the taste inferences of participants who did not report 
explicit belief in it, we assessed the mean difference in 
inferred tastiness ratings of the unhealthy versus healthy 
cracker at each level of reported explicitness of belief in the 
intuition (Irwin and McClelland 2001). The results revealed 
that this difference was significant at every level of belief. 
That is, as we depict in Figure 2, at all reported levels of 
disagreement and agreement with the intuition, the differ- 
ence in inferred tastiness of the unhealthy versus healthy 
cracker was significantly different from zero (all t's » 2.68, 
all ps « .01).? 


Discusslon 


The results from Experiment 2 lead us to conclude that the 
unhealthy = tasty intuition influences taste inferences. 
Although participants in the experiment did not taste any of 
the crackers, and the total amount of fat was the same 
across the different versions of the cracker, they readily 
assumed that crackers with more bad fat would taste better. 
Furthermore, we also found that though the effect of por- 
trayed healthiness on taste inferences was particularly 
strong among those who reported an explicit belief in the 
unhealthy = tasty intuition, we also obtained the effect 
among participants who reported disagreement with the 
intuition. This suggests that consumers may implicitly sub- 
scribe to the view that healthiness and tastiness are 
inversely correlated, thus conceptually corroborating the 
results we obtained in Experiment 1. 


Experiment 3: Enjoyment of Mango 
Lassi 


We wanted to expand on our findings in several ways in 
Experiment 3. First, we attempted to address a potential 


3To ensure that our results hold when the unhealthy = tasty intu- 
ition is measured as such (i.e., that unhealthy implies tasty rather 
than that healthy implies not tasty), we conducted an additional 
study with 287 undergraduate students that reversed the wording 
of the explicitness-of-belief measure. That is, participants were 
asked to rate their agreement with the statement “The more 
unhealthy a food, the better it tastes” rather than the statement 
“There is no way to make food healthier without sacrificing taste.” 
The results were nearly identical to those we found 1n Experiment 
2. Participants rated the unhealthy cracker (M = 7.31) as signifi- 
cantly tastier than the healthy cracker (M = 5.33; F(1, 286) = 
163.71, p < .0001), and this difference in taste ratings interacted 
with explicitness of belief (F(1, 285) = 10.99, p < .001), such that 
higher levels of belief were associated with greater differences in 
taste ratings. Furthermore, when we used Irwin and McClelland's 
(2001) method, the difference 1n rated taste between the unhealthy 
and the healthy cracker was significantly different from zero at 
every level of explicit belief in the intuition (all t's > 2.57, all ps < 
.05) except at the lowest level. At the lowest level of explicit 
belief, this difference approached significance (t = 1.55, p = .12). 
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limitation arising from the within-subjects design used in 
Experiment 2. Because participants were asked to infer the 
tastiness of three different cracker versions that varied only 
in terms of portrayed healthiness, it could be argued that the 
participants may have guessed our hypotheses. Although 
the significance of the two-way unhealthiness x 
explicitness-of-belief interaction suggests that demand 
effects cannot fully explain our findings, we wanted to act 
conservatively and conceptually replicate these findings 
using a between-subjects manipulation of unbealthiness. 
Second, we wanted to demonstrate that the portrayed 
healthiness of food affects actual (versus inferred) enjoy- 
ment (see Figure 1, Stage 2). Such a finding would indicate 
that the unhealthy = tasty intuition acts as a hypothesis that 
participants confirm after sampling stimuli (e.g., Hoch and 
Ha 1986; Shiv, Carmon, and Ariely 2005). Third, we 
wanted to assess whether the implicit nature of the influ- 
ence of the unhealthy = tasty intuition can be replicated in 
the context of actual enjoyment. Such a finding would fur- 
ther attest to the robustness of H4. Finally, we tested the 
generalizability of our results by using a different (1) prod- 
uct category, (2) unhealthiness manipulation, and (3) exper- 
imental setup. 

Forty adults were invited to a housewarming party at the 
residence of one of the authors of this article. In the invita- 
tion to the event (sent by e-mail), invitees were informed 
that refreshments would be served at the party. The refresh- 
ment item targeted for testing our predictions was Mango 
Lassi (an East Indian delicacy similar to a milkshake). We 
used the context of the housewarming party, which pro- 
vided a more realistic setting than the laboratory, to test our 
predictions. 


FIGURE 2 
Experiment 2: Difference in Inferred Tastiness of 
an Unhealthy Versus a Healthy Cracker at Each 
Level of Explicitness of Belief in the Unhealthy = 
Tasty Intultion 


Rating Difference 


A Bad-good rating 
difference 
Linear (Bad—good 
rating difference) 





Participants, Stimuli, and Procedure 


At the party, invitees were told that snacks had been ordered 
from a newly opened Indian restaurant (Hyderabad Masala) 
and that the operators of this restaurant were interested in 
catering mainly to an American audience. To lend credibil- 
ity to the cover story, during appropriate moments at the 
party, the host nonchalantly expressed that “when the 
restaurant operators realized that I was going to have a pre- 
dominantly American crowd at the party, they wondered if 
the guests would be willing to fill out a short questionnaire 
about the food. I didn’t see any harm in agreeing, especially 
since they promised to give you a 25% discount on your 
next visit to the restaurant.” A coupon for “25% off the next 
meal at Hyderabad Masala” was made available:to the invi- 
tees. To lend further credibility to the cover story, a confed- 
erate (a graduate student of East Indian origin) was hired to 
play the role of an agent representing the restaurant. The 
confederate handed out the one-page questionnaire that pro- 
vided information about the following food items offered at 
the party and.elicited participants’ judgments of the tasti- 
ness and enjoyment of the following items using a ten-point 
scale (1 = “not at all,’ 10 = “very’”): (1) Idlis, (2) Samosas, 
and (3) Mango Lassi.4 These items were placed ona dining 
table in the formal dining area. The Idlis and Samosas were 
served on paper plates, and the Mango Lassi was served in 
prefilled 12-ounce plastic cups. 

Because the invitees arrived at various times during a 
fairly large (four hours) time interval and because the din- 
ing area was physically distant from the family room, where 
people congregated, the invitees tended to be separated 
from the other guests at the time of sampling the food items 
and filling out the questionnaire. This minimized potential 
response bias effects due to social influence (Raghunathan 
and Corfman 2006). To encourage candidness further, the 
confederate (who was always present next to the food) 
stressed to every invitee, “Since we [the restaurant opera- 
tors] are interested in catering to the American taste, we 
_ request that you respond as candidly as possible. Please 
don’t worry about hurting our feelings.” 


Unhealthiness manipulation. Across all participants, the 
Idli was portrayed as a healthy item, and the Samosa was 
portrayed as an unhealthy one. These two items were not 
the focus of this experiment, but we included them to lend 
credibility to the cover story and to mask the main target of 
this experiment (the Mango Lassi). For roughly half of the 
participants, the Mango Lassi was portrayed as a healthy 
item (“made out of real mango pulp and milk; generally 
considered very healthy”), and for the other half, it was por- 
trayed as an unhealthy one (“made out of real mango pulp 
and milk; generally considered unhealthy”). Note that this 
manipulation (unlike the one we used in Experiment 2) 
serves to alter perceptions of unhealthiness directly (rather 
than manipulate it by altering the content of an ingredient 
considered unhealthy). 


^[dli is a traditional South Indian dish made by steaming a mıx- 
ture of mainly lentils and rice. Samosas are deep-fried vegetable 
dumplings. Lassi refers to sweet drinks made with a base of 


yogurt. 


Measurement of explicitness of belief in the intuition. 
Two days after the party, the host e-mailed the invitees indi- 
vidually, attaching an article that had recently appeared in 
the local newspaper. The newspaper article suggested that 
the “low-carb fad" was waning. The purported purpose of 
the e-mail was to alert the recipients to the content of the 
"interesting" article, but the true purpose was to gauge the 
invitees’ explicitness of belief in the unhealthy = tasty intu- 
ition. Using the news article as a backdrop, the host 
assessed the invitees' beliefs in the relationship between 
healthiness and tastiness by asking the following question, 
“On a scale of one to nine, how much would you agree with 
the following statement: ‘Food that is unhealthy generally 
tastes better?” We took invitees’ ratings as a measure of the 
explicitness of their beliefs in the unhealthy = tasty intu- 
ition; higher numbers indicated greater explicitness of 
belief. 


Results 


About a week after the party, the invitees were individually 
e-mailed and asked if they had suspected that the true pur- 
pose of the questionnaire administered during the party was 
different from the purported one. We dropped data from one 
individual who reported suspicion. 


Taste inferences. Tastiness and enjoyment of the Mango 
Lassi were highly correlated with each other (Cronbach's 
a = ./3); thus, we averaged them to form an enjoyment 
index. Using this index as the dependent variable, we con- 
ducted a mixed ANOVA with two levels of unhealthiness of 
Mango Lassi (healthy, unhealthy) as a between-subjects 
factor and explicitness of belief in the intuition as a contin- 
uous between-subjects factor. The results from this analysis 
appear in Figure 3. Consistent with our prediction (H5), the 
mean enjoyment ratings were higher for participants in the 
condition in which the milkshake was portrayed as being 
unhealthy (M = 5.13) than for participants in the condition 
in which the milkshake was portrayed as being healthy 
(M = 3.30; F(1, 35) = 24.29, p « .001). 


Role of explicitness of belief. Echoing findings from 
Experiment 2, the ANOVA also revealed a marginal health- 
iness X explicitness-of-belief interaction (F(1, 35) = 3.91, 
p = .06), indicating that the relationship between enjoyment 
of the milkshake and its perceived healthiness depended on 
participants’ explicitness of belief in the unhealthy = tasty 
intuition. Follow-up analyses revealed that the effect of 
unhealthiness on enjoyment was significant both at one 
standard deviation above the mean belief level (t(1, 35) — 
4.84, p « .001) and at one standard deviation below it (t(1, 
35) = 2.00, p = .05), indicating that portrayed healthiness of 
food significantly influenced participants’ enjoyment of the 
food, even among those who did not report agreeing with 
the unhealthy = tasty intuition. These results provide further 
support for our hypothesis that the influence of the intuition 
on consumption decisions can take place outside of 
awareness. 


Discussion 


Viewed in conjunction with the results from Experiments 1 
and 2, the results from Experiment 3 indicate that taste 
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FIGURE 3 
Experiment 3: Mango Lassi Enjoyment Ratings x 
Explictness of Belief That Food That Is Unhealthy 
Generally Tastes Better 





Healthy Condition Unhealthy Condition 


—@— Low belief that food that is 
unheatthy generally tastes 
better 


—— High belief that food that is 
unhealthy generally tastes 
better 





inferences made on the basis of perceived unhealthiness are 
confirmed on sampling the food, implying that a mecha- 
nism akin to the hypothesis confirmation bias may underlie 
the prevalence of the unhealthy — tasty intuition. Further- 
more, the finding that even participants who reported dis- 
agreeing that healthiness needs to be sacrificed in order to 
enhance tastiness enjoyed the milkshake more when it was 
portrayed as unhealthy (versus healthy) indicates that the 
influence of the unhealthy = tasty intuition can be implicit, 
thus conceptually replicating results from Experiments 1 
and 2. Finally, the results were consistent with our predic- 
tions even though we used a different (1) design (between- 
versus within-subjects), (2) product category, and (3) 
unhealthiness manipulation; this attests to the robustness of 
the influence of the unhealthy = tasty intuition. 


Experiment 4: Choice of Crackers 


The results from Experiments 2 and 3 suggest that people's 
belief in the unhealthy = tasty intuition is sufficiently strong 
to influence taste inferences and enjoyment of food. The 
objective in Experiment 4 was to further explore whether 
the intuition would influence choice decisions (see both 
Figure 1, Stage 3, and H5). We know from previous research 
that when consumers are faced with the task of choosing 
among food products, better-tasting options have a higher 
likelibood of being chosen when a hedonic goal is relatively 
more salient (e.g., Dhar and Simonson 1999; Shiv and 
Fedorikhin 1999). Extrapolating from these results, we 
expect that participants will prefer an option portrayed as 
being more (versus less) unhealthy when a hedonic goal is 
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relatively more salient because according to our hypothesis, 
people will infer this option to taste better (and thus will 
judge it as a better candidate for fulfilling the hedonic goal). 


Participants, Stimull, and Procedure 


Two hundred ninety-three undergraduate students (107 
men, 184 women, 2 unknown gender) participated in this 
experiment for course extra credit. They were first told that 
a national manufacturer of crackers was interested in 
assessing which of two versions of a cheddar-flavored 
snack cracker is preferred in choice tasks. Subsequently, 
information about good and bad fat (identical to the infor- 
mation we used in Experiment 1) was provided before par- 
ticipants chose between a cracker version with 11 grams of 
bad fat and 2 grams of good fat (unhealthy version) and one 
with 2 grams of bad fat and 11 grams of good fat (healthy 
version). 


Before indicating their choice, approximately half of the 
participants were told to imagine that they were “craving 
something really tasty” and that “they wanted to reward 
themselves with a nice snack.” The purpose of these state- 
ments was to prime a hedonic, or enjoyment, goal. The 
remaining participants were not exposed to such a prime; 
they were simply told to imagine that they were “in the 
mood for a snack.” The dependent variable was which 
cracker version participants selected in a forced-choice task. 
We elicited participants’ explicitness of belief in the 
unhealthy = tasty intuition using the same questions we 
used in Experiments 1 and 2. Consistent with H3, we 
expected that the relative preference for the unhealthy (ver- 
sus healthy) cracker version would be higher among partici- 
pants who were primed with the hedonic goal. 


Results and Discussion 


Manipulation check and choice. As we expected, par- 
ticipants perceived the cracker with the higher content of 
good (versus bad) fat content as healthier (Ms = 6.13 and 
3.05; t(291) = 1067.71, p < .01). To test our prediction that 
the relative preference for the unhealthy (versus healthy) 
option would be higher among those primed with a hedonic 
goal, we conducted a chi-square test. As we predicted, the 
proportion of participants who chose the unhealthy option 
was significantly greater among those who were primed 
with the hedonic goal (M = 21%) than among those who 
were not primed with this goal (M = 8%; y? = 15.88, p < 
.001; 9 = .23).° 


Role of explicitness of belief. To test the role of explicit- 
ness of belief, we conducted an ANOVA using the prime 
and explicitness of belief in the intuition as the predictor 
variables and choice as the dependent variable. The results 
revealed a nonsignificant prime x explicitness-of-belief 
interaction (F(1, 258) = .59, p > .44), indicating that, 
regardless of whether the participants reported agreeing 


*The proportion of participants who chose the unhealthy cracker 
was low even in the hedonic prime condition, presumably because 
participants did not want to indicate that they were interested in 
consuming a cracker that was explicitly portrayed as being 
unhealthy. 


with the unhealthy = tasty intuition, they made choice deci- 
sions as if they did. 

Thus, the results from Experiment 3 conceptually repli- 
cate those we obtained previously, providing evidence that 
the unhealthy = tasty intuition affects not just inferences 
and judgments but actual choice decisions as well. We 
found that when a hedonic goal was made temporarily more 
salient, participants were more likely to select the unhealthy 
cracker, presumably because they inferred that it would 
taste better. Furthermore, the lack of significance of the 
prime x explicitness-of-belief interaction suggests that the 
influence of the intuition can take place outside the aware- 
ness of consumers, indicating that controlling for the influ- 
ence of the intuition may be difficult. 


General Discussion 


Understanding the effects of food consumption habits on 
health has emerged as a topic of significance in recent 
years. The question of why consumers overconsume food 
despite the negative consequences of doing so has espe- 
cially piqued the interest of academics (e.g., Kozup, Creyer, 
and Burton 2003; Oakes 2005; Oakes and Slotterback 2005; 
Seiders and Petty 2004; Smith 2004; Tepper and Trail 1998; 
Urala and Lähteenmäki 2004; Verbeke 2005; Wansink 2004 
a, b; Wansink and Chandon 2006; Wansink and Huckabee 
2005). Previous findings are consistent with the idea that 
people overconsume food that is considered unhealthy 
either because they are unaware of the negative conse- 
quences of doing so (presumably because of poor marketing 
and/or presentation of nutritional information) or because 
they lack the willpower to resist such foods. In what we 
believe is the first rigorous empirical demonstration of its 
kind, our findings indicate that in addition to these factors, a 
more subtle force may be at work; namely, consumers over- 
consume foods that they perceive as unhealthy because they 
assume that such food tastes better. 

The results from Experiment 1, in which we used the 
well-established IAT procedure (Greenwald, McGhee, and 
Schwartz 1988), demonstrated not only that people sub- 
scribe to the unhealthy = tasty intuition but aiso that this 
intuition operates at an implicit level. Both participants who 
reported an explicit belief in the intuition and those who did 
not took longer to categorize stimuli in the incongruent 
(versus congruent) condition, which suggests that the 
implicit subscription to the unhealthy = tasty intuition is 
robust. 

The results from three other experiments confirmed our 
predictions that when information pertaining to the assess- 
ment of the healthiness of food items is provideli, foods per- 
ceived as less healthy are (1) inferred to taste better, (2) 
enjoyed more during actual consumption, and (3) preferred 
in a choice task when a hedonic goal is more (versus less) 
salient. We obtained these patterns of results even among 
those who disagreed that healthiness is inversely related to 
tastiness, which suggests that the influence of the 
unhealthy — tasty intuition can take place outside of aware- 
ness; this result conceptually replicates Experiment 1’s 
results, which demonstrate that subscription to the 
unhealthy = tasty intuition is implicit. 


Theoretical Implicatlons 


Validity of the intuition and reasons for its propagation. 
In light of our findings, an important question that emerges 
is whether the unhealthy = tasty intuition is valid, that is, 
whether the healthiness and tastiness of food items are 
indeed negatively correlated with each other in the real 
world. Although the intuition appears to enjoy widespread 
subscription (e.g., Keller, Sternthal, and Tybout 2002), there 
is little scientific evidence to support the view that tastiness 
and healthiness are negatively correlated with each other. 
Indeed, from an evolutionary standpoint, evidence points to 
the opposite, namely, that tastiness and healthiness are posi- 
tively correlated with each other. For example, in general, it 
is accepted among researchers (e.g., Drewnowski 1997; 
Smith 2004) that certain food groups (e.g., carbohydrates, 
fat) are perceived as tastier precisely because these foods 
have proved effective for survival; in these researchers’ 
view, tastiness has served humans well as a proxy for 
healthiness. 

Why then do negative health consequences result from 
consuming the food groups (e.g., carbohydrates, fat) that 
are, evolutionarily speaking, considered healthy? Medical 
evidence suggests that it is the overconsumption, not mod- 
erate levels of consumption, of these food groups that leads 
to negative health consequences (e.g., Drewnowski 1997).6 
Indeed, consuming fat and carbohydrates in moderation has 
been found to have salutary effects on health (Better Health 
Channel 2004). Consistent with this view, findings indicate 
that overconsumption of any food group can result in nega- 
tive health consequences. 

There is yet another reason to question the validity of 
the unhealthy = tasty intuition. Note that to assess the actual 
relationship between healthiness and tastiness, it is first 
necessary to measure the healthiness and tastiness of vari- 
ous food options accurately. We believe that this is a decep- 
tively difficult task because neither judgments of healthi- 
ness nor those of tastiness are likely to be stable across time 
or across people. Oakes's (2005) findings demonstrate how 
judgments of healthiness are notoriously susceptible to con- 
textual influences. In one study, for example, Oakes finds 
that a lower-calorie snack portrayed as being “disreputable” 
in terms of health (i.e., a Snickers miniature bar with 47 
calories) was inferred to promote greater weight gain than 
one portrayed as being more "reputable" in terms of health 
(1.e., a half cup of green peas with 62 calories). Thus, par- 
ticipants in this study relied on their subjective perceptions 
of the snacks' healthiness rather than on more diagnostic 
information (the snacks' caloric value) to infer their poten- 
tial for weight gain. There is reason to believe that judg- 
ments of tastiness may be similarly susceptible to contex- 
tual influences. For example, Carpenter and Nakamoto's 
(1989) findings suggest that what people happen to taste 


6Needless to say, underconsumption of nutrients can also lead to 
negative health consequences. Indeed, malnourishment is the 
world's deadliest cause of premature death (World Health Organi- 
zation 2002). However, because this research is restricted to the 
context of the United States, where food is plentiful, we restrict 
our discussion to negative consequences of overconsumption. 
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first (e.g., Coke or Pepsi) may become their preferred 
option. An overview of other relevant findings suggests that 
there is little agreement (among adults) about tastiness rat- 
ings of food (see, e.g., Drewnowski 1997, p. 240). 

Nonconscious influence. It is well known that con- 
sumers’ decisions are influenced by their lay theories and 
intuitions even when the lay theories are invalid (Broniar- 
czyk and Alba 1994; Shiv, Carmon, and Ariely 2005). What 
is noteworthy in the current research is that our predictions 
were confirmed across two different types of food products, 
three types of judgment tasks, and two conceptually differ- 
ent types of unhealthiness manipulations. The generality of 
our findings across these experimental differences hints at 
the prevalence and strength with which people believe in 
the unhealthy = tasty intuition. Even more intriguing, how- 
ever, is the indication of the implicit nature of the influence 
of the intuition. The results from Experiment 1 confirmed 
that people may have implicitly learned to correlate 
unhealthiness with tastiness. The results from Experiments 
2 and 3 offer conceptual replication by showing that even 
participants who reported disagreeing with the view that 
healthiness and tastiness are inversely correlated behaved as 
if they did. These findings indicate that the influence of the 
intuition may lie outside the awareness of consumers mak- 
ing judgments about what to consume. At a conceptual 
level, this notion shares similarities with Lewicki and col- 
leagues' (e.g., Lewicki 1986; Lewicki and Hill 1989; 
Lewicki, Hill, and Czyzewska 1997) work on nonconscious 
learning. The idea that a person's judgments and decisions 
may be influenced by factors outside his or her awareness is 
also echoed in various other findings, such as mere expo- 
sure effects (Zajonc 1980), truth effects (Hawkins and Hoch 
1992), and assimilation/contrast effects (e.g., Schwarz and 
Bless 1992). 


Managerlal, Consumer, and Public Policy 
Implications 


Although the obesity problem afflicting people in the 
United States may be due, in part, to the ubiquity of food 
high in fat content, our findings indicate that even if options 
with less fat were available, consumers may choose to con- 
sume foods high in fat because of a misguided notion that 
they need to eat unhealthy to eat tasty or that eating healthy 
means having food that is not tasty. Evidence from food sci- 
entists suggests that despite current concerns about healthy 
eating, health is secondary to taste in selecting many food 
products (for a discussion of taste versus health in con- 
sumer selection of corn chips, see Tepper and Trail 1998). 
There is also evidence in recent research on functional 
food—a new category of products that promise consumers 
improvements in targeted physiological functions, such as 
lowering cholesterol (Urala and Lähteenmäki 2004)—that 
consumers are unwilling to compromise on taste even for 
the sake of the health benefits these foods offer (Verbeke 
2006). This lesson was perhaps most starkly revealed in the 
interesting story of the McLean, a low-fat burger that 
McDonald's tried to market in the early 1990s. Although 
the burger was actually preferred to burgers that contained 
more fat in blind taste tests, it was a miserable market fail- 
ure (Gladwell 2001). 
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Given our evidence that the influence of the unhealthy = 
tasty intuition is strong and may take place outside the con- 
sumer's awareness, this intuition could be a major con- 
tributing factor to obesity in consumers. Therefore, a key 
question is, What (if anything) can be done to counteract its 
potentially devastating impact on consumer food choice 
decisions and, subsequently, on consumer health? 

More research is being directed at understanding both 
how to market products such that consumers make better 
food choices (Andrews, Netemeyer, and Burton 1998; 
Kozup, Creyer, and Burton 2003; Seiders and Petty 2004; 
Wansink and Huckabee 2005) and how to help consumers 
make changes in their environment to help control their 
food consumption (Wansink 2004a). These streams of 
research offer guidance to the food industry, which has been 
accused of contributing to the obesity epidemic, and to con- 
sumers who want to make healthier choices that are still 
enjoyable, despite the pervasiveness of a belief in the 
unhealthy — tasty intuition. Possibilities for controlling the 
effect of the unhealthy = tasty intuition (and its potential for 
causing negative health consequences) include (1) control- 
ling the volume of unhealthy but tasty food that 1s eaten, (2) 
changing unhealthy foods to make them less unhealthy but 
still tasty, and (3) providing consumers with better informa- 
tion about what constitutes “healthy.” We address each of 
these possibilities in the remainder of this discussion. 


Limiting consumption volume of unhealthy foods. If 
consumers infer that unhealthy foods are tastier, they will 
possess a strong desire to eat foods that are relatively 
unhealthy (e.g., foods that are high in fat or high in calo- 
ries). À possible solution to this problem would be for both 
marketers and consumers to take steps to ensure that if 
these foods are going to be eaten, they are eaten in reason- 
able quantities. Such steps are important because, as we 
mentioned previously, medical evidence suggests that it is 
the overconsumption, not moderate levels of consumption, 
of foods considered unhealthy that leads to negative health 
consequences (e.g. Drewnowski 1997). Therefore, mea- 
sures aimed at restricting consumption of foods considered 
unhealthy (but tasty) to moderate levels can go a long way 
toward counteracting the potential negative health conse- 
quences of a belief in the unhealthy = tasty intuition. 

Responsible marketers and individual consumers can 
take such steps to control consumption volume of unhealthy 
foods. For example, Wansink and Huckabee (2005) suggest 
that marketers can guide consumers' volume decisions by 
changing packaging (e.g., offering multipacks with smaller 
individual servings, offering premium-priced smaller pack- 
aging), changing the volume of implied serving sizes in 
packaging information (e.g., designating a serving as three 
cookies versus six), and providing consumers with easier 
ways to assess how much they are consuming (e.g., placing 
markers on the outsides of packaging to indicate a serving 
size, placing indicator lines inside packaging). Consumers 
can also make changes to their personal environment to 
help reduce consumption. 

Wansink (2004a) offers many practical alterations that 
can be made to both the eating and the food environments, 
including pre-serving portions when snacking while dis- 
tracted (e.g., while watching television); repackaging foods 


into small containers to suggest smaller consumption 
norms; never eating directly from a package; and using 
smaller plates, bowls, and eating utensils. Such steps can 
allow marketers to continue to market unhealthy foods in 
responsible ways and allow consumers to continue to enjoy 
the foods they consider tasty, albeit in moderation. 
Although many of these changes seem small, Wansink and 
Huckabee (2005) point out that 80% of the population gain 
weight because of a calorie excess of less than 50 calories a 
day. The changes suggested here could allow consumers to 
enjoy the foods they consider tasty in smaller, controlled 
quantities, perhaps enough to avoid consumption of those 
additional 50 calories. 


Reformulating unhealthy foods. Another possibility for 
controlling the effects of the unhealthy = tasty intuition is to 
change the composition of unhealthy foods. One alternative 
is to reformulate high-energy-density foods (i.e., foods high 
in calories) to lower their energy density (i.e., calorie con- 
tent) by replacing some of the fat with water, fiber filler, or 
vegetables (Wansink and Huckabee 2005). Although evi- 
dence from our studies suggests that people believe that 
food cannot be made healthier without sacrificing taste 
(and, thus, that these lower-energy-density foods would be 
less appetizing), taste tests indicate otherwise. Rolls, Ello- 
Martin, and Tohill’s (2004) research suggests that con- 
sumers who eat lower-energy-density foods eat the same 
volume as they would of the unmodified food (resulting in 
fewer calories consumed), rate themselves as equally satis- 
fied as those who eat full-calorie dense foods, and, most 
important in our research context, do not perceive the foods 
as tasting worse (Kral and Rolls 2004; Rolls 2005; Wansink 
and Huckabee 2005). Research suggests that up to 20% of 
the fat in a high-energy-density food can be replaced with 
low-density items (e.g., fruits, vegetables) without con- 
sumers noticing a difference in taste (Rolls, Ello-Martin, 
and Tohill 2004). 

Such findings suggest that many consumers are not 
knowledgeable enough about food composition to realize 
that the unhealthy foods they love can be successfully refor- 
mulated to be just as tasty without being as unhealthy. 
Because such modifications could have a large impact on 
the number of calories consumed, they could be very bene- 
ficial if consumers are educated to believe that the healthier 
version of their favorite unhealthy food is just as tasty. 
However, because of the pervasiveness of the unhealthy — 
tasty intuition, such a strategy could backfire (i.e., despite 
research evidence suggesting the taste had not truly 
changed, consumers could respond to such a modification 
as they did in Experiment 3, in which they 'allowed the 
knowledge that a food is now healthier to color their actual 
taste perceptions). 

To combat this problem, an obvious solution would be 
to stress both taste and health for these items, with the taste 
dimension being the dominant dimension. By stressing the 
taste dimension first, companies could work on breaking 
down the unhealthy — tasty intuition and increasing the 
probability of choice. Marketing strategies involving sam- 
pling, credible sources, and opinion leaders would be criti- 
cal. First, sampling would be a key factor in gaining accep- 
tance for these new, healthier products because consumers 


will not believe that they are tasty unless they have tried the 
product themselves. À free sample represents a low-risk 
way to experience the taste. These samples could be distrib- 
uted through the mail or, more appropriately, handed out in 
health clubs or at sporting/fitness events. 

Second, well-respected health or fitness experts could be 
employed to endorse these products. To change engrained 
beliefs, a credible source is needed. Consumers are more 
likely to accept a message about a product when the source 
is considered trustworthy and viewed as an expert on the 
topic (e.g., Goldsmith, Lafferty, and Newell 2000; Sternthal, 
Dholakia, and Leavitt 1978). This technique could be par- 
ticularly effective if these sources are opinion leaders. For 
example, health club employees and trainers are often 
looked to by consumers for tips on exercise and diet. À mar- 
keting or public policy program could educate and persuade 
these people on the tastiness and health benefits of the new 
products. They could then be important transmitters of this 
information to consumers. Thus, this message would be 
more likely to be accepted and acted on. 

Another possibility might be to stimulate "buzz" for 
certain products among consumers. If marketers could con- 
vince a key group of consumers in the word-of-mouth net- 
work that their food products taste good and are part of a 
healthy diet, this group could influence and excite other 
groups of consumers. A similar phenomenon occurred in 
the fast and widespread acceptance of the Atkins diet, in 
which consumers were willing to give up foods high in car- 
bohydrates for quick weight loss. 


Reeducating consumers about what is "healthy." A final 
avenue for counteracting the effects of the unhealthy = tasty 
intuition is to educate consumers about what constitutes 
healthy. Determining what is healthy or unhealthy is a 
deceptively difficult task because, as we discussed previ- 
ously, healthiness is not an inherent quality of food; rather, 
it depends on the conditions (e.g., quantity, genetic makeup 
of the person consuming it) under which it is consumed. 
Nevertheless, consumers can be educated to approach nutri- 
tion in a more balanced fashion than they currently do. In 
Oakes's (2005) study, which we described previously, con- 
sumers overemphasized fat content when they assessed the 
healthiness of foods and relied on a categorization system 
(of foods as bad versus good) using stereotypes about types 
of food/ingredients to determine how much weight a food 
would cause someone to gain. Had tbey relied instead on a 
more complete nutritional assessment of the foods, they 
would have found that the food they considered healthy on 
the basis of fat content (peas) would actually lead to more 
weight gain than the food they considered unhealthy 
(Snickers) because of the caloric content of the two foods. 

What can be done to convince consumers to consider all 
types of nutritional information? A possibility 1s to develop 
an easy-to-use nutrition index that combines the various 
types of relevant information into a single number, which 
consumers could then use to make easy comparisons across 
different types of foods. This would enable consumers to 
choose healthier food with minimal effort and confusion. 
Furthermore, reeducating consumers to evaluate the health- 
iness of a food could actually change consumers' percep- 
tions about what constitutes healthy and unhealthy. 
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Limitations and Further Research 


An interesting implication of our main hypothesis (that 
people subscribe to the unhealthy = tasty intuition) is that 
regardless of which food group is associated with unhealth- 
iness, food items that are higher (versus lower) in that 
ingredient will be inferred to be tastier. Thus, for example, 


in contexts in which carbohydrates are considered 
unhealthy (as is likely to be the case among those following 
the Atkins diet), food items higher (versus lower) in carbo- 
hydrates will be (1) inferred to taste better, (2) enjoyed 
more, and (3) preferred when a hedonic goal is more (ver- 
sus less) salient. Indeed, the results from Experiment 3, in 
which no mention was made of the particular ingredients 
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Enjoyable Not Enjoyable 
Tasty Disilked 
Delicious Less tasty 
Yummy Unappealing 
Appetizing Bland 
Flavortul Flavorless 
Appealing Unappetizing 
Mouthwatering Unpalatable 


that made the Mango milkshake more or less healthy offer 
conceptual support for this hypothesis. 

An important related empirical question "T is still 
unanswered by the current research is whether other 
manipulations of unhealthiness would produce similar 
results in terms of taste inferences, enjoyment, and choice. 
For example, would altering the content of preservatives 
(assuming that they are considered unhealthy) alter percep- 
tions of tastiness in a manner consistent with the 
unhealthy = tasty intuition? If not, identifying the manipula- 
tions that lead to the types of results we document in this 
research would be important for both theoretical and sub- 
stantive reasons. Similarly, further research should ascertain 
the boundaries of the content of unhealthy "ingredients 
within which results consistent with the unhealthy — tasty 
intuition will be obtained. Is there a limit beyond which 
increasing unhealthiness does not lead to enhanced infer- 
ences of tastiness? Levin and Gaeth's (1988) results suggest 
that there is. They find that beef that was portrayed as 2596 


fat was expected to taste worse than beef that was portrayed 
as 75% lean. These results indicate that there is a range 
within which increasing the portrayed unhealthiness of food 
leads to enhanced perceptions of tastiness but also a range 
beyond which it does not. 

A final and interesting extension of our work would be 
to assess whether and to what extent socioeconomic and 
cultural factors moderate our effects. In contexts in which 
food is not plentiful (e.g., in developing countries, such as 
India or China, and in underdeveloped countries, such as 
Somalia or Cambodia), people may believe tastiness is posi- 
tively correlated with healthiness, as evolution intended 
(Drewnowski 1997; Smith 2004). If so, we may expect to 
find a pattern of results that is opposite to that which we 
find in this research. Similarly, in epicurean cultures (e.g., 
France), in which food is valued primarily for its tastiness, 
it is possible that the perception of an inverse relationship 
between healthiness and tastiness is held with less 
conviction. 
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Broadening the Scope of Reference 
Price Advertising Research: A Field 
Study of Consumer Shopping 
Involvement 


The current understanding of reference pricing requires broadening to refiect contemporary advertising practices 
and shopping behavior. A content analysis of 13,594 newspaper retail advertisements reveals that it is uncommon 
for reference pricing to be used alone in advertisements: 87.2% of the advertisements featuring reference pricing 
also include limited-time availability (i.e., "Three days only!”) or sale announcements in the headline or copy. Using 
dual-process theories of persuasion as a conceptual framework, the authors conduct three field experiments that 
compare the conditions under which the use of limited-time availability and sale announcements in advertisements 
that feature reference prices affects consumer price perceptions and store shopping intentions. The findings yield 
three insights for reference price advertising. First, reference pricing results in favorable price perceptions and store 
shopping intentions only when consumers are shopping for a product. Second, when consumers are shopping for 
a product, reference pricing paired with limited-time availability results in more favorable price perceptions and store 
shopping intentions than the use of elther technique alone. Third, the use of sale announcements and limited-time 
availability in reference price advertisements has a favorable effect on price perceptions and store shopping 
intentions when consumers are not shopping for a product. 


Regularly: $599; Now: $359 
Sale! Three Days Only 


ost consumers have seen variations of one or both 
of these two headlines in a retail advertisement. 
The top headline features reference, pricing, in 
which a current, lower price is compared with a former, 
higher price. The bottom headline features scarcity— 
specifically, the limited-time availability of the sale offer. 
The particular wording and juxtaposition of each headline 
represent information and value cues that are intended to 
influence consumer price perceptions favorably. Does this 
information achieve its intended effect, and is there a differ- 
ence in the effect when consumers are or are not "in the 
market" for a product? Does the use of redundant value 
cues improve or have no effect on price perceptions, and 
what conditions influence this? In this article, we explore 
these questions and examine why and when commonly used 
limited-time availability and sale announcements affect 
consumer attitudes toward product prices and intentions of 
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shopping at stores featured in retail advertisements that use 
reference prices. In doing so, we attempt to broaden the 
scope of research on reference price advertising and under- 
stand its implications for practice. 

Although reference pricing, limited-time availability, 
and sale announcements appear with some regularity in 
retail advertising, little empirical evidence documents the 
incidence and coincidence of their use. For example, a con- 
tent analysis of sales advertisements by six major retail 
chains over the 2001 Labor Day weekend revealed that 
approximately 72% employed reference pricing in their 
advertisements (Chandrashekaran, Monroe, and Viswana- 
than 2004). Although not reported, it is reasonable to specu- 
late that many, if not most, of these advertisements also 
contained headlines or copy heralding a “Three days only!” 
Labor Day weekend sale to convey the limited-time avail- 
ability of price reductions offered. 

To obtain a broader perspective on the incidence and 
coincidence of reference pricing, limited-time availability, 
and sale announcements in retail advertising, we conducted 
a content analysis of 13,594 retail advertisements in four 
major, geographically dispersed U.S. newspapers over a 
six-month period in 2004.! We examined the Chicago Iri- 





1Sources for all four newspapers confirmed that advertisements 
that are 17 square inches or larger in size (e.g., two columns 
wide x four inches 1n height) account for more than 90% of the 
display ad revenues, and we examined all such advertisements. We 
collected data from four newspapers to provide representation of 
different regions of the country for a 195-day period that included 
Memorial Day through Thanksgiving. Two research assistants per- 
formed the coding, and we confirmed it. Reference pricing 
included all forms of price comparisons in which higb/low dollar 
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bune, The Dallas Morning News, Los Angeles Times, and 
The New York Times. Each advertisement was coded to 
identify whether reference pricing, limited-time availability, 
and sale announcements were included in the advertise- 
ment’s headline or copy. Table 1 presents the frequency 
counts based on the content analysis. Several findings 
deserve mention. Approximately 2996 of the advertisements 
featured reference pricing, limited-time availability, or a 
sale announcement in the headline or copy either alone or in 
combination. Sale announcements alone were displayed 
most often (9.0% of advertisements), followed by limited- 
time availability alone (7.9% of advertisements), and refer- 
ence pricing alone (.9% of advertisements). All three 
appeared concurrently in 2.7% of the advertisements. 
Limited-time availability was present in 58.8% of the 
advertisements that featured reference pricing, and sale 
announcements appeared in 69.3% of those advertisements. 
Sale announcements appeared in 47.096 of the advertise- 
ments that contained limited-time offers. Finally, and most 
important, 87.296 of the 912 advertisements that displayed 
reference prices also included sale announcements or 
limited-time availability in the headline or copy. These find- 
ings demonstrate that retailers commonly pair reference 
price information with limited-time availability and sale 
announcements in their newspaper advertisements. 

Our finding that reference pricing is rarely used in isola- 
ton in retail advertisement headlines and copy stands in 
stark contrast to behavioral research on reference price 
effects (for literature reviews, see Biswas, Wilson, and 
Licata 1993; Compeau and Grewal 1998). Prior behavioral 
reference-pricing research has focused almost exclusively 
on price comparisons and the manner in which they are por- 
trayed but does not consider the coincidental use of limited- 





amounts were explicitly stated (e.g., "Regularly $ , Now 
$ 7^; "Manufacturer's List Price $ , Sale Price $ a 
Limited-time availability included wording such as “ Days 
(Hours) Only"; “Hurry! Offer Ends Soon”; and “This Weekend 
Only!" Sale included the word "sale" and synonyms, such as 
"clearance/closeout," “liquidation,” “going out of business,” 
“deal,” or "special" (with the word “sale” itself accounting for 
80.196 of the occurrences). 





time availability and sale cues, which is the more common 
practice in retail advertisements. 

The managerial significance of understanding why and 
when these variables influence consumer price and value 
perceptions becomes apparent in light of the dollar expendi- 
tures for retail newspaper advertisements that display them. 
Assuming that our content analysis of retail advertisements 
in four major newspapers generalize nationally, we esti- 
mated that $8.2 billion was spent in 2004 on retail advertis- 
ing that contained one or more of these appeals, or approxi- 
mately 18.3% of total retail newspaper advertising 
expenditures. 


Theoretical Perspectives and 
Research Purpose 


The elaboration likelihood model (Petty and Cacioppo 
1986) and the heuristic-systematic model (Chaiken 1980, 
1987) are dual-process theories of persuasion that provide a 
foundation for this research. Both models postulate that 
high versus low motivation to process advertising informa- 
tion, operationalized through consumer involvement, results 
in different routes to persuasion: central/systematic 
(engaged through high-involvement processing) versus 
peripheral/heuristic (engaged through low-involvement pro- 
cessing). Different types of information are believed to have 
a persuasive impact under these different routes. This 
research examines the possibility that reference pricing, 
limited-time availability, and sale cues have inherent differ- 
ences in their information-processing requirements, and 
thus consumers who are shopping (high involvement) ver- 
sus those who are not shopping (low involvement) for a 
product featured in an advertisement respond to them dif- 
ferently. In particular, we argue that comprehending the full 
value implications inherent in reference price information 





2These estimates for 2004 assume zero ad revenue growth from 
$44.939 billion spent in 2003 (Newspaper Association of America 
2006), with 7096 of ad revenues coming from display advertise- 
ments (Belch and Belch 2004) and 90% of display revenues com- 
ing from advertisements that are 17 square inches or larger in size. 


TABLE 1 
Incidence of Reference Pricing, Limited-Time Offers, and Sale Announcements Featured In Newspaper 


Advertisements (N = 13,594) 
i a ie ea A 
Reference Pricing Featured in Advertisement 











Yes No 
Limited-Time Offer Limlted-Time Offer 
Featured in Advertisement Featured in Advertisement 

Sale Announcement Yes No Yes No Total 
—M—ÓMÓ—Ó—————M——À ene a e o dcr S 
Sale featured in advertisement 373 259 730 1222 2584 

(2.7%) (1.9%) (5.4%) (9.0%) (19.0%) 
Sale not featured in advertisement 163 117 1079 9651 11,010 

(1.2%) (.996) (7.996) (71.096) (81.096) 
Total 536 376 1809 10,873 13,594 

(3.996) (2 896) (13.396) (80.096) (10096) 
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requires more thought and effort and, thus, greater involve- 
ment than limited time or sale cues. 

The purpose of this investigation is to broaden the scope 
of reference price research to reflect contemporary retail 
advertising practices. Specifically, we examine the context 
in which and extent to which limited-time availability and 
sale cues in retail advertisements that feature reference pric- 
ing affect consumer price persuasion and store shopping 
intentions. We provide an overview of research on the major 
variables we examine, present our hypothesized expecta- 
tions, and report our findings from three field experiments. 


Reference Price Effects 


A reference price advertisement is one in which a lower, 
current price is compared with a higher price either previ- 
ously offered or offered by someone else. Considerable 
research in the marketing literature confirms the presence of 
reference price effects (Biswas, Wilson, and Licata 1993; 
Compeau and Grewal 1998; Kalyanaram and Winer 1995; 
Krishna et al. 2002; Mazumdar, Raj, and Sinha 2005). That 
is, when reference and retail (offering) prices are presented 
together, they are compared, and this comparison has a 
favorable effect on consumer deal perceptions relative to a 
condition in which reference prices are not presented. Dif- 
ferent theoretical arguments have been advanced to account 
for reference price effects, including assimilation—contrast 
(Sherif, Sherif, and Nebergall 1965), adaptation-level 
theory (Helson 1964), and anchoring-adjustment principles 
(Fversky and Kahneman 1971). Although these theories 
make different predictions, they also share commonalities 
(see Mussweiler 2003). Each agrees that all judgments are 
relative and how a person evaluates something depends on 
what he or she uses as a basis for comparison. Furthermore, 
the location of an initial stimulus on an evaluative contin- 
uum (e.g., good versus bad and high versus low) can influ- 
ence perceptions of a second stimulus on the same contin- 
uum, such that if the second is sufficiently different from 
the first, a person will judge it to be more different than it 
really is. In this way, a person may perceive a low price for 
a product as lower relative to a high reference price than 
when he or she evaluates the same low price in the absence 
of a reference price, all else being equal. 

However, there is research that questions the magnitude 
and, indeed, the presence of reference price effects. This 
research suggests that for price- and promotion-sensitive 
consumers in particular, reference pricing effects may be 
confounded by other variables (Chang, Siddarth, and Wein- 
berg 1999; Krishna, Currim, and Shoemaker 1991). In sum- 
marizing these and related studies, Neslin (2002) concludes 
that research is necessary to understand how other informa- 
tion contained in advertised promotions influences refer- 
ence price effects. Our content analysis of retai] newspaper 
advertisements confirms the need for such research. 


Limited-Time Availability and Sale Cue Effects 


With considerably less attention, the marketing literature 
has addressed limited-time availability or, more broadly, 
scarcity and sale cues in retail advertisements.? The evi- 


3Our newspaper content analysis examined different scarcity 
variations. We based our decision to focus on limited time in this 


dence that exists suggests that limiting a consumer's oppor- 
tunity to obtain a product makes the product more attractive 
and affects value perceptions. Empirical data in support of 
the effectiveness of scarcity in influencing value percep- 
tions have been reported for fast food (Brannon and Brock 
2001), batteries (Inman, Peter, and Raghubir 1997), nylon 
hosiery (Fromkin et al. 1971), and women's suits (Szybillo 
1976). Anecdotal evidence of its effectiveness in selling 
other products also has been reported (see The New York 
Times 1999; The Wall Street Journal 2003). 

With limited-time appeals in advertising, consumers are 
informed that they have a limited amount of time (e.g., 
“Three days only!") to act on an offering. Although there 
are disagreements about the mechanism by which scarcity 
cues, in general, affect value perceptions (see Lynn 1992), 
limited-time availability, in particular, seems to imply value 
because people have learned over the course of their lives to 
associate limited-time events with price deals (Cialdini 
2001). Thus, favorable price inferences are readily accessi- 
ble in memory on exposure to limited-time availability 
cues. Notably, research has documented the same response 
to sale cues that use this rationale (Inman and McAlister 
1993; Inman, McAlister, and Hoyer 1990), and it has been 
suggested that retail advertisements that contain reference 
price information should include terms such as “sale” or 
"special" to communicate that the lower price or discount is 
only temporary (Grewal et al. 1998). Similarly, the need to 
include time limits in reference price research and to study 
their effect on consumer response to advertised price deals 
has been emphasized (Krishna et al. 2002). However, no 
research has systematically examined these cues in a refer- 
ence price advertising context. 


Consumer Involvement Effects 


Following the work of Zaichkowsky (1985), we define 
"involvement" as the level of personal relevance that a 
product or purchase decision has for a consumer. Thus, we 
contend that people who are shopping for a product are 
more highly involved with information in an advertisement 
for that product than people who are shopping for some- 
thing else. Consumer involvement has been shown to influ- 
ence both the extent to which people process an advertise- 
ment and the type of information included (Chaiken 1980, 
1987; Petty and Cacioppo 1986). Highly involved con- 
sumers are motivated to scrutinize information, such as 
prices, more fully, whereas less involved consumers are 
likely to apply simple heuristics, or judgment-relevant cues 
that are easily understood (Chen and Chaiken 1999), when 
processing information (Meyers-Levy and Peracchio 1996), 
such as the presence of limited-time availability and sale 
notifications in an advertisement. 

From an information-processing perspective, we pro- 
pose that people should make favorable price inferences 
more easily on the basis of limited-time availability and sale 
cues than reference price information in retail advertise- 
ments. We expect to find this because the implications of 
limited-time availability and sale announcements have been 


research on the finding that limited time not only was the most fre- 
quently used but also was used 3.5 times more often than the next 
most common variation (limited number/quantity). 


Broadening the Scope of Reference Price Advertising Research / 187 


learned (Cialdini 2001) and can function as heuristic or 
peripheral cues (Chaiken 1980, 1987; Petty and Cacioppo 
1986) that signal value. However, the value implied by ref- 
erence pricing is not as easily inferred. Given the great vari- 
ability that exists in the use of discounts in reference price 
advertisements (our content analysis revealed that offering 
prices ranged from 1% to 90% off advertised reference 
prices), consumers who want to have confidence in the 
accuracy of their judgments about price must derive, at 
some level, the value being offered rather than retrieve it 
from memory, as with limited-time availability and sale 
related cues. Thus, the cognitive processes involved in 
judging price favorability should be more complex when 
using reference price information than when using limited- 
time availability and sale cues. These differences in 
information-processing requirements should affect con- 
sumers when reference price information and limited-time 
availability or sale cues are most persuasively presented in 
retail advertisements. 

Although multiple methods of conceptualizing and 
manipulating involvement have been used (for an overview, 
see Peter and Olson 2005), an operationalization that 
directly links active purchase intent among consumers with 
the processing of information common to retail advertise- 
ments in the marketplace is notably absent in the reference 
pricing literature. This is understandable because approxi- 
mately 90% of the behavioral studies on reference pricing 
have been conducted in laboratory settings with student 
samples (Biswas, Wilson, and Licata 1993; Compeau and 
Grewal 1998). We contend that a clear stratification of per- 
sonal product relevance is evident among actual consumers 
who are (are not) actively shopping for a particular good.4 
For example, a consumer who is engaged in shopping for a 
bedroom set will be more likely to process information in 
an advertisement for that product carefully than will a per- 
son who is interested in buying a sofa. A consumer who is 
shopping for a sofa may only incidentally or peripherally 
process information in an advertisement for a bedroom set. 
To our knowledge, no reference price advertising research 
has operationalized consumer involvement in a field setting 
and examined its influence on consumer price perceptions. 


Study 1 
Hypothesis 


We designed Study 1 to examine the effects of reference 
price information and a limited-time availability cue on 
consumer price perceptions across levels of consumer 





^Involvement may be conceptualized as either "enduring" or 
“situational” (see Hoyer and MacInnis 2004). Endunng involve- 
ment reflects an inherent interest in a product independent of a 
purchase situation (see Chandrashekaran 2004; Chandrashekaran 
and Grewal 2003). Situational involvement reflects interest in a 
product caused by situational events, such as a purchase situation. 
We designed our manipulation of involvement to be situational: 
We wanted to evaluate ad effects on consumers who were actively 
engaged in marketplace shopping behavior. 
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involvement. We hypothesize that reference price informa- 
tion in an advertisement is likely to be effective in favorably 
influencing price perceptions only for consumers who are 
in the market for a product (high involvement). We expect 
that only these consumers will be motivated to expend the 
cognitive effort necessary to process reference price infor- 
mation fully. Conversely, we expect that consumers who are 
not in the market for a product (low involvement) will not 
have an interest in processing information to the degree 
necessary to make an informed judgment about the value 
represented by the reference price. This expectation is con- 
sistent with Yalch and Elmore-Yalch's (1984) suggestion 
that peripheral processors sometimes avoid numerical infor- 
mation. Thus, reference price information in advertisements 
should affect consumer price attitudes under high- but not 
low-involvement conditions. 

We also hypothesize that limited-time availability in an 
advertisement should have a favorable effect on price judg- 
ments under both high- and low-involvement situations. As 
Petty and Wegener (1999) note, the same information can 
have an effect on persuasion across conditions of involve- 
ment, but different information-processing mechanisms are 
used. Specifically, although limited-time information can 
operate as a peripheral cue under low-involvement condi- 
tions, it would be evaluated "like all other available infor- 
mation" (Petty and Wegener 1999, p. 59) under high- 
involvement conditions. Its effect on price persuasion under 
high-involvement conditions would then depend on whether 
the limited-time availability of the offer has merit in evalu- 
ating price. Inman, Peter, and Raghubir (1997) argue that 
scarcity can operate both as a peripheral cue and as infor- 
mation evaluated on its merits. They find that consumers 
believe that limited-time availability has merit as an indica- 
tor of likely demand for the product. We believe that this 
should be especially true in times of consumer uncertainty, 
for example, when people consider the purchase of infre- 
quently shopped goods, such as consumer durables. Under 
such conditions, and before the advertised product is physi- 
cally examined, limited-time information should prove use- 
ful as a market valuation attribute, and therefore people 
should consider it along with other knowledge structures 
when evaluating price. 

H,: Limited-time availability significantly affects price atti- 

tudes among consumers who are not shopping for a 
product. 


Ho: Limited-time availability and reference pricing signifi- 
cantly affect price attitudes among consumers who are 
sbopping for a product. 


These hypotheses are based on our expectation that con- 
sumers who are shopping (not shopping) for a product will 
(will not) allocate the cognitive effort necessary to process 
fully the value implications of reference prices in an adver- 
tisement for that product. Reference prices should affect 
price perceptions under high- but not under low- 
involvement conditions. However, the value implications of 
limited-time information should be easily cued from mem- 
ory and used when consumers form price attitudes under 
both high- and low-involvement conditions. Thus, the type 
of ad information (limited-time availability, reference pric- 
ing) to which consumers are exposed will be differentially 


persuasive depending on the level of consumer shopping 
involvement. 


Products and Prices 


We tested our hypothesis over two different products: bed- 
room sets and sofas. Both products come in a wide range of 
prices and quality grades within and across furniture stores. 
Most people infrequently purchase bedroom sets and sofas 
and are inexperienced shoppers for these goods, Interviews 
with personnel at the store chain we used in this'study, cou- 
pled with furniture industry research (Roy 2002), indicate 
that brand familiarity, store loyalty, and purchase frequency 
are low among furniture shoppers. We selected these prod- 
ucts with the'intent of heightening marketplace ambiguity 
for even high-involvement consumers. As Chandrashekaran 
(2004; see also Chandrashekaran and Grewal 2003) dis- 
cusses, when consumers have a high degree of product and 
price knowledge, reference pricing may not wórk. Indeed, 
the effectiveness of advertised reference prices appears to 
depend on some level of consumer uncertainty about prod- 
uct price levels in the marketplace. We believe that such 
uncertainty will be true in general for infrequently pur- 
chased goods with low levels of brand familiarity. 

Another reason for selecting bedroom sets and sofas is 
that they are often priced differently, thus allowing an 
examination of whether experimental results generalize 
across products with such prices. Store personnel and 
prospective customers (from stores in which study partici- 
pants were obtained) were interviewed to determine the 
purchase price range for the advertised products and the 
content and layout of the advertisements. As a result of 
these interviews, we set reference prices at the upper end of 
the most common purchase price range, and we set the 
offering prices at the lower end of the most common pur- 
chase price range. We did this to enhance respondent per- 
ceptions of the lower prices as "deals" (Lichtenstein and 
Bearden 1989).5 The offering prices for both products rep- 
resent a 40% markdown from the reference price. Our ad 
review revealed that the mean markdown (off reference 
price) for furniture advertised in the area was 40.296. Retail 
store personnel and prospective customers were also shown 
copies of the test advertisements. We confirmed that the ref- 
erence prices were credible for the products being sold (see 
Compeau, Grewal, and Chandrashekaran 2002) and that the 
ad content and layout were representative of what was com- 
mon to newspaper furniture advertisements in the area. 


Procedure 


Furniture shoppers were intercepted entering four different 
outlets of a well-known, moderately priced, regional furni- 
ture store chain in a large city in the Southwest. Bedroom 


*This pricing procedure also makes theoretical sense. By setting 
the advertised reference prices at the upper end of the most com- 
mon purchase price range, we maximized the likelihood that the 
prices would influence consumers’ internal reference prices, which 
are often conceptualized as expected or normal market prices or 
willingness to pay (see, e.g., Chandrashekaran 2004). 


sets and sofas constitute approximately 60% of the retailer's 
unit sales. 

Shoppers were first asked to participate in a research 
study that was being jointly conducted by a local university 
and a (fictitious) local advertising association. Participants 
verbally responded to three questions, including what furni- 
ture they were shopping for that day. Those who answered 
either bedroom sets or sofas (but not both) were asked to 
participate in an advertising test. Participants were told they 
would examine print advertisements scheduled to be run the 
next day in an area newspaper. Those who agreed were 
handed a portfolio of seven professionally produced black- 
and-white print advertisements (8 x 10 inch reproductions 
of newspaper advertisements) and were verbally asked to 
look through them as they normally would.® After partici- 
pants examined the advertisements, they filled out a ques- 
tionnaire and were debriefed. No participant compensation 
was provided. 


Experimental Manipulatlons and Design 


We manipulated product involvement using the portfolio of 
seven advertisements to which participants were exposed. 
Six advertisements were filler advertisements for products 
or services unrelated to furniture. The fourth (target) adver- 
tisement was for a fictitious store (Furniture Direct) that 
sold either bedroom sets or sofas. Half of the consumers 
shopping for bedroom sets were given a portfolio with the 
advertisement for the bedroom set in the fourth position 
(high-involvement condition), and half were given the 
advertisement for the sofa in the fourth position (low- 
involvement condition). Likewise, half of the consumers 
shopping for sofas were given a portfolio with the sofa 
advertisement in the fourth position (high-involvement con- 
dition), and half were given the advertisement for the bed- 
room set in the fourth position (low-involvement condition). 
sawyer and Howard (1991; see also Petty, Cacioppo, and 
Schumann 1983) use a similar involvement manipulation, in 
which different product advertisements alternatively served 
as the high- versus low-processing targets for people mak- 
ing a choice (versus not making a choice) for that product. 
This manipulation provides maximum design efficiency 
because it also allows a test of whether the results general- 
ize across different products. 

The two product test advertisements were similarly 
designed. The top (which stated the store name, Furniture 
Direct), subheadline (which stated, ““We have more choices 
than anyone in town"), bottom (which provided a geo- 
graphic address, a Web address, and a telephone number), 
and left-hand column (which provided financing-related 
information) of the advertisements were identical. However, 
one advertisement had a headline that asked, "Looking for a 


6We used black-and-white test advertisements on the basis of 
our ad review, which revealed that a majority (72.296) of limited- 
time and reference price advertisements were black and white. 
This percentage was about the same for furniture advertisements. 
For consistency with the test advertisements, all contro] advertise- 
ments were also in black and white. 
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Bedroom Set?” and the other asked, “Looking for a Sofa?” 
Under the subheadline was either a photo of a five-piece 
bedroom set or photos of two sofas. Brief descriptions were 
provided for both products. Under the second sofa photo 
was the statement, “Take Your Pick—Either Sofa!" and 
under the bedroom set picture was the statement, “Five 
Piece Bedroom Set.” There was additional information pre- 
sented both under and above these statements that manipu- 
lated the presence of a “deal.” 

In the “no-deal” condition, the bedroom set was priced 
at “Now: $1725,” and the sofas were priced at “Now: $734.” 
In the limited-time condition, the words “Three Days 
Only!” were displayed above the statements “Five Piece 
Bedroom Set” or “Take Your Pick—Either Sofa.” Again, the 


products were priced either “Now: $1725” (bedroom set) or - 


“Now: $734” (sofas). In the reference price conditions, 
‘Regularly $2876” (bedroom set) or “Regularly $1223” 
(sofas) was stated to the left of the current (“Now”) prices. 
The combined condition was defined by the presence of 
both the reference price and the limited-time information 
(see Figure 1).7 

We used a 2 (involvement: high versus low) x 4 (deal: 
no deal versus limited-time availability versus reference 
price versus combined) x 2 (product: bedroom set versus 
sofa) between-subjects design. A sample of 160 consumers 
participated in the experiment. 


Measures 


In the opinion book, participants were informed that differ- 
ent people were being asked questions about different 
advertisements from the portfolio. They were being asked 
for their opinions about the advertisement for the bedroom 
set or sofas. After opening the opinion book, participants 
were not permitted to review the advertisements again. 

The dependent variable was consumer attitude toward 
the advertised product price.8 Participants were asked, “I 
personally feel that the advertised selling price for the bed- 
room set/sofas is,...” and they responded on seven-point 
scales anchored by “good/bad,” “undesirable/desirable,” 
and "satisfactory/unsatisfactory." We performed a check on 
the involvement manipulation using two items. Participants 
were asked, “I found the advertisement for the bedroom set/ 
sofas to be,...” and they responded on seven-point scales 
anchored by “personally involving/not personally involv- 
ing” and “not relevant to me/relevant to me” (see Chaiken 
and Maheswaran 1994; Petty and Cacioppo 1979; 
Zaichkowsky 1985). All scale items were coded so that 
higher scores indicated more favorable price attitudes and 
greater involvement/relevance. Finally, we used three open- 
ended items to check for the possibility of hypothesis 
guessing and demand artifacts. These items revealed no 
problems in any of the experiments, and thus we do not dis- 


7In all conditions across experiments, the advertised selling 
price ("Now") was presented. Only ın the reference price and 
combined conditions was the higher reference price also included 
in the advertisements. 

8This dependent variable is conceptually similar to “attitude 
toward the offer" used in previous studies (Bobinski, Cox, and 
Cox 1996; Lichtenstein and Bearden 1989). 
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cuss them further. We performed multiple comparisons 
using t-tests, with critical value adjustments based on the 
Bonferroni procedure (Rosenthal and Rosnow 1985; Rosen- 
thal and Rubin 1984). 


Manipulation Checks 


We first performed a check of the involvement manipula- 
tion. Analysis of variance (ANOVA) results for both items 
revealed an involvement main effect. Consumers who were 
shopping for a bedroom set (sofa) rated advertisements for 
the respective products as more personally involving (M = 
6.19; F(1, 144) = 179.47, p < .0001; i? = .54) and relevant 
(M = 5.65; F(1, 144) = 88.66, p « .0001; 112 = .36) than 
those who were shopping for a sofa (bedroom set) (person- 
ally involving: M = 4.43; relevant to me: M = 4.36). 


Price Persuaslon Hesults 


We found that the three items measuring attitude toward the 
advertised product price reliably measured the same under- 
lying construct; we summed them (œ = .90). We found a 
significant main effect of deal (F(3, 144) = 21.95, p < 
.0001; 112 = .24), in which price attitudes were less favor- 
able in the no-deal condition than in any other condition. 
However, this was qualified by a significant interaction 
between involvement and deal (F(3, 144) = 16.53, p < 
.0001; 112 = .18). Figure 2 plots the results of the involve- 
ment x deal interaction for Study 1. 

Under low-involvement conditions, price attitudes were 
significantly higher for the combined treatment (M = 15.3) 
than for the reference price condition (M = 13.05; t = 2.63, 
p « .05). The mean for the limited-time condition (M = 
16.0) was also more favorable than it was for the reference 
price treatment (t = 3.45, p « .005). Attitudes in the refer- 
ence price treatment were identical to those in the no-deal 
condition (M = 13.05). 

Under high-involvement conditions, the most favorable 
price attitudes were for the combined condition (M = 18.5), 
which were significantly higher (t = 2.04, p « .05, one- 
tailed test) than the reference price treatment (M = 16.85). 
Participants in the reference price condition perceived the 
advertised price more favorably (t = 4.15, p « .0002) than 
those who were exposed to the limited-time treatment (M = 
13.3), who, in turn, judged the price more favorably (t — 
2.22, p « .05, one-tailed test) than those who examined the 
advertisement with no deal (M = 11.5). 


Discussion 


The results of Study 1 indicate that different deal informa- 
tion in advertisements is effective depending on whether 
consumers are or are not shopping for a particular product. 
When consumers are not shopping for a product, limited- 
time availability affects price perceptions, but reference 
pricing does not. Both the limited-time and the combined 
treatments resulted in more favorable price attitudes than 
the reference price condition, which did not differ signifi- 
cantly from the control (no-deal) condition. This is 
explained by viewing limited-time availability as a learned, 
value-oriented heuristic that is easily remembered and inter- 
preted with respect to an advertised price. Limited time has 
utility in cuing value associations under low-involvement 


FIGURE 1 
Reference Price Advertisement with Limited-Time Appeal 
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conditions. However, the value implications inherent in ref- and the no-deal conditions is consistent with the position 
erence pricing require a level of cognitive processing that that low-involvement consumers do not meaningfully 
exceeds what low-involvement consumers typically expend. process the reference price information. Importantly, the 
The equality of the attitudinal means for the reference price lack of a difference between the reference price and the no- 
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FIGURE 2 
Involvement x Deal Interaction In Study 1 
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deal conditions further indicates that the mere presence of 
an advertised price reduction in the former treatment did 
not significantly affect price perceptions. Such a result 
might have occurred if simply the presence of a price reduc- 
tion, in and of itself, served as an effective heuristic cue sig- 
naling a deal. 

Under high-involvement conditions, consumer price 
attitudes in the combined condition were more favorable 
than the use of either reference prices or limited-time avail- 
ability alone, which indicates that both factors jointly con- 
tributed to price persuasion. Given that the value indicated 
by the limited-time cue was directionally congruent with 
the reference prices, there was an additive effect on atti- 
tudes (also see Chaiken, Lieberman, and Eagly 1989). The 
finding that reference pricing had a stronger effect on atti- 
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High 


tudes than limited-time availability under high-involvement 
conditions indicates that consumers judged the former 
information as more salient or relevant than the latter when 
forming their attitudes toward the product prices. Such a 
judgment appears rational in that reference pricing provides 
information that directly (versus indirectly) addresses price. 
Thus, under high-involvement conditions, reference price 
information is more diagnostic of price value than limited- 
time information. 

Finally, we did not obtain significant product effects 
(either main or interaction). This consistency in the results 
is notable given the different prices of the two furniture 
items. In summary, the results of Study 1 support the view 
that limited-time availability signals value when consumers 
are either shopping or not shopping for a product. Refer- 


ence prices in advertisements are meaningfully processed 
only by consumers who are shopping for a particular 
product. | 


Study 2 


Hypothesis 


We conducted Study 2 to provide a further test of the posi- 
tion that the lack (presence) of respondent motivation to 
process reference prices in an advertisement accounts for 
the differential impact of that tactic in low- (high-) involve- 
ment conditions. To test this, we manipulated the use of 
"percentage-off" statements, which summarized the mark- 
down size represented by the reference prices. For example, 
an advertisement that states, “Regularly $1223; Now: $734" 
would also state “40% Off" next to the prices. Although this 
practice is not common (our content analysis indicated that 
it is used in 20.096 of the advertisements that employ refer- 
ence prices), it represents a potentially advantageous one. If 
the size of a markdown represented by the reference prices 
is salient to low-involvement consumers but requires too 
much effort for them to process, providing the percentage 
off in an advertisement should result in a significant 
increase in perceived favorability of the price. Conversely, 
for high-involvement consumers, who thoughtfully weigh 
and consider the information in an advertisement, providing 
a markdown summary would be unlikely to increase price 
perceptions favorably. These consumers are expected to 
derive the value implications of the price markdown on 
their own. We hypothesize that providing the size of a 
markdown in a reference price advertisement will increase 
price persuasion for low-involvement consumers but not for 
high-involvement consumers. 

study 2 investigates an additional dependent measure to 
broaden the domain of effects: consumer intentions to shop 
at the retail store that sponsored the advertisement (1.e., Fur- 
niture Direct). We added this element to test whether the 
independent variables are effective at bringing consumers 
into the market (shopping) for the advertised products, and 
it responds to the call to broaden reference price research by 
considering store patronage intentions (Mazumdar, Raj, and 
Sinha 2005). As a prerequisite for purchasing the goods, 
consumers must first be willing to shop at the store at which 
the products are sold. We anticipated the same effects for 
both the price persuasion and the store sbopping intention 
measures. There are two bases for this expectation. First, 
given what is known about the attitude-behavioral inten- 
tions relationship (e.g., Fishbein and Ajzen 1975), theoreti- 
cally, we expected a similar pattern of mean effects. Sec- 
ond, in the current experimental context, if consumers 
associate the price deal in the advertisement with the spon- 
soring retail store, favorable shopping intentions should 
result. Thus, we anticipate that the effects of involvement 
and deal on price attitudes and shopping intentions will be 
similar: 


H3: Providing reference price markdown size favorably affects 
price attitudes and shopping intentions for low- 
involvement but not for high-involvement consumers. 


We tested this hypothesis using an expanded version of the 
design implemented in Study 1. 


Procedure, Manipulations, and Measures 


We used the same procedure and involvement manipulation 
as in Study 1 with the same products (bedroom set and 
sofa). We also manipulated the deal variable as in Study 1, 
except that in certain conditions, the percentage off (“40% 
Off’) was prominently stated (inside a shaded circle) to the 
left of the prices (see Figure 3). We restricted the conditions 
in which we used the percentage-off manipulations to those 
necessary for hypothesis testing. We used percentage-off 
statements in the reference pricing (called "reference price" 
versus "reference percentage off") and combined (called 
"combined" versus "combined percentage off") treatments. 

We implemented a 2 (involvement: high versus low) x 6 
(deal: no deal versus limited-time availability versus refer- 
ence price versus reference percentage off versus combined 
versus combined percentage off) x 2 (product: bedroom set 
versus sofa) between-subjects design. A sample of 240 con- 
sumers participated in the experiment. 

We used the price attitude items used in Study 1, fol- 
lowed by measures of store shopping intentions. Consumers 
were asked, “Think for a moment about the distance you 
traveled to get to this [retail furniture] store. If a Furniture 
Direct store were located about the same distance from 
where you live, how likely or unlikely would you be to shop 
at that Furniture Direct store?"? We assessed consumers’ 
responses on two seven-point scales anchored by "very 
likely/very unlikely" and “not possible/very possible." 


Manipulation Checks 


As in Study 1, the ANOVA results for both manipulation 
check items revealed an involvement main effect. Con- 
sumers who were shopping for a bedroom set or sofa rated 
the respective advertisements for those products as more 
personally involving (M = 6.36; F(1, 216) = 296.84, p « 
.0001; 112 = .55) and relevant (M = 5.69; F(1, 216) = 88.58, 
p < .0001; 11? = .28) than those who were not shopping for 
these products (personally involving: M - 4.38; relevant to 
me: M z 4.19). 


Price Persuasion Results 


The three price attitude items reliably measured the same 
construct, and we summed them (a = .87). The ANOVA 
results revealed a main effect of deal (F(5, 216) = 14.06, p « 
.0001; 1? = .21), in which participants exposed to a deal 
were more favorable toward the offer price than those who 
were not exposed. Again, this was qualified by a significant 
interaction between involvement and deal (F(5, 216) = 6.81, 
p < .0001; 2? = .10). Figure 4 shows the results of the 
involvement x deal interaction for Study 2. 


9The wording of this item was necessary to control for distance 
effects on shopping intentions because the four stores used for 
shopper intercepts were located in different parts of the city and 
surrounding areas. 
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FIGURE 3 
Reference Price Advertisement with Percentage-Off Summary and Limited-Time Appeal 
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Under low-involvement conditions, the combined .005) than the combined group (M = 14.45). Price attitudes 
percentage-off group reported the most favorable price atti- for the reference percentage-off treatment (M = 15.2) were 
tudes (M = 17.2); this was significantly higher (t = 3.10, p « also higher (t = 2.53, p « .05) than the reference price group 
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' FIGURE 4 
Involvement x Deal Interaction in Study 2 
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(M = 12.95). The limited-time availability treatment (M = 
15.1) resulted in more favorable price attitudes (t = 2.42, 
p « .05) than the reference price treatment, which did not 
significantly differ from the no-deal treatment (M = 12.8; 
t « 1). Across levels of the percentage-off manipulation, the 
combined treatment resulted in more favorable attitudes 
(M = 15.83) than the reference price treatment (M = 14.08; 
t = 2.79, p < .05). | 

Under high-involvement conditions, participants who 
were exposed to the combined treatment reported the most 
favorable price attitude (M — 17.8), though they were not 
reliably different (t = 1.63, p > .10) from those in the com- 
bined percentage-off condition (M = 16.35). The attitudes 
of those in the reference price condition (M = 15.95) were 
also not reliably different from those in the reference 
percentage-off treatment (M = 14.55; t = 1.58, p > .10), 
though the reference price treatment was more favorable 


than the limited-time condition (M = 13.25; t = 3.04, p « 
.005). Consumers in the limited-time condition had more 
favorable price attitudes than those who received the no- 
deal treatment (M = 11.25; t = 2.25, p « .05, one-tailed test). 
Finally, across levels of the percentage-off manipulation, 
the combined treatment resulted in more favorable price 
attitudes (M = 17.08) than the reference price condition 
(M = 15.25; t= 2.91, p < .01). 


Store Shopping Intentlons Results 


The two items measuring store shopping intentions reliably 
measured the same construct (œ = .85); we recoded and 
summed them, with higher scores indicating greater store 
shopping intentions. The ANOVA results closely mirrored 
those we obtained with the price persuasion measure. There 
was a main effect of involvement (F(1, 216) = 11.02, p < 
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.001; 72 = .03), such that high-involvement consumers were 
more likely to shop at Furniture Direct than low- 
involvement customers. There was also a main effect of 
deal (F(5, 216) = 14.93, p < .0001; 112 = .22), such that par- 
ticipants exposed to an advertisement containing a deal 
were more likely to shop at Furniture Direct than those in 
the no-deal condition. However, these main effects were 
qualified by a significant involvement x deal interaction 
(F(5, 216) = 5.91, p < .0001; 12 = .09). 

Under low-involvement conditions, store shopping 
intentions for the combined percentage-off group (M = 
11.7) were greater than those for the combined condition 
(M = 9.45; t = 3.45, p « .003). Similarly, the reference 
percentage-off group had a higher mean (M - 10.9) than the 
reference price condition (M = 7.7; t = 4.91, p < .0001). 
Participants who were exposed to a limited-time advertise- 
ment were more likely to indicate favorable store shopping 
intentions (M = 9.75) than those in the reference price treat- 
ment (t = 3.15, p « .005), which did not differ from the no- 
deal condition (M = 7.8; t < 1). Across levels of the 
percentage-off manipulation, the combined treatment had 
higher store shopping intentions (M = 10.58) than those in 
the reference price condition (M = 9.3; t = 2.78, p « .01). 

Under high-involvement conditions, we found no sig- 
nificant difference between the combined percentage-off 
group (M z 11.6) and the combined condition (M - 12.05; 
t « 1). There was no difference between the reference 
percentage-off treatment (M = 10.25) and the reference 
price group (M = 10.8; t « 1). However, participants 
exposed to reference prices evidenced greater store shop- 
ping intentions than those in the limited-time treatment 
(M = 9.5; t = 1.99, p « .05, one-tailed test). The limited- 
time treatment resulted in marginally higher shopping 
intentions than for those in the no-deal condition (M = 8.4; 
t = 1.69, p « .10, one-tailed test). Summed over the 
percentage-off manipulation, the combined condition had 
greater store shopping intentions (M = 11.83) than the ref- 
erence price treatment (M = 10.53; t = 2.82, p « .01).10 





10At the request of a reviewer, we collected additional data to 
provide mediational evidence of the high-involvement reference 
pricing effect we observed 1n Studies 1 and 2. We compared prior 
search/price knowledge versus ad price processing explanations as 
potential mediators. We implemented a 2 (involvement: high ver- 
sus low) x 2 (deal: reference price versus none) x 2 (product: bed- 
room set versus sofas) design using 80 furniture shoppers. We 
used the same dependent measures as in Studies 1 and 2 (price 
attitude: o& = .83; shopping intentions: œ = .81). We obtained two 
measures of prior search/price knowledge. Respondents were 
asked, “Within the past 30 days, how many stores have you 
shopped at that have sold bedroom sets (sofas)?" (coded 0—5) and 
"How knowledgeable or unknowledgeable are you regarding the 
prices of bedroom sets (sofas)?” (measured on scales anchored by 
"very knowledgeable/not knowledgeable"). We also collected two 
measures of ad price processing. Respondents were asked, “How 
much effort did you devote to thinking about the price information 
presented in the advertisement for the bedroom set (sofas)?" (mea- 
sured on scales anchored by “high effort/low effort") and “I found 
the prices 1n the advertisement for the bedroom set (sofas) to 
be..." (measured on scales anchored by “not thought 
provoking/thought provoking”). All scaled 1tems used seven-point 
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Discussion 


Study 2 extends the findings we obtained in Study 1. The 
results indicate that summarizing reference price informa- 
tion in an advertisement has different effects depending on 
whether consumers are shopping for a product. 

Under low-involvement conditions, attitudes and store 
shopping intentions for both the combined and the refer- 
ence price treatments with the percentage-off statement 
were more favorable than the same respective treatments 
without the percentage-off statement. These findings are 
consistent with the position that providing summary infor- 
mation eases the processing requirements for comprehend- 
ing the value implications of reference prices in advertise- 
ments. These results also replicate Chandrashekaran’s 
(2004) and Chandrashekaran and Grewal’s (2003) findings 
that reference prices with summary information can have a 
persuasive effect in low-involvement conditions. The lack 
of a significant difference in price attitudes and store shop- 
ping intentions for the low-involvement reference price and 
no-deal conditions indicates that without assistance, con- 
sumers who are not shopping for a product do not compre- 
hend the value implications of reference prices; this is con- 
sistent with Study 1. However, the more favorable price 
attitudes and store shopping intentions for the combined 
and limited-time treatments than for the reference price 
condition indicate that limited-time availability has an 
effect on persuasion under low-involvement conditions. 

A different pattern emerged under high-involvement 
conditions. When consumers were shopping for a product, 
we observed no reliable difference between exposure to 
either the combined or the reference price treatments and 





scales. The two ad price processing items reliably measured the 
same underlying construct (X = 87); we summed them to form an 
ad price processing index. We first examined basic ANOVA 
effects. We found main effects of involvement and deal for both 
price attitude and store shopping intentions. More important, the 
critical involvement x deal interaction was also significant for both 
price attitude (F(1, 72) = 18.57, p < .0001; 11? =.16) and store 
shopping intentions (F(1, 72) = 10.02, p < .05; 1? = .08). The pat- 
tern of mean scores in these interactions mirrored previous find- 
ings (i.e., a differentiation between the two deal conditions for 
high-involvement but not for low-involvement respondents). We 
found main effects of involvement for the number of stores 
(F(1, 72) = 16.28, p < .0001; rj? = .18), price knowledge 
(FC, 72) = 19.22, p < .0001; 11? = .21), and the ad price processing 
index (F(1, 72) = 82.09, p < .0001; 1? = .48), with higher scores 
present in the high-involvement condition. We also found an 
involvement x deal interaction for the ad price processing index 
(F(1, 72) = 6.56, p < .01; 1? = .04). We examined mediation of the 
interaction effect using Baron and Kenny's (1986) methodology. 
Neither the number of stores nor self-reported price knowledge 
was a significant or marginally significant mediator of the attitudi- 
nal or store shopping intention effects. However, the ad price pro- 
cessing index was a significant mediator of both (price attitude: 
F(1, 71) = 6.66, p < .01; 1? = .05; store shopping intentions: 
F(1, 71) = 5.16, p < .03; 11? = .04). The ad processing index par- 
tally mediated the deal x involvement interaction for price attitude 
(F(1, 71) = 12.51, p < .001; 1? = .10) and store shopping inten- 
tions (F(1, 71) = 6.05, p < .02; 7]? = .05). These results support the 
position that scrutiny of price information presented 1n the adver- 
tisements by high-involvement respondents underlies the reference 
price effect we observed in Studies 1 and 2. 


the same respective advertisements that included the per- 
centage off. The price attitude and shopping intention 
means for the latter conditions were actually lower (less 
favorable) than those we obtained for treatments that did 
not summarize the markdown. Although the differences 
were not statistically significant at traditional levels, these 
results are conceptually congruent with the position that 
summarizing a benefit or conclusion for consumers who 
can be expected to process information actively on their 
own detracts from persuasion. Sawyer and Howard (1991) 
find that it is more persuasive for highly involved con- 
sumers to come to their own conclusions about the implica- 
tions of a message. They argue that leaving an advertise- 
ment “open” (not providing a conclusion) causes central ad 
arguments to be processed more thoroughly, and resulting 
attitudes become more positive than in a condition in which 
conclusions are explicitly provided. 

Consistent with Study 1, limited-time availability and 
reference pricing jointly contribute to price persuasion 
under high-involvement conditions in Study 2. The persua- 
sive effect of limited-time availability under high- 
involvement conditions also replicates the Study 1 results. 
Finally, and also consistent with Study 1, we obtained no 
significant effects of the product variable. 


Study 3 


Hypothesis 

We conducted Study 3 to examine informational and cue 
redundancy in a retail reference price advertisement under 
different involvement conditions. Despite the observation 
that multiple cues are common in many persuasion settings, 
limited research exists on their effectiveness (cf. Kalyanara- 
man and Oliver 2001). Study 3 provides insights into the 
operation of reference pricing and limited-time availability 
in retail advertisements by examining their effects when 
used with another value cue: the word “sale.” 

The suggestion that the word “sale” has acquired cue- 
related properties is not surprising given that its use is asso- 
ciated with the other persuasion techniques we examine 
herein. Our content analysis found that the word “sale” and 
related synonyms were used in 47.0% of advertisements 
with limited-time offers and in 69.3% of advertisements 
featuring reference pricing. Similar to the limited-time cue, 
we argue that the word “sale,” through learned associations 
with advertised deals, has a favorable affect on price per- 
suasion. Anderson and Simester’s (2003) finding that sim- 
ply displaying the word “sale” next to products in a store 
could increase sales up to 50% highlights its attention- 
grabbing ability, and it is consistent with the idea of auto- 
matic responding that typifies cue-related behaviors (Cial- 
dini 2001). 

Consistent with dual-process theories of persuasion 
(Chaiken 1987; Chen and Chaiken 1999; Petty and 
Cacioppo 1986), our contention has been that limited-time 
availability in retail advertisements operates as a peripheral 
or heuristic cue that facilitates persuasion under low- 
involvement conditions. However, under high-involvement 
conditions, limited-time availability is processed like other 


available information and is evaluated on its merits. If this 
is true, pairing the word “sale” with limited-time availabil- 
ity in an advertisement should result in no additional effect 
under high-involvement conditions, but it should contribute 
to persuasion under low-involvement conditions. We expect 
to find this because consumers who cogently process infor- 
mation in an advertisement would be more likely to deduce 
that limited-time prices are sales. Thus, when people exam- 
ine an advertisement for a product that they are actively 
considering purchasing (1.e., under high-involvement condi- 
tions), the word "sale" is redundant information that should 
lack an effect, given the existence of a “Three days only!” 
price reduction. 

However, redundancy between two cues should not 
impede the persuasive effects of either one for consumers 
who do not perceive them as redundant. We expect that rec- 
ognizing the redundancy shared by “Three days only!” and 
the word “sale” will require a level of thinking that exceeds 
what is expended by consumers who view an advertisement 
for a product that does not match their current needs or 
interests (1.e., under low-involvement conditions). When 
advertisements try to communicate with people who are not 
shopping for a good, we believe that there is likely to be 
utility in stating what should be obvious because most con- 
sumers will not bother connecting the idea of the “Three 
days only!" message with that of the word "sale" unless the 
connection is made for them. In our view, leading con- 
sumers to a conclusion and then telling them what that con- 
clusion is should have a persuasive impact in low- 
involvement contexts. 

Finally, what would be the expectation when the word 
“sale” is used without being coupled with the limited-time 
treatment (i.e., when it is used alone with reference price 
information)? As a cue, it should add to persuasion under 
low-involvement conditions and should be evaluated on its 
merits under high-involvement conditions. However, we do 
not expect that the merits of a sale announcement under 
high-involvement conditions will have the same effect as 
announcing a limited-time event, given that the latter pro- 
vides explicit parameters that define unavailability, whereas 
the former does not. In turn, the implied demand for a prod- 
uct associated with limited-time availability (Inman, Peter, 
and Raghubir 1997) and its corresponding market valuation 
utility should also be arguably stronger with limited-time 
availability than with sale announcements. Finally, the gen- 
eral idea of a sale from a consumer's perspective, though 
not the particulars of a limited-time event, is likely to be 
redundant with information logically obtained by consider- 
ing reference price information under high-involvement 
conditions. That is, most consumers would interpret a price 
markdown as a sale. 

In summary, we hypothesize that pairing the word 
“sale” with limited-time availability and reference prices in 
advertisements will increase price persuasion and store 
shopping intentions for low-involvement consumers but not 
for high-involvement consumers. 


H4: Sale announcements in advertisements with limited-time 
availability and reference price appeals favorably affect 
price attitudes and store shopping intentions for low- 
involvement but not high-involvement consumers. 
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Procedure and Manipulations 


We used the same procedure as in Studies 1 and 2. We 
manipulated involvement using the same products (bed- 
room set and sofa). We defined the deal variable with six 
treatment levels. The first three levels were the reference 
price, limited-time availability, and combined treatments, as 
we defined them in Studies 1 and 2. The next three levels 
were identical to the first three, except that the word “‘sale” 
was prominently highlighted (in a shaded circle) to the left 
of the pricing information in the advertisements (see Figure 
5). We did not use the no-deal condition in Study 3 for cell 
size reasons. 

We implemented a 2 (involvement: high versus low) x 6 
(deal: limited time versus reference price versus combined 
versus limited-time sale versus reference price sale versus 
combined sale) x 2 (product: bedroom set versus sofa) 
between-subjects design. A sample of 240 consumers par- 
ticipated in the experiment. We used the same two depen- 
dent measures as in Study 2. 


Manipulation Checks 


The ANOVA results for both manipulation check items 
revealed a main effect of involvement. Participants who 
were shopping for a bedroom set or sofa rated the corre- 
sponding advertisements more personally involving (M = 
5.87; F(1, 216) = 264.58, p < .00001; 11? = .69) and relevant 
(M = 5.83; F(1, 216) = 153.13; p « .0001; 112 = .57) than 
those who were shopping for the alternative product (sofa 
or bedroom set; personally involving: M = 4.28; relevant to 
me: M = 4.10). 


Price Persuasion Results 


Again, the three price persuasion items reliably measured 
the same construct, and we summed them to form an index 
(œ = .83). The ANOVA results revealed main effects of 
involvement (F(1, 216) = 4.52, p « .04; 11? = .01) and deal 
(F(5, 216) = 9.62, p < .0001; 1? = .15), qualified by a sig- 
nificant interaction between them (F(2, 216) = 945, p < 
.0001; 11? = .15). 

Under low-involvement conditions, pairing reference 
prices with the word “sale” (M = 15.05) was more persua- 
sive than the use of reference prices without the word "sale" 
(M = 12.25; t = 3.78, p « .0005). Similarly, pairing limited- 
time availability with the word "sale" was more persuasive 
(M = 17.1) than its use without the word “sale” (M = 15.2; 
t = 2.57, p < .005). Combining limited-time availability and 
reference price with the word "sale" was more persuasive 
(M = 16.6) than their use without the word "sale" (M = 
14.95; t = 2.23, p < .05). Across levels of the sale variable, 
price attitudes for the combined condition (M = 15.78) were 
greater than those for the reference price treatment (M = 
13.65; t = 4.10, p < .0002). Collapsed across levels of the 
sale variable, price attitudes in the limited-time availability 
condition (M = 16.15) were also more favorable than those 
in the reference price condition (t = 4.81, p « .0001). 

Under high-involvement conditions, pairing limited 
time with the word "sale" did not result in more favorable 
price perceptions (M = 14.3) than its use without the word 
sale (M = 14.05; t « 1). Similarly, pairing reference prices 
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with the word “sale” (M = 15.95) was not more favorable 
than its use without the word “sale” (M = 15.55; t « 1). The 
combined use of both techniques with the word “sale” (M = 
17.7) was not more favorable than their use without the 
word "sale" (M = 17.45; t « 1). Across sale levels, the use 
of reference prices resulted in more favorable price attitudes 
(M = 15.75) than the use of limited-time availability (M = 
14.18; t = 3.02, p « .05). Across sale levels, the combined 
reference price and limited-time condition resulted in more 
favorable price attitudes (M — 17.58) than the use of refer- 
ence pricing by itself (M = 15.75; t = 2.47, p < .01). Figure 
6 shows the results of the involvement x deal interaction. 


Store Shopping Intentions Results 


The two store shopping intentions items reliably measured 
the same underlying construct, and we summed them (œQ = 
.80). The results paralleled those for price persuasion. Main 
effects of involvement (F(1, 216) = 23.72, p < .0001; 2 = 
.06 ) and deal (F(5, 216) = 18.26, p < .0001; 72 = .23) were 
qualified by a significant interaction between those 
variables (F(5, 216) = 9.84, p < .0001; 712 = .13). 

Under low-involvement conditions, when limited-time 
availability was advertised as a sale (M = 10.45), it was 
more persuasive than when it was advertised without the 
word "sale" (M = 8.75; t = 3.47, p « .0005). The same was 
true for the reference price (with sale — 8.6, without sale — 
7.45; t = 2.35, p « .05) and combined (with sale = 11.0, 
without sale = 9.95; t = 2.14, p « .05) conditions. Collapsed 
across levels of sale, participants who were exposed to ref- 
erence prices were less inclined to shop at Furniture Direct 
(M = 8.03) than those who were exposed to the limited-time 
treatment (M = 9.6; t = 4.53, p « .0001). Across sale levels, 
combining reference prices and limited-time availability 
resulted in greater store shopping intentions (M = 10.48) 
than reference prices by themselves (M = 8.03; t= 5.00, p < 
.000001). 

Under high-involvement conditions, there was no sig- 
nificant difference in the persuasiveness of limited-time 
appeals when it was (M = 8.90) or was not (M = 8.95) 
advertised with the word “sale” (t « 1). The same results 
held for the reference price (with the word “sale” = 10.3, 
without the word “sale” = 10.45; t < 1) and the combined 
(with the word “sale” = 12.05, without the word “sale” = 
11.40; t = 1.33, p > .10) treatments. Across sale levels, ref- 
erence pricing was more persuasive (M = 10.38) than 
limited-time availability (M = 8.93; t = 4.14, p < .0001) on 
store shopping intentions. Across sale levels, the combined 
condition resulted in greater store shopping intentions (M = 
11.73) than the reference price condition (M = 10.38; t = 
2.69, p < .01). 


Discussion 


Study 3’s results demonstrate that under low-involvement 
conditions, advertised announcements of a sale and a lim- 
ited time to take advantage of a lower price independently 
contribute to favorable consumer price perceptions and 
higher store shopping intentions. This can be explained by 
noting that though sales, by definition, are supposed to be 
limited-time events, when consumers are perusing adver- 
tisements without personal interest, they do not think at a 


FIGURE 5 
Limited-Time Sale Advertisement 
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level deep enough to make the connection. Thus, value cues Under high-involvement conditions, however, the value 
that are otherwise logically redundant can independently implications of the word “sale” are no different from those 
contribute to persuasion under low-involvement conditions. of a limited-time event (or reference prices). These findings 
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FIGURE 6 
Involvement x Deal Interaction In Study 3 
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do not suggest that the word "sale" has no utility under 
high-involvement conditions. Rather, when consumers 
cogently process ad-related information, sale announce- 
ments contribute nothing unique to price perceptions or 
shopping intentions that these consumers do not already 
attain when they consider limited-time availability and ref- 
erence price information. Finally, this study replicates Stud- 
ies 1 and 2 in terms of effect (limited-time availability and 
reference pricing) differentiation across levels of consumer 
involvement and consistency of the results across 
products. 11 


lAs a reviewer requested, we gathered additional data to exam- 
ine whether low-involvement furniture shoppers versus nonfurni- 
ture shoppers would respond differently to the limited-time + sale 
appeals. Using the same method we previously discussed, we 
exposed low-involvement furniture (N = 40) and nonfurniture (N = 
40; electronics products) shoppers to either the bedroom set or the 
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General Discussion 


Are limited-time availability and sale cues effective in 
influencing consumer attitudes toward product prices in 
retail newspaper advertisements that feature reference pric- 


sofa advertisements across the limited-time + sale versus the no- 
deal conditions. We implemented a 2 (shopper: furniture versus 
electronics) x 2 (deal: limited time -- sale versus no deal) x 2 
(product: bedroom set versus sofa) between-subjects design. We 
examined price attitude (a = .85) and store shopping intentions 
(a = .80) Only the effect of deal was significant for attitude 
(F(1, 72) = 35.24, p < .0001; 112 = .33) and store shopping inten- 
tions (F(1, 72) = 19.18, p < .0001; 1? = .21), and the limited- 
time + sale advertisement resulted in more favorable price atti- 
tudes and shopping intentions at the designated store than 
exposure to the no-deal condition. The lack of interaction effects 
indicates that this finding is statistically consistent for furniture 
and nonfurniture shoppers across the advertised products. 


ing? Can they also draw people into stores? If it is assumed 
that retailer advertising practices represent what “works” in 
the marketplace, the answer is yes according to our content 
analysis of 13,594 retail advertisements. This analysis 
revealed that 87.2% of retail advertisements featuring refer- 
ence pricing also included limited-time availability or sale 
announcements in the headline or copy. However, the 
results of three field experiments across two different prod- 
ucts consistently demonstrate significant variability in the 
effectiveness of these techniques, depending on whether 
consumers are or are not shopping for an advertised 
product. 


Substantive Findings 


Using dual-process theories of persuasion (the elaboration 
likelihood model and the heuristic-systematic model) as a 
foundation, we have specified the conditions under which 
these techniques result in more favorable price attitudes and 
store shopping intentions and when they do not. Under low- 
involvement conditions (when consumers are not actively 
shopping for a product), a limited-time price reduction or a 
sale announcement affects price perceptions and shopping 
intentions, but the size of the reduction, represented by ref- 
erence prices, does not. Under high-involvement conditions 
(when consumers are actively shopping for a product), both 
reference prices and limited-time information favorably 
affect price perceptions and shopping intentions, though 
additional explanatory power is not achieved ugh the 
use of sale announcements. | 

The reference price results can be understood by recog- 
nizing that comprehension of the value implications of 
those prices requires a level of processing ,effort that 
- exceeds (does not exceed) the level that people typically 
expend when viewing advertisements for products that do 
not have (have) immediate relevance to them. Mediational 
findings support the position that more diligent processing 
of prices presented in the advertisements, as opposed to 
prior store search or product price knowledge, explains the 
favorable effect of reference pricing under high- 
involvement conditions. The effects of limited-time avail- 
ability and sale announcements can be understood in terms 
of their learned heuristic value under low-involvement con- 
ditions and their informational merit under-high involve- 
ment conditions. Under high-involvement conditions, the 
implications of a sale are redundant with the information 
derived through a careful consideration of limited-time 
availability and reference price information. Under low- 
involvement conditions, however, consumers do not per- 
ceive the redundancy between limited-time availability and 
sale announcements, and therefore both cues affect persua- 
sion. Thus, in general, we argue that reference pricing, 
limited-time availability, and sale cues have inherent differ- 
ences in terms of their information-processing requirements 
and outcomes. Overall, the three experiments help predict 
the effects of jointly using different value-oriented persua- 
sion techniques with reference price information in retail 
advertisements. | 

Notably, not only do our findings demonstrate strength 
and consistency, but they are also the result of field experi- 
ments. Intercepts for all three studies used engaged shop- 


pers who had other objectives in mind when they volun- 
teered to participate in the studies (without compensation). 
The outdoor environment in which testing took place was 
complete with noise and other distractions. Yet, despite 
these interferences, the effects were reliable across multiple 
studies. We suspect that one reason for the strength and 
consistency of the effects was the nature of the involvement 
manipulation. We believe that intercepting consumers in the 
process of shopping provides a stronger manipulation of 
involvement as personal product relevance than is typically 
found in laboratory manipulations of the construct. The 
average variance (across experiments) explained (112) in the 
two manipulation check measures was 49.8%. The average 
variance explained for the critical involvement x deal inter- 
action was 14.896 for price persuasion and 10.096 for store 
shopping intentions. These effect sizes are about two to 
three times the average sizes that Peterson, Albaum, and 
Beltramini (1985) report in their review of effect sizes. 


Managerial Perspectives 


When used by itself, reference price advertising results 
in favorable price perceptions and store shopping inten- 
tions only among consumers who are shopping for a par- 
ticular product. Reference pricing is an effective persuasion 
strategy among consumers who are of the greatest interest 
to retailers, namely, those with an immediate product inter- 
est. Consumers who lack product involvement do not 
process the implications of reference prices in retail adver- 
tisements to the degree necessary to grasp the value impli- 
cations. However, managers might still consider using refer- 
ence pricing with a markdown summary statement to appeal 
to a broader audience. We found that a markdown summa- 
rization results in increased price persuasion and shopping 
intentions for low-involvement shoppers but does not sig- 
nificantly detract from persuasion for high-involvement 
shoppers. The value in marketing to consumers without an 
immediate product interest lies in the likelihood of future 
rather than immediate sales being facilitated by associating 
the retailer with favorable pricing practices in the minds of 
consumers. 


Reference pricing paired with limited-time availability 
results in more favorable price perceptions and store shop- 
ping intentions than the use of either technique alone 
among consumers who are shopping for a product. Practi- 
tioner insight regarding this point is shown by the finding 
that reference pricing and limited-time availability are used 
together in retail advertisements more frequently than refer- 
ence pricing is used alone. Importantly, however, the three 
studies we report herein qualify this insight by providing 
consistent evidence across two products that both tech- 
niques independently contribute to price persuasion and 
shopping intentions only when consumers are actively 
shopping for a product. Nevertheless, as a general state- 
ment, the joint use of reference pricing and limited-time 
appeals might still be advisable because the effects of the 
two techniques appear to compensate for each other under 
different conditions of involvement. Specifically, reference 
price effects are (are not) significant when consumers are 
(are not) shopping for a product, but significant limited- 
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time effects are weaker (stronger) under those same 
conditions. 


Sale announcements in reference price advertisements 
appear to have no drawbacks. In developing this study, dis- 
cussions with retailers revealed a growing reluctance to use 
the term “sale” in advertisements because of a perception that 
it is overused. Markdowns in retail advertising are common- 
place, and consumers often discount them (Bobinski, Cox, 
and Cox 1996). Our content analysis indicates that the word 
“sale” is used with moderate frequency. In our field studies, 
however, a sale announcement (when used in a reference 
price advertisement) did not detract from persuasion when 
consumers were shopping for a product, but it contributed 
to persuasion when they were not. The potential utility of 
sale cues being presented to disinterested consumers should 
not be underestimated, especially because those cues do not 
appear to lower evaluations of consumers who are currently 
in the market. For example, sale cues might have the poten- 
tial to convert nonshoppers into shoppers. 


Directions for Further Research 


Further research needs to examine conditions other than 
involvement that have the potential to moderate the effects 
of reference pricing. Given their potential influence on con- 
sumer motivation, two constructs that might affect the pro- 
cessing of reference price information are the level of prior 
product knowledge (Alba and Hutchinson 1987) and deal 
proneness (Krishna 1994). Research should also explore 
variables that have the potential to magnify reference price 
effects. The joint effect of reference prices and limited-time 
availability under high-involvement conditions suggests 
that when consumers are shopping for a product, the value 
implications of reference prices are accentuated when they 
are limited in time. The utility of different reference price 
formats in converting nonshoppers into shoppers also needs 
to be examined. 

Testing whether our limited-time findings extend to 
other scarcity variations would develop the generality of our 
results. For example, it is an empirical question whether 
respondents would be equally sensitive to both limited-time 
and limited-quantity manipulations across involvement con- 
ditions. Thus, appropriate caution needs to be taken in terms 
of generalizing our results across different types of scarcity 
because we examine only limited-time in these studies. 

Further research needs to consider different methodolo- 
gies for investigating reference price advertising effects. 


Prior research has typically employed forced exposure to 
print advertisements and a passive manipulation of involve- 
ment within the context of laboratory studies using student 
participants. The current research employed free exposure 
to print advertisements and an active manipulation of 
involvement in a field setting in which consumers were 
engaged in shopping behavior. Although both methodolo- 
gies have merit, we selected the latter because of its closer 
correspondence to the processing realities of consumers in 
the marketplace, as Wells (1993) and Kover (1995) advo- 
cate. Nevertheless, we recognize that our manipulation of 
involvement also has potential limitations. For example, by 
intercepting shoppers, the study samples were composed of 
goal-directed consumers. Some commentators might argue 
that low-involvement consumers may have been overly dis- 
missive of product advertisements that did not reflect their 
current interests. In the leisurely environment of a home, 
low-involvement processors might pay more attention to 
such advertisements than they did in our studies. We 
acknowledge that there are different ways to manipulate 
involvement, and the nature of that manipulation may mani- 
fest itself in the results obtained. 

The possibility that demand characteristics explain the 
results appears remote given the nature of the interaction 
effects we obtained. Specifically, a demand explanation for 
our findings can account for neither the low-involvement 
results nor the pattern of effects observed for the high- 
involvement condition. Furthermore, there was no evidence 
of hypothesis guessing or compliant participant behavior in 
answers to postexperimental questions. The results varied 
consistently with theoretical expectations, which is also 
inconsistent with a demand explanation for the results. 

Although much of modern advertising uses multiple 
persuasive techniques in the same advertisement, academic 
research has often focused on examining individual tech- 
niques in isolation. A particular value of our studies is that 
they consider how different techniques are used in adver- 
tisements that feature reference pricing and how consumers 
should theoretically respond to them and then test the corre- 
sponding predictions in a field setting. Further research on 
reference price advertising should be expanded to explore 
the use of multiple persuasion tools commonly employed in 
retail advertising practice. We hope that our efforts will pro- 
vide a framework for future investigations that broaden the 
scope of reference price advertising research. 


————————— o 
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Customer Management 

It is no longer what you sell, but how you sell it— 
meaning rt is now vital to bridge the gap between 
sales and marketing. Customer Message Management 
does just that, eradicating the disconnect that for 
years has plagued the two departments 

Tim Riesterer 
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Competing for Customers and Capital illustrates a 
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The White Paper Marketing Handbook 
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marketers how to create effective white papers using 
a variety of media. 
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Farce to Force: Building Profitable 
E-Commerce Strategies 

Though it's a powerful driving force for commerca, 
early misuse of business principles can lead to bad 
business practices—as wall as perceptions that the 
Internet is not a viable sales channel Farce to Force 
sets the record straight with techniques and examples 
of successful e-commerce marketing 

Sarah McCue 
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Managing Global Accounts: Nine Critical 
Factors for a World-Class Program 

Drawing on their experiences, the authors detail nine 
critical success factors for designing and executing a 
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Noo] Capon, Dave Potter, and Fred Schindler 
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action plan to mprove sales organization results. 
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customers, grow market share, increase the productivity 
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Creating Do-it- Yourself Customers: 

How Great Customer Experiences Build 
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Self-checkout, self-custorzation, and other 
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to business only if customers do their share. This book 
helps readers recognize situations where high-tech or 
human innovation can bolster customer mvolvement 
Peter Hogebein and Roy Cammarano 
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